GitHub - SeldonIO/MLServer: An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

GitHub - SeldonIO/MLServer: An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

github.com
Thumbnail of GitHub - SeldonIO/MLServer: An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

predibase β€’ GitHub - predibase/lorax: Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

traceloop β€’ GitHub - traceloop/openllmetry: Open-source observability for your LLM application, based on OpenTelemetry

young-geng β€’ GitHub - young-geng/EasyLM: Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.