DeepSpeed-FastGen

sgl-project GitHub - sgl-project/sglang: SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Understanding the Cost of Generative AI Models in Production

GitHub - BrunoScaglione/langtest: Deliver safe & effective language models