GitHub - sqrkl/lm-evaluation-harness: A framework for few-shot evaluation of language models.

GitHub - arthur-ai/bench: A tool for evaluating LLMs

researchgate.net (page title not captured)

GitHub - confident-ai/deepeval: The LLM Evaluation Framework
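
To make the first link concrete, here is a minimal sketch of a few-shot run with lm-evaluation-harness via its Python entry point, `lm_eval.simple_evaluate`. The model, task, shot count, and batch size below are illustrative placeholders, not values taken from these links.

```python
# Minimal sketch: few-shot evaluation with lm-evaluation-harness.
# Assumes `pip install lm-eval`; model and task names are placeholders.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",                    # Hugging Face backend
    model_args="pretrained=gpt2",  # placeholder model
    tasks=["hellaswag"],           # placeholder task
    num_fewshot=5,                 # the few-shot setting
    batch_size=8,
)
print(results["results"])          # per-task metric dictionary
```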
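
And for the last link, a minimal sketch of a single test case in deepeval, assuming its `LLMTestCase` and `evaluate` API; the input/output strings and the 0.7 threshold are assumptions chosen for illustration.

```python
# Minimal sketch: one metric-based test case in confident-ai/deepeval.
# Assumes `pip install deepeval` and a judge-model API key configured;
# the strings and threshold below are illustrative assumptions.
from deepeval import evaluate
from deepeval.metrics import AnswerRelevancyMetric
from deepeval.test_case import LLMTestCase

test_case = LLMTestCase(
    input="What does lm-evaluation-harness do?",
    actual_output="It runs few-shot benchmark tasks against language models.",
)
metric = AnswerRelevancyMetric(threshold=0.7)  # pass if relevancy >= 0.7
evaluate(test_cases=[test_case], metrics=[metric])
```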