GitHub - jmorganca/ollama: Get up and running with Llama 2 and other large language models locally
Large language models (LLMs) made easy. EasyLM is a one-stop solution for pre-training, finetuning, evaluating, and serving LLMs in JAX/Flax. EasyLM can scale up LLM training to hundreds of TPU/GPU accelerators by leveraging JAX's pjit functionality.
Building on top of Hugging Face's transformers and datasets, this repo provides an easy to use and easy...
young-geng • GitHub - young-geng/EasyLM: Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
2-5x faster 50% less memory local LLM finetuning
- Manual autograd engine - hand derived backprop steps.
- 2x to 5x faster than QLoRA. 50% less memory usage.
- All kernels written in OpenAI's Triton language.
- 0% loss in accuracy - no approximation methods - all exact.
- No change of hardware necessary. Supports NVIDIA GPUs since 2018+. Minimum CUDA Compute Cap
unslothai • GitHub - unslothai/unsloth: 5X faster 50% less memory LLM finetuning
- LLM Pro/Serious Use Comparison/Test: From 7B to 70B vs. ChatGPT! Winner: Synthia-70B-v1.2b
- LLM Chat/RP Comparison/Test: Dolphin-Mistral, Mistral-OpenOrca, Synthia 7B Winner: Mistral-7B-OpenOrca
- LLM Chat/RP Comparison/Test: Mistral 7B Base + Instruct
- LLM Chat/RP Comparison/Test (Euryale, FashionGPT, MXLewd, Synthia, Xwin) Winner: Xwin-LM-70B-V0.1
- New Mo
r/LocalLLaMA - Reddit
OpenAI Whisper - llamafile
Whisperfile is a high-performance implementation of OpenAI's Whisper created by Mozilla Ocho as part of the llamafile project, based on the whisper.cpp software written by Georgi Gerganov, et al.
Model creator: OpenAI
Original models: openai/whisper-release
Origin of quantized weights: ggerganov/whisper.cpp
The model is packaged in...