Sublime

An inspiration engine for ideas

All Articles – Prakruti

prakrutimaniar.com

tevaplanter

terraplanter.com

DeepSeek_R1

DeepSeek-R1 improves the reasoning capabilities of large language models through reinforcement learning, outperforming prior models while addressing poor readability and language mixing, and distilling its reasoning into smaller models.


DeepSeek_R1

The DeepSeek-R1 paper introduces two large language models, DeepSeek-R1-Zero and DeepSeek-R1, that use reinforcement learning to improve reasoning (DeepSeek-R1-Zero without any supervised fine-tuning), along with smaller models distilled from DeepSeek-R1.


Commons

joro.tech