Sublime
An inspiration engine for ideas
All Articles – Prakruti
prakrutimaniar.comDeepSeek_R1
DeepSeek-R1 introduces two large language models, DeepSeek-R1-Zero and DeepSeek-R1, utilizing reinforcement learning for enhanced reasoning capabilities without supervised fine-tuning, along with distillation techniques for smaller models.
LinkBimal Chopra
@choprab
Tarun Bagri
@taruun
Abhinav Rai
@abhinav
goutham n
@goutham