Sublime
An inspiration engine for ideas
Edward Osborn
@edwardosborn
Geoff Mitchell
@geoffmitch
Autonomous agents
Darren LI and • 15 cards
Paul Ross
@paulross
Phil Smith
@philrobertovich
Leadership
RG Logan • 2 cards
Andrew Martin
@andrwmrtn
DeepSeek_R1
DeepSeek-R1 introduces two large language models, DeepSeek-R1-Zero and DeepSeek-R1, utilizing reinforcement learning for enhanced reasoning capabilities without supervised fine-tuning, along with distillation techniques for smaller models.
Link