Sublime

DeepSeek_R1

DeepSeek-R1 introduces two large language models, DeepSeek-R1-Zero and DeepSeek-R1, utilizing reinforcement learning for enhanced reasoning capabilities without supervised fine-tuning, along with distillation techniques for smaller models.

Link

Mark Perry • Just a moment...

F

Feri Kurniawan

@ferikurniawan

E

Euwyn Goh

@euwyngoh

b

beriwan

@beriwan

I

Ishan

@ishan

S

Sumanth

@s7manth

Ideation. Innovation. Transformation.

E

Ewan Barr

@ewan

R

Robert Tot Bagi

@rtotbagi