Sublime

An inspiration engine for ideas

DeepSeek_R1

DeepSeek-R1 introduces two large language models, DeepSeek-R1-Zero and DeepSeek-R1, utilizing reinforcement learning for enhanced reasoning capabilities without supervised fine-tuning, along with distillation techniques for smaller models.

Link
F

Feri Kurniawan

@ferikurniawan

E

Euwyn Goh

@euwyngoh

b

beriwan

@beriwan

I

Ishan

@ishan

S

Sumanth

@s7manth

E

Ewan Barr

@ewan

R

Robert Tot Bagi

@rtotbagi