Reading List
updated
Reinforcement Pre-Training
Paper
•
2506.08007
•
Published
•
263
A Survey on Latent Reasoning
Paper
•
2507.06203
•
Published
•
93
Language Models are Few-Shot Learners
Paper
•
2005.14165
•
Published
•
18
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
Transformer
Paper
•
1910.10683
•
Published
•
15
Training language models to follow instructions with human feedback
Paper
•
2203.02155
•
Published
•
24
LLaMA: Open and Efficient Foundation Language Models
Paper
•
2302.13971
•
Published
•
20
Paper
•
2310.06825
•
Published
•
56
Gemma 2: Improving Open Language Models at a Practical Size
Paper
•
2408.00118
•
Published
•
78
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
Paper
•
2502.02737
•
Published
•
253
Paper
•
2504.07491
•
Published
•
133
Hierarchical Reasoning Model
Paper
•
2506.21734
•
Published
•
46
DeepSeek-V3 Technical Report
Paper
•
2412.19437
•
Published
•
74