Bangers 2025
ATLAS: Adaptive Transfer Scaling Laws for Multilingual Pretraining,
Finetuning, and Decoding the Curse of Multilinguality
Paper
• 2510.22037
• Published
• 21
Less is More: Recursive Reasoning with Tiny Networks
Paper
• 2510.04871
• Published
• 509
The Dragon Hatchling: The Missing Link between the Transformer and
Models of the Brain
Paper
• 2509.26507
• Published
• 547
Scaling Language-Centric Omnimodal Representation Learning
Paper
• 2510.11693
• Published
• 104
MemMamba: Rethinking Memory Patterns in State Space Model
Paper
• 2510.03279
• Published
• 73
Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement
Learning
Paper
• 2510.03259
• Published
• 57
Large Reasoning Models Learn Better Alignment from Flawed Thinking
Paper
• 2510.00938
• Published
• 59
Reasoning with Sampling: Your Base Model is Smarter Than You Think
Paper
• 2510.14901
• Published
• 48
Paper
• 2510.18212
• Published
• 36
Huxley-Gödel Machine: Human-Level Coding Agent Development by an
Approximation of the Optimal Self-Improving Machine
Paper
• 2510.21614
• Published
• 22