netzkontrast 's Collections LLMs
updated
Dolphin: Closed-loop Open-ended Auto-research through Thinking,
Practice, and Feedback
Paper
• 2501.03916
• Published
• 16
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta
Chain-of-Though
Paper
• 2501.04682
• Published
• 99
Agent Laboratory: Using LLM Agents as Research Assistants
Paper
• 2501.04227
• Published
• 95
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper
• 2501.05366
• Published
• 102
Entropy-Guided Attention for Private LLMs
Paper
• 2501.03489
• Published
• 14
Enabling Scalable Oversight via Self-Evolving Critic
Paper
• 2501.05727
• Published
• 72
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains
Paper
• 2501.05707
• Published
• 20
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
Paper
• 2501.06842
• Published
• 16
Diving into Self-Evolving Training for Multimodal Reasoning
Paper
• 2412.17451
• Published
• 42
Fourier Position Embedding: Enhancing Attention's Periodic Extension for
Length Generalization
Paper
• 2412.17739
• Published
• 41
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing
Paper
• 2412.14711
• Published
• 16
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
Paper
• 2412.21187
• Published
• 40
ProgCo: Program Helps Self-Correction of Large Language Models
Paper
• 2501.01264
• Published
• 26
Tensor Product Attention Is All You Need
Paper
• 2501.06425
• Published
• 90
Transformer^2: Self-adaptive LLMs
Paper
• 2501.06252
• Published
• 55
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper
• 2501.08313
• Published
• 300
Evolving Deeper LLM Thinking
Paper
• 2501.09891
• Published
• 115
Agentic Context Engineering: Evolving Contexts for Self-Improving
Language Models
Paper
• 2510.04618
• Published
• 129
Sculptor: Empowering LLMs with Cognitive Agency via Active Context
Management
Paper
• 2508.04664
• Published
• 13
AgentFold: Long-Horizon Web Agents with Proactive Context Management
Paper
• 2510.24699
• Published
• 71
Memory as Action: Autonomous Context Curation for Long-Horizon Agentic
Tasks
Paper
• 2510.12635
• Published
• 17