No code yet
updated
Scaling Latent Reasoning via Looped Language Models
Paper
•
2510.25741
•
Published
•
221
Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models
Paper
•
2511.23319
•
Published
•
22
Focused Chain-of-Thought: Efficient LLM Reasoning via Structured Input Information
Paper
•
2511.22176
•
Published
•
4
FedRE: A Representation Entanglement Framework for Model-Heterogeneous Federated Learning
Paper
•
2511.22265
•
Published
•
1
What does it mean to understand language?
Paper
•
2511.19757
•
Published
•
9
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning
Paper
•
2511.22570
•
Published
•
79
PromptBridge: Cross-Model Prompt Transfer for Large Language Models
Paper
•
2512.01420
•
Published
•
9
ChronosObserver: Taming 4D World with Hyperspace Diffusion Sampling
Paper
•
2512.01481
•
Published
•
2
Guided Self-Evolving LLMs with Minimal Human Supervision
Paper
•
2512.02472
•
Published
•
50
Glance: Accelerating Diffusion Models with 1 Sample
Paper
•
2512.02899
•
Published
•
28
C^2DLM: Causal Concept-Guided Diffusion Large Language Models
Paper
•
2511.22146
•
Published
•
3
PretrainZero: Reinforcement Active Pretraining
Paper
•
2512.03442
•
Published
•
46
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains
Paper
•
2501.05707
•
Published
•
20
SelfGoal: Your Language Agents Already Know How to Achieve High-level
Goals
Paper
•
2406.04784
•
Published
•
2
Self-Improving Transformers Overcome Easy-to-Hard and Length
Generalization Challenges
Paper
•
2502.01612
•
Published
•
1
Bootstrapping Task Spaces for Self-Improvement
Paper
•
2509.04575
•
Published
•
5
Mind the Gap: Examining the Self-Improvement Capabilities of Large
Language Models
Paper
•
2412.02674
•
Published
WebEvolver: Enhancing Web Agent Self-Improvement with Coevolving World
Model
Paper
•
2504.21024
•
Published
•
2
Self-Improvement in Language Models: The Sharpening Mechanism
Paper
•
2412.01951
•
Published
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Paper
•
2312.10003
•
Published
•
44
Self Rewarding Self Improving
Paper
•
2505.08827
•
Published
•
1
B-STaR: Monitoring and Balancing Exploration and Exploitation in
Self-Taught Reasoners
Paper
•
2412.17256
•
Published
•
47
Self-Aware Feedback-Based Self-Learning in Large-Scale Conversational AI
Paper
•
2205.00029
•
Published
SPRIGHT: A Fast and Robust Framework for Sparse Walsh-Hadamard Transform
Paper
•
1508.06336
•
Published
OLMoE: Open Mixture-of-Experts Language Models
Paper
•
2409.02060
•
Published
•
78
Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in
Code Generation
Paper
•
2405.20092
•
Published
•
1
DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems
Paper
•
2512.06749
•
Published
•
26
DEER: Draft with Diffusion, Verify with Autoregressive Models
Paper
•
2512.15176
•
Published
•
41
HyperVL: An Efficient and Dynamic Multimodal Large Language Model for Edge Devices
Paper
•
2512.14052
•
Published
•
39
Robust and Calibrated Detection of Authentic Multimedia Content
Paper
•
2512.15182
•
Published
•
15
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
Paper
•
2512.15687
•
Published
•
17
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper
•
2512.15745
•
Published
•
73