Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper β’ 2602.08354 β’ Published 16 days ago β’ 171
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger Paper β’ 2602.08222 β’ Published 16 days ago β’ 268
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper β’ 2602.05400 β’ Published 20 days ago β’ 331
Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents Paper β’ 2602.16855 β’ Published 10 days ago β’ 42
SLA2: Sparse-Linear Attention with Learnable Routing and QAT Paper β’ 2602.12675 β’ Published 12 days ago β’ 51
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories Paper β’ 2602.10809 β’ Published 14 days ago β’ 51
Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs Paper β’ 2602.10388 β’ Published 14 days ago β’ 228
Agent Banana: High-Fidelity Image Editing with Agentic Thinking and Tooling Paper β’ 2602.09084 β’ Published 15 days ago β’ 27
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper β’ 2602.10604 β’ Published 14 days ago β’ 184
AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents Paper β’ 2602.06855 β’ Published 18 days ago β’ 73
Judging What We Cannot Solve: A Consequence-Based Approach for Oracle-Free Evaluation of Research-Level Math Paper β’ 2602.06291 β’ Published 19 days ago β’ 23
MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents Paper β’ 2602.02474 β’ Published 22 days ago β’ 56
MARS: Modular Agent with Reflective Search for Automated AI Research Paper β’ 2602.02660 β’ Published 22 days ago β’ 63
AgentIF-OneDay: A Task-level Instruction-Following Benchmark for General AI Agents in Daily Scenarios Paper β’ 2601.20613 β’ Published 28 days ago β’ 10
PaperBanana: Automating Academic Illustration for AI Scientists Paper β’ 2601.23265 β’ Published 25 days ago β’ 199