The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 127 • 11
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published 4 days ago • 116 • 5
VL-JEPA: Joint Embedding Predictive Architecture for Vision-language Paper • 2512.10942 • Published Dec 11, 2025 • 45 • 6
Urban Socio-Semantic Segmentation with Vision-Language Reasoning Paper • 2601.10477 • Published 2 days ago • 143 • 3
CloneMem: Benchmarking Long-Term Memory for AI Clones Paper • 2601.07023 • Published 6 days ago • 2 • 1
An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models Paper • 2408.00724 • Published Aug 1, 2024 • 2 • 1
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published 3 days ago • 106 • 3
Controlled Self-Evolution for Algorithmic Code Optimization Paper • 2601.07348 • Published 5 days ago • 105 • 4
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 13 days ago • 41 • 3
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Paper • 2601.05432 • Published 9 days ago • 158 • 6
Breaking the Sorting Barrier for Directed Single-Source Shortest Paths Paper • 2504.17033 • Published Apr 23, 2025 • 1
Transolver: A Fast Transformer Solver for PDEs on General Geometries Paper • 2402.02366 • Published Feb 4, 2024 • 1
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 135 • 19
ImLoc: Revisiting Visual Localization with Image-based Representation Paper • 2601.04185 • Published 10 days ago • 2 • 1
dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model Paper • 2512.02498 • Published Dec 2, 2025 • 1
MMFormalizer: Multimodal Autoformalization in the Wild Paper • 2601.03017 • Published 11 days ago • 102 • 7