Transolver: A Fast Transformer Solver for PDEs on General Geometries Paper • 2402.02366 • Published Feb 4, 2024 • 1
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 131 • 19
ImLoc: Revisiting Visual Localization with Image-based Representation Paper • 2601.04185 • Published 7 days ago • 2 • 1
dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model Paper • 2512.02498 • Published Dec 2, 2025 • 1
MMFormalizer: Multimodal Autoformalization in the Wild Paper • 2601.03017 • Published 8 days ago • 99 • 6
RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes Paper • 2601.05249 • Published 6 days ago • 43 • 3
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting Paper • 2601.02151 • Published 9 days ago • 93 • 8
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published 13 days ago • 114 • 4
Effort: Efficient Orthogonal Modeling for Generalizable AI-Generated Image Detection Paper • 2411.15633 • Published Nov 23, 2024 • 1
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published 8 days ago • 95 • 9
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation Paper • 2512.24271 • Published 15 days ago • 58 • 6
Can We Trust AI Explanations? Evidence of Systematic Underreporting in Chain-of-Thought Reasoning Paper • 2601.00830 • Published 21 days ago • 2 • 3
Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits Paper • 2512.20578 • Published 22 days ago • 73 • 4
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization Paper • 2512.24615 • Published 15 days ago • 111 • 5
Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs Paper • 2510.01954 • Published Oct 2, 2025 • 13 • 3
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation Paper • 2601.00664 • Published 12 days ago • 51 • 3