Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models Paper • 2601.19834 • Published Jan 27, 2026 • 25
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper • 2511.09057 • Published Nov 12, 2025 • 81
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published Nov 13, 2025 • 99
Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs Paper • 2510.13795 • Published Oct 15, 2025 • 59
RLVR-World: Training World Models with Reinforcement Learning Paper • 2505.13934 • Published May 20, 2025 • 16
Vid2World: Crafting Video Diffusion Models to Interactive World Models Paper • 2505.14357 • Published May 20, 2025 • 27