MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation Paper • 2601.06874 • Published Jan 11 • 12
M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning Paper • 2507.08306 • Published Jul 11, 2025