FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs Paper • 2601.13836 • Published 2 days ago • 28
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs Paper • 2601.13836 • Published 2 days ago • 28
FutureOmni: Evaluating Future Forecasting from Omni-Modal Context for Multimodal LLMs Paper • 2601.13836 • Published 2 days ago • 28
LLM can Achieve Self-Regulation via Hyperparameter Aware Generation Paper • 2402.11251 • Published Feb 17, 2024 • 1
Hint-before-Solving Prompting: Guiding LLMs to Effectively Utilize Encoded Knowledge Paper • 2402.14310 • Published Feb 22, 2024
CorefDiffs: Co-referential and Differential Knowledge Flow in Document Grounded Conversations Paper • 2210.02223 • Published Oct 5, 2022
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model Paper • 2406.12030 • Published Jun 17, 2024
Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices Paper • 2503.06063 • Published Mar 8, 2025
VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models Paper • 2504.13122 • Published Apr 17, 2025 • 20
LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding Paper • 2505.16983 • Published May 22, 2025 • 1
LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models Paper • 2510.13626 • Published Oct 15, 2025 • 45
RoboOmni: Proactive Robot Manipulation in Omni-modal Context Paper • 2510.23763 • Published Oct 27, 2025 • 54
MCM-DPO: Multifaceted Cross-Modal Direct Preference Optimization for Alt-text Generation Paper • 2510.00647 • Published Oct 1, 2025
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents Paper • 2512.23343 • Published 24 days ago • 27
XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation Paper • 2104.07412 • Published Apr 15, 2021
MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization Paper • 2601.01554 • Published 18 days ago • 53
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published Nov 14, 2025 • 185
RoboOmni Collection Proactive Robot Manipulation in Omni-modal Context • 8 items • Updated Oct 29, 2025 • 4