InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision Paper • 2512.01342 • Published Dec 1, 2025 • 15
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions Paper • 2511.03334 • Published Nov 5, 2025 • 52
OpenGVLab/VideoChat-Flash-Qwen2_5-7B_InternVideo2-1B Video-Text-to-Text • 9B • Updated May 16, 2025 • 513 • 5
OpenGVLab/VideoChat-Flash-Qwen2_5-2B_res448 Video-Text-to-Text • 2B • Updated Mar 16, 2025 • 1.7k • 26