acvlab/FantasyPortrait-Multi-Expr
Computer Vision; Multi-modality; Generative Models; Structure from Motion; Multi-view Stereo; Localization and Mapping; Augmented Reality; Virtual Reality.
ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning
FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-Language Navigation