EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing Paper • 2512.06065 • Published Dec 5, 2025 • 28
VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory Paper • 2506.18903 • Published Jun 23, 2025 • 22
Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model Paper • 2503.16282 • Published Mar 20, 2025 • 5
Semantic Score Distillation Sampling for Compositional Text-to-3D Generation Paper • 2410.09009 • Published Oct 11, 2024 • 15
CLIP as RNN: Segment Countless Visual Concepts without Training Endeavor Paper • 2312.07661 • Published Dec 12, 2023 • 19