StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors Paper • 2512.16915 • Published 20 days ago • 37
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 20 days ago • 83
DINOv2 Collection DINOv2: foundation models producing robust visual features suitable for image-level and pixel-level visual tasks - https://arxiv.org/abs/2304.07193 • 5 items • Updated Aug 13, 2025 • 30
DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21, 2025 • 445
VertexRegen: Mesh Generation with Continuous Level of Detail Paper • 2508.09062 • Published Aug 12, 2025 • 38
4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time Paper • 2506.18890 • Published Jun 23, 2025 • 6
Point-MoE: Towards Cross-Domain Generalization in 3D Semantic Segmentation via Mixture-of-Experts Paper • 2505.23926 • Published May 29, 2025 • 5
view article Article Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚 Aug 26, 2024 • 82
Frame In-N-Out: Unbounded Controllable Image-to-Video Generation Paper • 2505.21491 • Published May 27, 2025 • 16
Learning 3D Representations from Procedural 3D Programs Paper • 2411.17467 • Published Nov 25, 2024 • 9
RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning Paper • 2409.14674 • Published Sep 23, 2024 • 42
SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Task Planning Paper • 2307.06135 • Published Jul 12, 2023 • 14
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination Paper • 2406.05132 • Published Jun 7, 2024 • 30