8 24 326

Gaurang Bharti PRO

gbharti

https://gaurangbharti.netlify.app/

AI & ML interests

GPTs, Computer Vision, NLP

Recent Activity

liked a model 17 days ago

apple/Sharp

liked a model 17 days ago

DiffSynth-Studio/Qwen-Image-i2L

liked a model 25 days ago

depth-anything/DA3-BASE

View all activity

Organizations

liked 2 models 17 days ago

apple/Sharp

Image-to-3D • Updated 17 days ago • 5.38k • 296

DiffSynth-Studio/Qwen-Image-i2L

Updated 19 days ago • 241

liked a model 25 days ago

depth-anything/DA3-BASE

Image-to-3D • 0.1B • Updated Nov 15, 2025 • 18.3k • 21

New activity in gbharti/finance-alpaca about 1 month ago

Add LICENSE file

🤝 1

#6 opened about 1 month ago by

jewittje

liked a Space about 2 months ago

Depth Anything 3

🏢

337

Create detailed depth maps from images using Depth Anything 3

liked a dataset 2 months ago

nvidia/PhysicalAI-Robotics-GR00T-X-Embodiment-Sim

Updated 26 days ago • 831k • 180

upvoted a paper 3 months ago

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

Paper • 2510.10689 • Published Oct 12, 2025 • 46

New activity in gbharti/finance-alpaca 3 months ago

good

#5 opened 3 months ago by

Jackrong

liked a Space 5 months ago

OmniAvatar

🐨

272

Generate podcast and tiktok style video avatars

liked a dataset 6 months ago

Vchitect/ShotBench

Viewer • Updated Jul 1, 2025 • 3.57k • 185 • 11

liked a model 6 months ago

Vchitect/ShotVL-7B

Image-Text-to-Text • 8B • Updated Sep 19, 2025 • 965 • 15

upvoted a paper 6 months ago

VideoPrism: A Foundational Visual Encoder for Video Understanding

Paper • 2402.13217 • Published Feb 20, 2024 • 38

liked a model 6 months ago

google/videoprism-base-f16r288

Video Classification • Updated Jul 29, 2025 • 151k • 92

upvoted a paper 6 months ago

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Paper • 2506.18898 • Published Jun 23, 2025 • 33

liked a model 7 months ago

ByteDance/LatentSync-1.6

Updated Jun 12, 2025 • 21.7k • 55

liked a dataset 7 months ago

opencompass/MMBench-Video

Preview • Updated Oct 9, 2024 • 374 • 9

liked a Space 8 months ago

Keysync Demo

📈

Generate synchronized video from audio and video inputs

liked a model 8 months ago

chancharikm/qwen2.5-vl-7b-cam-motion

Video-Text-to-Text • 8B • Updated Sep 19, 2025 • 222 • 17

upvoted 2 papers 8 months ago

Towards Understanding Camera Motions in Any Video

Paper • 2504.15376 • Published Apr 21, 2025 • 155

NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks

Paper • 2504.19854 • Published Apr 28, 2025 • 7