kevpan
kevpan
AI & ML interests
None yet
Organizations
None yet
vlm
-
Scaling Spatial Intelligence with Multimodal Foundation Models
Paper • 2511.13719 • Published • 47 -
Thinking with Images via Self-Calling Agent
Paper • 2512.08511 • Published • 23 -
DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models
Paper • 2512.15713 • Published • 17 -
In Pursuit of Pixel Supervision for Visual Pre-training
Paper • 2512.15715 • Published • 11
diffusion
vlm
-
Scaling Spatial Intelligence with Multimodal Foundation Models
Paper • 2511.13719 • Published • 47 -
Thinking with Images via Self-Calling Agent
Paper • 2512.08511 • Published • 23 -
DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models
Paper • 2512.15713 • Published • 17 -
In Pursuit of Pixel Supervision for Visual Pre-training
Paper • 2512.15715 • Published • 11
models 0
None public yet
datasets 0
None public yet