markendo/llava-extract-from-scratch-qwen3-0.6B Image-Text-to-Text • 1.0B • Updated about 1 month ago • 7
markendo/llava-extract-from-scratch-qwen3-1.7B Image-Text-to-Text • 2B • Updated about 1 month ago • 9
Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models Paper • 2511.17487 • Published Nov 21 • 9
Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration Paper • 2412.13180 • Published Dec 17, 2024 • 13
Extract+Think Collection Data and Models for Extract+Think as part of Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models • 5 items • Updated Nov 20