Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
MiniMaxAI 's Collections
MiniMax-M2.1
MiniMax-M2
VTP
MiniMax-M1
SynLogic
One-RL-to-See-Them-All
MiniMax-Speech
MiniMax-01

VTP

updated 25 days ago

Towards Scalable Pre-training of Visual Tokenizers for Generation

Upvote
39

  • MiniMaxAI/VTP-Small-f16d64

    Image Feature Extraction • 0.2B • Updated 26 days ago • 17k • 11

  • MiniMaxAI/VTP-Base-f16d64

    Image Feature Extraction • 0.3B • Updated 26 days ago • 15.9k • 18

  • MiniMaxAI/VTP-Large-f16d64

    Image Feature Extraction • 0.7B • Updated 26 days ago • 21.9k • 13

  • Towards Scalable Pre-training of Visual Tokenizers for Generation

    Paper • 2512.13687 • Published 26 days ago • 100
Upvote
39
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs