Antigonish's Collections

SPEECH TO TEXT

updated Jan 25

  • Qwen3 ASR Demo 👀

    Running • Featured • 255

    Transcribe audio files to text with language detection


  • Whisper 📉

    Running on Zero • Featured • 2.72k

    Transcribe audio files and YouTube videos into text


  • openai/whisper-large-v3

    Automatic Speech Recognition • Updated Aug 12, 2024 • 5.69M • 5.46k

  • Qwen3 Omni Captioner Demo 🐠

    Running • 58

    Generate captions from audio


  • Qwen/Qwen3-Omni-30B-A3B-Captioner

    Any-to-Any • 32B • Updated Sep 22, 2025 • 8.67k • 204

  • nvidia/parakeet-tdt-0.6b-v3

    Automatic Speech Recognition • Updated Nov 27, 2025 • 164k • 698

  • LiquidAI/LFM2-Audio-1.5B

    Audio-to-Audio • Updated Jan 23 • 163 • 345

  • Whisper Web 🎤

    Running • Featured • 1.23k

    Transcribe spoken audio into written text


  • microsoft/VibeVoice-ASR

    Automatic Speech Recognition • 9B • Updated Jan 27 • 697k • 902