Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
fixie-ai 's Collections
Ultravox v0.7
Ultravox v0.6
UltraVAD
Ultravox v0.5
Ultravox v0.4.1

Ultravox v0.5

updated Sep 12, 2025

Ultravox is a multimodal Speech LLM built around different pretrained LLMs (frozen) and the whisper-large-v3-turbo (fine-tuned) backbone.

Upvote
19

  • fixie-ai/ultravox-v0_5-llama-3_3-70b

    Audio-Text-to-Text • 0.7B • Updated Sep 12, 2025 • 147 • 32

  • fixie-ai/ultravox-v0_5-llama-3_1-8b

    Audio-Text-to-Text • 0.7B • Updated May 6, 2025 • 2.51k • 34

  • fixie-ai/ultravox-v0_5-llama-3_2-1b

    Audio-Text-to-Text • 0.7B • Updated Nov 27, 2025 • 303k • 67

  • fixie-ai/ultravox-v0_5-glm-4_5-355b

    Audio-Text-to-Text • 0.7B • Updated Sep 12, 2025 • 672 • 2
Upvote
19
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs