Mistral Just Released a Free Voice AI That Claims to Beat ElevenLabs

Voxtral TTS is open-weight, runs on your phone, clones voices in 3 seconds, and costs a fraction of the competition. ElevenLabs should be paying attention.

ElevenLabs built a business on being the best voice AI money can buy. Mistral just released something that might be better. For free.

On Thursday, the French AI lab dropped Voxtral TTS, its first text-to-speech model and what it’s calling the first frontier-quality open-weight voice model built specifically for enterprise use. The weights are public, you can run it yourself, and you never have to send a single audio frame to a third party. That’s a fundamentally different deal than anything ElevenLabs, OpenAI, or Deepgram are offering.

Mistral announced it with demo audio and a benchmark chart that makes the case better than any press release could:

🔊Introducing Voxtral TTS: our new frontier open-weight model for natural, expressive, and ultra-fast text-to-speech

🎭Realistic, emotionally expressive speech.
🌍Supports 9 languages and accurately captures diverse dialects.
⚡Very low latency for time-to-first-audio.
🔄Easily… pic.twitter.com/Q2mdo8UBVo
— Mistral AI (@MistralAI) March 26, 2026

What Voxtral TTS Actually Does

The model is 4 billion parameters, which sounds big until you realize it can run on a smartphone, a laptop, or even a smartwatch. It streams audio with around 70ms latency, meaning near-instant response for voice agents and automation workflows and real-time applications. It supports 9 languages out of the box: English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic.

The party trick is voice cloning. Voxtral TTS can adapt to a new voice using as little as 3 seconds of reference audio. Not a long recording session, not a fine-tuning job. Three seconds, and it captures accent, inflection, rhythm, and even the natural disfluencies that make a voice sound human rather than robotic.

In blind human evaluations, Mistral says Voxtral TTS was preferred over ElevenLabs Flash v2.5 at a 68% win rate on voice cloning tasks, and performed at parity with ElevenLabs v3 on flagship voice evaluations. Those are Mistral’s own numbers so take them with appropriate skepticism, but the independent developer community that’s been testing it this week seems to back the claim up.

You can try it yourself right now without downloading anything:

Experiment with Voxtral TTS directly in the Mistral Studio playground. Select one of the Mistral voices or record your own. pic.twitter.com/pL1YZGSF8x
— Mistral AI (@MistralAI) March 26, 2026

Why Open-Weight Changes Everything

Every major competitor in voice AI runs the same playbook. You hit their API, you pay per character or per second, and your audio passes through their servers. That’s fine until you start thinking about privacy, cost at scale, or what happens when their pricing changes.

Voxtral TTS flips that. You download the model, you run it on your own infrastructure, and you own the stack. For enterprises handling sensitive conversations, healthcare applications, or anything where audio data can’t leave your servers, that’s not just convenient. It’s the only option that works.

Mistral’s VP of science operations Pierre Stock put it plainly: “Our customers asked for a speech model. So we built a small-sized speech model that can fit on a smartwatch, a smartphone, a laptop, or other edge devices. The cost of it is a fraction of anything else on the market.”

The Bigger Picture

Voxtral TTS isn’t a random product launch. It’s the final piece of an audio pipeline Mistral has been quietly assembling for the past year. Voxtral Transcribe handles speech-to-text. Mistral’s language models handle reasoning. Forge lets enterprises fine-tune on their own data. Now Voxtral TTS handles the output layer. The whole stack is theirs, and the whole stack runs on your hardware if you want it to.

This matters for anyone building voice agents and customer support automation, accessibility tools, real-time translation, or any workflow where AI needs to talk back. The voice AI market crossed $22 billion globally this year and is projected to nearly double by 2034. Mistral is planting a flag in it with open weights and a genuinely competitive model.

If you’re already using tools like Make.com or n8n to build automations, Voxtral TTS is worth watching closely. A self-hostable voice layer that costs nothing to run changes what’s possible in no-code and low-code voice workflows. The things people are going to build on top of this over the next six to twelve months are going to get interesting fast.

For regular people the immediate impact is probably limited. Voxtral TTS is aimed at developers and enterprises right now, not consumer apps. But when a frontier-quality voice model is free and self-hostable, the tools built on top of it tend to move quickly. Check the VU tools directory for voice AI tools worth using today while Voxtral matures.

ElevenLabs still has brand recognition, a polished consumer product, and a head start. But being the best paid option in a market where a free open-weight alternative just showed up claiming to win blind taste tests is a harder position than it was last week.

AI Tools

Featured

OpenClaw: The Complete 2026 Deep Dive (Install, Cost, Hardware, Real Reviews & More)

There’s Now an App Store for AI Agents & It Already Has a Malware Problem.

Latest

10 GitHub Repos That Turn Claude From a Chatbot Into a Productivity Machine

Shopify Just Made 5.6 Million Stores Shoppable Inside ChatGPT & Yours Is Probably Already Live.

Perplexity Was Sharing Your Conversations With Meta and Google the Entire Time

Alibaba Just Killed the Thing That Made Qwen Matter

Trending →

CREAO Wants to Be the AI Agent You Don’t Have to Install

Four Companies Are Racing to Be Your Doctor’s Replacement & Here’s What They’re Actually Building.

America Is Using AI More Than Ever. It Also Thinks AI Is Going to Ruin Everything.

Mistral Just Released a Free Voice AI That Claims to Beat ElevenLabs.

Anthropic Accidentally Leaked Their Next AI Model. It’s Called Claude Mythos and It Scares Them.

Mistral Just Released a Free Voice AI That Claims to Beat ElevenLabs.

You Might Also Like ↷

Anthropic Accidentally Leaked Their Next AI Model. It’s Called Claude Mythos and It Scares Them.

The Guy Who Thinks ChatGPT Is a Dead End Just Raised $1 Billion to Prove It

A Meta Engineer Let an AI Agent Loose on an Internal Forum. It Went Rogue. Here’s What Actually Happened.

$120 Billion Raised. $25 Billion Burned. OpenAI Is Going Public Anyway.