Multilingual V2 Text to Speech
The most advanced emotionally-aware speech synthesis model. Natural, lifelike speech with high emotional range across 29 languages.
Audio Generator
Cost: -
Audio Preview
No audio generated yet
Key Features
Emotionally-Aware
High emotional range with contextual understanding for lifelike speech
29 Languages
Support for 29 languages with consistent voice quality across all
Premium Voice Library
20+ default voices plus access to 10,000+ community voices
Voice Customization
Adjust stability, similarity boost, style exaggeration, and speed
Long-Form Stable
Most stable model for long-form content up to 10,000 characters
Commercial License
Use generated audio for any commercial purpose
How to Use
Generate multilingual speech in three simple steps
Enter Text
Type or paste your text in any of the 29 supported languages (up to 10,000 characters)
Choose Voice
Select from 21 premium voices and customize parameters
Generate Audio
Click generate and download your natural-sounding audio
Enter Text
Type or paste your text in any of the 29 supported languages (up to 10,000 characters)
Choose Voice
Select from 21 premium voices and customize parameters
Generate Audio
Click generate and download your natural-sounding audio
Technical Specifications
Use Cases
Global Content
Create audio content for international audiences in their native languages
Localization
Localize videos, apps, and products with natural-sounding voices
Language Learning
Generate native speaker audio for language learning materials
Multilingual Websites
Add audio versions of content in multiple languages
International Marketing
Create marketing materials for global campaigns
Accessibility
Provide audio alternatives for multilingual audiences
Frequently Asked Questions
Find answers to common questions about this model
ElevenLabs Multilingual v2 is the most advanced emotionally-aware speech synthesis model, producing natural, lifelike speech with high emotional range and contextual understanding across 29 languages.
29 languages including English, Chinese, Japanese, Korean, Spanish, French, German, Italian, Portuguese, Hindi, Arabic, Dutch, Polish, Swedish, Turkish, Indonesian, and more.
20+ default voices including Aria, Roger, Sarah, Laura, Charlie, George, Callum, River, Liam, Charlotte, Alice, Matilda, Will, Jessica, Eric, Chris, Brian, Daniel, Lily, and Bill. Plus access to 10,000+ community voices.
Yes, the model automatically detects written languages and can handle code-switching, transitioning naturally between languages while maintaining voice consistency.
Adjust stability (0-1, controls consistency), similarity boost (0-1, voice matching), style exaggeration (0-1, emotional intensity), and speed (0.7x to 1.2x).
You can convert up to 10,000 characters per request, producing approximately 10 minutes of audio. This model is most stable for long-form content.
Yes, all audio generated through our platform can be used for commercial purposes including videos, ads, apps, and products.
Multilingual v2 offers superior emotional range and nuanced expression, ideal for voiceovers and audiobooks. Turbo v2.5 prioritizes speed with lower latency, better for real-time applications.
Start Creating with Multilingual v2
Transform your text into natural-sounding speech in 29+ languages