ElevenLabs
ElevenLabs is a leading AI voice family for expressive text-to-speech, multilingual narration, dialogue, voice design, and text-to-sound effects. On Martini, use Eleven v3, Multilingual v2, Turbo v2.5, Dialogue v3, and Sound Effects v2 in node-based audio workflows.
Eleven v3 is ElevenLabs' latest expressive speech synthesis model, built for emotional delivery, inline audio tags, and natural multi-speaker dialogue across 70+ languages. Multilingual v2 remains the stable choice for long-form narration, corporate voiceover, e-learning, and projects where consistency matters more than maximum expressiveness. Flash v2.5 is ElevenLabs' current low-latency recommendation, but Martini keeps Turbo v2.5 available for existing workflows that already depend on it; ElevenLabs says Turbo v2.5 is functionally equivalent to Flash v2.5 except Flash is usually lower latency. Sound Effects v2 uses the official eleven_text_to_sound_v2 model for whooshes, ambience, UI sounds, impacts, seamless loops, and production audio details. On Martini, these audio nodes can be chained with video, image, and script nodes, so a creator can draft a scene, generate narration, add SFX, and keep the full production graph together.
| Variant | Description |
|---|---|
| ElevenLabs TTS Eleven v3 | Expressive TTS via provider model eleven_v3, with audio tags, emotional delivery, 70+ languages, and a 5,000 character request limit. |
| ElevenLabs Dialogue Eleven v3 | Multi-speaker dialogue mode for natural conversations, character discussions, dramatic reads, and scripted exchanges. |
| ElevenLabs TTS Multilingual v2 | Stable, high-quality multilingual TTS for narration, e-learning, corporate video, and long-form audio in 29 languages. |
| ElevenLabs TTS Turbo v2.5 | Low-latency multilingual TTS kept for existing workflows; Flash v2.5 is ElevenLabs' newer low-latency recommendation. |
| ElevenLabs Sound Effects v2 | Text-to-sound generation via eleven_text_to_sound_v2 for ambience, impacts, transitions, UI feedback, loops, and cinematic layers. |
Connect ElevenLabs with video, image, script, and music nodes on Martini's infinite canvas. No GPU required — start free.
Get Started FreeUse Eleven v3 when expressive performance and dialogue matter, Multilingual v2 for stable long-form narration, Turbo v2.5 when you need compatibility with existing low-latency Martini workflows, and Sound Effects v2 for non-speech production audio.
Yes. Eleven v3 supports inline audio tags for emotion, delivery, and non-verbal reactions, and ElevenLabs exposes dialogue endpoints for natural multi-speaker audio.
No. ElevenLabs now recommends Flash v2.5 over Turbo v2.5 for new low-latency use cases because Flash is usually lower latency. Martini keeps Turbo v2.5 available for workflows that already rely on it.