2 Models Available

How to Clone a Voice With AI

A podcaster or course creator clones their own voice from a 30-second sample, then generates new narration without re-recording. On Martini's canvas, drop a clean reference clip into an audio node, route it into ElevenLabs Voice Cloning, Fish Audio S2-Pro voice cloning, or Minimax Voice Design, and chain the cloned voice into downstream script-to-speech, dubbing, or lip-sync nodes. Use this for founder-voice training narration, course modules, or localizing existing video. Only clone voices you own or have permission to use. Pick a model below to walk through the cloning workflow.

Try Free

Choose a Model to Get Started

ElevenLabs

ElevenLabs Eleven v3

ElevenLabs offers two voice cloning tiers that map directly to how much reference audio you have. Instant Voice Cloning trains on a 10-second sample and is ready in seconds — fine for internal narration drafts, prototype dubs, and personal video voiceover. Professional Voice Cloning needs 30+ minutes of clean studio audio, but the resulting voice can carry an entire course or audiobook without drifting. On Martini, both modes feed Eleven v3 (or Multilingual v2 for non-English work), so once your voice is registered you can generate new narration in 70+ languages with inline emotion tags. Critical: only clone voices you own or have explicit written permission to clone. ElevenLabs requires voice verification for your own voice, and consent matters whether the platform enforces it or not.

4 steps + 2 promptsView guide

Fish Audio

Fish Audio S2-Pro

Fish Audio S2-Pro is the open-source alternative to ElevenLabs cloning, with two real differentiators: natural-language bracket control inside the prompt (`[whispering]`, `[laughing nervously]`, `[pause]`) and an open serving stack you can self-host. Voice cloning needs a clean reference audio sample plus a matching transcript — Fish Audio uses the transcript text to disambiguate phonemes, so a misaligned transcript hurts cloning quality more than it does on ElevenLabs. Coverage is 80+ languages with automatic detection. Critical: only clone voices you own or have explicit written permission to clone. Fish Audio is open-source, which means consent enforcement is on you, not the platform — make the rights clearance explicit before you upload reference audio.

4 steps + 2 promptsView guide

More How-To Guides

This website uses cookies

We use cookies to keep Martini secure, remember your preferences, and, if you allow it, measure product performance. Read more

Strictly necessary

Required for authentication, security, payments, and core product flows.

Functionality

Remembers product preferences such as theme, language, and your most recent workspace.

Performance

Helps us understand product usage and site performance with PostHog, Vercel Analytics, Speed Insights, and Ahrefs.

Targeting

Allows marketing and advertising tags we may run through Google Tag Manager.

How to Clone a Voice With AI

Choose a Model to Get Started

ElevenLabs Eleven v3

Fish Audio S2-Pro

More How-To Guides

How to Generate AI Background Music

How to Create AI Voiceovers

How to Create an AI Podcast Intro

How to Generate AI Dialogue

This website uses cookies

How to Clone a Voice With AI

Choose a Model to Get Started

ElevenLabs Eleven v3

Fish Audio S2-Pro

More How-To Guides

How to Generate AI Background Music

How to Create AI Voiceovers

How to Create an AI Podcast Intro

How to Generate AI Dialogue