Voice Cloning

Voice Cloning creates custom voice models from audio samples, enabling personalized text-to-speech generation.

Overview

Property	Value
Provider	Various
Cost	100 credits
Modality	Audio
Duration	Variable
Prompt Required	Yes

What It's Best For

Custom voices — Create unique voice models
Brand consistency — Same voice across content
Personalization — Specific voice requirements
Character voices — Consistent characters
Localization — Same voice, different languages

Inputs

Text (Required)

The text to speak in the cloned voice.

Connection Color: 🟡 Yellow

Voice Sample (Required)

Audio sample to clone (10-30 seconds recommended).

Connection Color: 🟠 Orange

Configuration

Clone Strength

Type: Slider
Range: 0 - 1
Default: 0.8

How closely to match the original voice.

Speed

Type: Slider
Range: 0.5 - 2.0
Default: 1.0

Seed

Type: Number

Output

Type: Audio
Connection Color: 🟠 Orange

Use Cases

Brand Voice

Clone brand spokesperson voice,
maintain consistency across all content,
professional delivery.

Character Consistency

Clone character voice for game/animation,
same voice for all dialogue,
consistent personality.

Personalized Content

Clone specific voice for personalized messages,
birthday greetings, custom announcements.

Voice Sample Guidelines

Aspect	Recommendation
Length	10-30 seconds
Quality	Clear, no noise
Content	Natural speech
Format	WAV or MP3
Emotion	Match desired output

Tips for Best Results

Quality samples — Clear, noise-free audio
Consistent samples — Same voice quality
Natural speech — Avoid reading style
Adjust strength — Lower for more natural
Multiple samples — Better voice capture

Chatterbox — Quick voice cloning
XTTS-v2 — Cross-lingual TTS
Dia TTS — Multi-speaker