Voice Cloning
Voice Cloning creates custom voice models from audio samples, enabling personalized text-to-speech generation.
Overview
| Property | Value |
|---|---|
| Provider | Various |
| Cost | 100 credits |
| Modality | Audio |
| Duration | Variable |
| Prompt Required | Yes |
What It's Best For
- Custom voices — Create unique voice models
- Brand consistency — Same voice across content
- Personalization — Specific voice requirements
- Character voices — Consistent characters
- Localization — Same voice, different languages
Inputs
Text (Required)
The text to speak in the cloned voice.
Connection Color: 🟡 Yellow
Voice Sample (Required)
Audio sample to clone (10-30 seconds recommended).
Connection Color: 🟠Orange
Configuration
Clone Strength
Type: Slider
Range: 0 - 1
Default: 0.8
How closely to match the original voice.
Speed
Type: Slider
Range: 0.5 - 2.0
Default: 1.0
Seed
Type: Number
Output
Type: Audio
Connection Color: 🟠Orange
Use Cases
Brand Voice
Clone brand spokesperson voice,
maintain consistency across all content,
professional delivery.
Character Consistency
Clone character voice for game/animation,
same voice for all dialogue,
consistent personality.
Personalized Content
Clone specific voice for personalized messages,
birthday greetings, custom announcements.
Voice Sample Guidelines
| Aspect | Recommendation |
|---|---|
| Length | 10-30 seconds |
| Quality | Clear, no noise |
| Content | Natural speech |
| Format | WAV or MP3 |
| Emotion | Match desired output |
Tips for Best Results
- Quality samples — Clear, noise-free audio
- Consistent samples — Same voice quality
- Natural speech — Avoid reading style
- Adjust strength — Lower for more natural
- Multiple samples — Better voice capture
Related Models
- Chatterbox — Quick voice cloning
- XTTS-v2 — Cross-lingual TTS
- Dia TTS — Multi-speaker