Overview
Voice cloning allows you to create highly realistic custom AI voices from your own audio samples. This enables a personalized and branded voice experience for your applications.- Upload Audio: Provide clear audio samples of the voice you want to clone
- Generate Voice: Outspeed’s AI processes the samples to create a new voice
- Use in Apps: Integrate your custom voice into your agents and TTS requests
Cloned voices are currently only available for use with our Text-to-Speech (TTS) API.
Getting Started
- Visit Dashboard: Go to the Voice Upload page in your Outspeed Dashboard
- Upload Samples: Follow the instructions to upload your audio files
- Create Voice: Once processed, your new voice will be available to use
- Copy Voice ID: Copy the generated Voice ID for use in your code
Audio Requirements
- Length: Minimum 10 seconds of clear speech
- Size: Maximum 3 MB
- Quality: High-quality audio (e.g., studio recording, no background noise)
- Format: Only WAV files are supported
- Content: Natural speech, varied tone and pitch
Do not upload copyrighted material or impersonate individuals without consent. Ensure you have the necessary
rights for all uploaded audio.
Usage
Once you have your custom Voice ID, use it with our Text-to-Speech (HTTP):In TTS API Requests
In Dialogue Generation
Best Practices
- High-quality audio: Crucial for best results
- Consent: Always obtain consent if cloning a real person’s voice
- Monitor usage: Track how your custom voices perform in real applications