Voice Cloning

Overview

Voice cloning allows you to create highly realistic custom AI voices from your own audio samples. This enables a personalized and branded voice experience for your applications.

Upload Audio: Provide clear audio samples of the voice you want to clone
Generate Voice: Outspeed’s AI processes the samples to create a new voice
Use in Apps: Integrate your custom voice into your agents and TTS requests

Cloned voices are currently only available for use with our Text-to-Speech (TTS) API.

Getting Started

Visit Dashboard: Go to the Voice Upload page in your Outspeed Dashboard
Upload Samples: Follow the instructions to upload your audio files
Create Voice: Once processed, your new voice will be available to use
Copy Voice ID: Copy the generated Voice ID for use in your code

Audio Requirements

Length: Minimum 10 seconds of clear speech
Size: Maximum 3 MB
Quality: High-quality audio (e.g., studio recording, no background noise)
Format: Only WAV files are supported
Content: Natural speech, varied tone and pitch

Do not upload copyrighted material or impersonate individuals without consent. Ensure you have the necessary rights for all uploaded audio.

Usage

Once you have your custom Voice ID, use it with our Text-to-Speech (HTTP):

In TTS API Requests

{
  "model": "outspeed-tts-v2",
  "voice": "your-custom-voice-id",
  "text": "This is my custom voice."
}

In Dialogue Generation

{
  "model": "outspeed-tts-v2",
  "text": "*Narrator:* This is my custom voice. Character: Hello!",
  "speaker_voice": "your-custom-voice-id",
  "narrator_voice": "another-custom-voice-id"
}

Best Practices

High-quality audio: Crucial for best results
Consent: Always obtain consent if cloning a real person’s voice
Monitor usage: Track how your custom voices perform in real applications

Voices Send Text Messages

Get Started

Tools

Controls

Overview

Getting Started

Audio Requirements

Usage

In TTS API Requests

In Dialogue Generation

Best Practices

​Overview

​Getting Started

​Audio Requirements

​Usage

​In TTS API Requests

​In Dialogue Generation

​Best Practices

Overview

Getting Started

Audio Requirements

Usage

In TTS API Requests

In Dialogue Generation

Best Practices