TTS provider
Select your Text-to-Speech provider in the assistant settings. This option is available in Pipeline and Dualplex modes. Available providers:- ElevenLabs — high-quality voices with a wide selection of languages and accents.
- Cartesia — fast, low-latency synthesis, useful when response speed is a priority.
Choosing a voice
Each TTS provider has its own voice library. You can filter by gender, accent, or language depending on your provider. Browse the options in your assistant’s voice settings and preview voices before committing.Cloning a voice
Voice cloning creates a custom voice from an audio sample — ideal for brand consistency, matching a company spokesperson, or building a personal connection with callers. Cloning is available in Pipeline and Dualplex modes. Requirements by provider:| Provider | Requirements |
|---|---|
| Cartesia | Single audio file, at least 10 seconds, one speaker, no background noise |
| ElevenLabs | Audio samples totalling over 1 minute (max 5 minutes), one speaker, no background noise. Quality matters more than quantity. |
Start the cloning process
In your assistant’s voice settings, click Clone voice next to the voice selector.
Wait for processing
Processing typically takes a few minutes. You’ll see the voice appear in the dropdown when it’s ready.
Best practices
- Use high-quality audio — clearer samples produce better clones.
- Record with steady delivery — natural tone, no abrupt pauses or changes in volume.
- Eliminate background noise — record in a quiet room or use a quality microphone.
- Get proper consent — only clone voices you have permission to use.