Configuration best practices

Getting great results from your AI assistant comes down to picking the right engine settings. Use this guide when configuring a new assistant or troubleshooting an existing one.

1. Pick a mode

Mode	Why choose it	Recommended model
Dualplex (Beta)	Fast turn-taking + premium or cloned voices	Gemini Flash 2.0/2.5 or GPT-5 Realtime
Speech-to-speech	Fastest turn-taking and most natural flow	GPT-5 Realtime
Pipeline	Maximum control over voice and long-form replies	GPT-5 Mini

Experiment with all three modes. Record the same scenario in each and compare response time and caller satisfaction before committing to one.

See assistant modes for a full comparison.

2. Choose a transcriber (Pipeline only)

Transcriber	Accuracy	Latency	Best for
Azure	⭐⭐⭐⭐	Slower	Highest transcription fidelity
Gladia	⭐⭐⭐	Faster	Good all-rounder for most languages
Deepgram	⭐⭐⭐	Faster	Solid choice — test against Gladia for your language

Different languages, accents, and background noise affect each transcriber differently. Run a quick A/B test and keep the better performer.

3. Select an LLM model

Model	Strengths	Trade-offs
GPT-5 Mini	Balanced reasoning with low latency	May be slower than realtime models for rapid turn-taking
GPT-5 Realtime	Designed for ultra-low-latency voice turns	Best for speech-to-speech and Dualplex
GPT-4o	Strong reasoning and multimodal understanding	Higher latency
Gemini Flash 2.0/2.5	Ultra-fast for voice turns	Excellent for Dualplex and multimodal

If speed is critical, use GPT-5 Realtime (great for speech-to-speech) or Gemini Flash 2.0/2.5 (great with Dualplex). For richer reasoning, use GPT-4o or GPT-5 Mini and offset the higher latency with filler audio.

4. Noise cancellation

Turn ON when callers are on speakerphone or in noisy environments for cleaner transcription.
Turn OFF if words are being clipped or the assistant is missing parts of what callers say.

If your assistant isn’t hearing callers clearly, try disabling noise cancellation first.

5. Conversation timers

Parameter	Recommended	Why
Re-engagement interval	~30 s	Gives callers enough time to think. Lower values can feel pushy.
Max silence duration	~60 s	Prevents premature hang-ups while still ending truly silent calls.

Test with real calls — too low can interrupt callers, too high leaves awkward gaps.

6. Initial message

Mode	How it’s used	Best practice
Pipeline	Read exactly as written (TTS conversion)	Write the greeting verbatim: “Hello, this is Alex from…”
Dualplex	Read exactly as written (ElevenLabs TTS)	Write verbatim, then select your cloned voice
Speech-to-speech	Interpreted as a prompt by the model	Include instructions like “Greet the customer and say…” or prepend `say exactly:` to ensure literal output

7. Ambient sound

Ambient sound adds subtle background noise under the assistant’s voice to mask processing delays and create a more natural audio experience. It is enabled by default.

If your assistant isn’t hearing callers well, try turning off ambient sound or lowering its volume.

8. Endpointing sensitivity

Control when your assistant starts talking after a caller finishes speaking.

Setting	Effect	Use when
Lower sensitivity	Assistant responds faster after caller stops	You want snappy, quick-turn conversations
Higher sensitivity	Assistant waits longer before responding	Callers give longer, more detailed replies

If your assistant cuts off callers mid-sentence, increase sensitivity. If responses feel sluggish, decrease it.

9. Debug with the call transcript

If something isn’t working as expected:

Go to Call history

Navigate to the Call history page in your dashboard.

Open the last test call

Click on the most recent call you tested.

Review the transcript

The call transcript includes all function calls and their parameters, making it easy to spot where the assistant went off-track.

For a complete list of every toggle and slider, see the general settings reference.

Get Started

AI Assistants

Calls & Campaigns

Channels

Leads & Data

Integrations

Support

Configuration best practices

1. Pick a mode

2. Choose a transcriber (Pipeline only)

3. Select an LLM model

4. Noise cancellation

5. Conversation timers

6. Initial message

7. Ambient sound

8. Endpointing sensitivity

9. Debug with the call transcript

Get Started

AI Assistants

Calls & Campaigns

Channels

Leads & Data

Integrations

Support

Documentation Index

​1. Pick a mode

​2. Choose a transcriber (Pipeline only)

​3. Select an LLM model

​4. Noise cancellation

​5. Conversation timers

​6. Initial message

​7. Ambient sound

​8. Endpointing sensitivity

​9. Debug with the call transcript

1. Pick a mode

2. Choose a transcriber (Pipeline only)

3. Select an LLM model

4. Noise cancellation

5. Conversation timers

6. Initial message

7. Ambient sound

8. Endpointing sensitivity

9. Debug with the call transcript