inactive status and is ready to configure and activate.
Endpoint
POST /user/assistant
Engine modes
The assistant supports three engine modes, each with different requirements:| Mode | Description | Required field |
|---|---|---|
pipeline | Traditional STT → LLM → TTS pipeline | llm_model_id |
multimodal | Real-time multimodal AI | multimodal_model_id |
dualplex | Multimodal brain with custom TTS voice | multimodal_model_id |
Request
Core required fields
Name of the assistant. Maximum 255 characters.
Voice ID to use. Use the Get Voices endpoint with the
mode parameter to retrieve voices compatible with your engine mode.Primary language ID. Use the Get Languages endpoint to retrieve available languages.
Assistant type:
inbound or outbound.Engine mode:
pipeline, multimodal, or dualplex.Timezone identifier (e.g.,
America/New_York, Europe/Bucharest).Greeting message spoken at the start of a call. Maximum 200 characters.
System prompt defining the assistant’s behavior and personality.
Mode-specific fields
LLM model ID. Required for
pipeline mode. Use the Get Models endpoint to retrieve available models.Multimodal model ID. Required for
multimodal and dualplex modes. Use the Get Models endpoint to retrieve available multimodal models.Fallback LLM model ID for tool calls in multimodal/dualplex modes. Optional.
Turn detection sensitivity for multimodal/dualplex modes. Range: 0–1. Defaults to auto.
Secondary languages
Additional language IDs the assistant can speak. The assistant auto-detects and switches languages during the call.
Knowledge base
ID of the knowledge base to attach to this assistant.
How the assistant uses the knowledge base:
function_call— AI calls a function to search. Required formultimodalanddualplexmodes.prompt— Knowledge is injected into the system prompt. Pipeline mode only.
Phone number
ID of the phone number to assign. Must belong to your account.
Custom mid-call tools
IDs of custom mid-call tools to attach. Each tool must belong to your account.
Built-in tools
Built-in tools to enable on the assistant. Each tool object has a
type and type-specific fields.Voice and TTS settings
Voice stability. Range: 0–1. Higher values produce a more consistent voice.
Voice similarity to the original. Range: 0–1.
Speech speed multiplier. Range: 0.7–1.2.
LLM temperature. Range: 0–1. Lower values produce more deterministic responses.
Whether to enable emotional TTS synthesis.
Custom TTS provider ID. Auto-selected based on language if not provided.
Custom STT provider ID. Pipeline mode only. Auto-selected based on language if not provided.
Call behavior
Whether the caller can interrupt the assistant.
Whether to use filler audio while processing (e.g., “Hmm”, “Let me check”).
Custom filler phrases per category. Each category is an array of short strings.
Whether to record calls.
Whether to enable noise cancellation.
If
true, the assistant waits for the customer to speak first.Timing
Maximum call duration in seconds. Range: 20–1200.
Maximum silence duration before re-engagement, in seconds. Range: 1–360.
Maximum silence at call start before ending, in seconds. Range: 1–120.
Maximum ringing time before giving up, in seconds. Range: 1–60.
Re-engagement
Seconds between re-engagement attempts. Range: 7–600.
Custom re-engagement message prompt. Maximum 1000 characters.
Voicemail
Whether to end the call when voicemail is detected.
Message to leave on voicemail before hanging up. Maximum 1000 characters.
Endpoint detection
Voice activity detection type:
vad or ai.Endpoint sensitivity level. Range: 0–5.
Interrupt sensitivity level. Range: 0–5.
Minimum words spoken before an interruption is processed. Range: 0–10.
Ambient sound
Background ambient sound:
off, office, city, forest, crowded_room, cafe, or nature.Ambient sound volume. Range: 0–1.
Webhook configuration
Whether to enable post-call webhook notifications.
Webhook URL for post-call notifications. Required if
is_webhook_active is true.Whether to send webhooks only for completed calls.
Whether to include the recording URL in webhook payloads.
Post-call evaluation
Whether to enable AI post-call evaluation.
Schema for extracting structured data from calls.
Variables
Custom key-value pairs accessible in prompts via
{{variable_name}}.Conversation ended settings
Minutes of chat inactivity before the conversation is considered ended. Range: 1–1440.
Whether to allow re-triggering after the conversation ends due to inactivity.
Webhook URL called when a chat conversation ends due to inactivity.
Response
Confirmation message.
Example
201 Success
422 Validation error
New assistants are created with
inactive status. Activate them through the Intellixent portal or by attaching a phone number and enabling them.