Overview
Theinput_language
field in your session configuration provides a hint about the primary language users will speak.
However, the model can understand multiple languages simultaneously, regardless of the hint provided.
The AI agent will understand their speech and respond in the language corresponding to the chosen voice.
The response language depends on your chosen voice. See available voices for language support. The
input_language
setting is a hint to optimize recognition, but users can speak in any supported language.Supported Languages
Currently supported input language hints:- en - English (default)
- zh - Chinese (Mandarin)
- hi - Hindi
- de - German
- es - Spanish
- fr - French
- it - Italian
- ja - Japanese
- ko - Korean
- nl - Dutch
- pl - Polish
- pt - Portuguese
- ru - Russian
- sv - Swedish
- tr - Turkish
Configuration
Addinput_language
to your session configuration as a hint for the primary expected language:
Language Examples
English (Default)
Chinese (Mandarin) with Multilingual Support
Hindi with Multilingual Support
Language Detection
The system automatically detects which languages are spoken in each user input and includes this information in the transcription event:languages
array contains all detected languages in the user’s input, allowing you to understand the linguistic composition of each utterance.
You can listen to this event to understand the languages spoken in the user’s input.
Important Notes
- Hint, not restriction:
input_language
is a hint to optimize recognition, not a limitation - Multilingual support: Users can speak multiple languages in a single session or even single utterance
- Voice-dependent output: The AI agent responds in the language of the chosen voice (see available voices)
- Default behavior: Default input language value is
en
for English - Language detection: Each transcription includes detected languages in the
languages
array