STT models
Select an STT model using thestt_engine parameter in the layers.stt object. The following models are available:
| Model | Description |
|---|---|
tavus-auto | Automatically selects the best STT model for the conversation’s language. Recommended for most use cases. |
tavus-parakeet | Highest throughput, lowest latency for English and European languages. |
tavus-soniox | Purpose-built for Indian languages with broad multilingual coverage. |
tavus-whisper | Broad multilingual coverage across all supported languages. |
tavus-deepgram-medical | Domain-specific English STT optimized for clinical and healthcare vocabulary. English only. |
tavus-advanced | Deprecated. Still active but not recommended for new integrations. |
Choosing the right model
A language is listed for a model only if both STT and TTS coverage are available.| Category | Recommended model | Supported languages |
|---|---|---|
| General purpose | tavus-auto | All 43 languages |
| Indic languages | tavus-soniox | Bengali, English, Gujarati, Hindi, Kannada, Malayalam, Marathi, Punjabi, Tamil, Telugu + broad support for all other languages |
| English + European | tavus-parakeet | Bulgarian, Croatian, Czech, Danish, Dutch, English, Finnish, French, German, Greek, Hungarian, Italian, Polish, Portuguese, Romanian, Russian, Slovak, Spanish, Swedish, Ukrainian |
| Broad multilingual | tavus-whisper or tavus-soniox | All 43 languages |
| Medical (English) | tavus-deepgram-medical | English |
Using Smart Language Detection requires either
tavus-auto, tavus-soniox, or tavus-whisper.Configuring the STT layer
Define the STT layer under thelayers.stt object.
stt_engine
Set the STT model for transcription:
hotwords
Use this to prioritize certain names or terms that are difficult to transcribe.
Example configuration
Below is an example persona with a configured STT layer using the recommendedtavus-auto engine:
Refer to the Create Persona API for a complete list of supported fields.

