Documentation Index
Fetch the complete documentation index at: https://docs.tavus.io/llms.txt
Use this file to discover all available pages before exploring further.
New Features
- Node.js plugin support for LiveKit integration: The LiveKit Agents integration now supports Node.js via the
@livekit/agents-plugin-tavusplugin, in addition to Python. Install withnpm install @livekit/agents @livekit/agents-plugin-tavus. Learn more - Real-time event timestamps: Every interaction event (
conversation.utterance,conversation.started_speaking, etc.) now carries atimestampfield delivered in real time on the respective Interaction event. Learn more - Transcript utterance timestamps: End-of-call transcripts now carry per-turn timing. The
application.transcription_readywebhook (and the same payload nested undereventsin the verboseGET /conversations/{id}?verbose=trueresponse) now includestimestamp(Unix epoch float, seconds — same field name as live interaction events),seconds_from_start,duration(seconds, float — same field name asconversation.stopped_speaking.duration), andinference_id(on assistant turns) on each transcript entry.
Enhancements
- New, and more detailed, error messages when a replica fails: Learn more
New Features
- AI Image Fixer API support: Create Replica now accepts an
auto_fix_training_imageproperty. Set it totrueto use Tavus’s AI Image Fixer to instantly fix any uploaded image to fit our requirements, eliminating the need for editing or recapturing photos. Learn more
New Features
- AI Image Fixer: Instantly fix any uploaded image to fit our requirements, eliminating the need for editing or recapturing photos.
New Features
- Frame Checker (Video to Replica): Get instant feedback, before recording, on whether your camera setup meets our requirements.
Enhancements
- Less footage required (Video to Replica): We now only require 1 minute of video, down from the 2 minutes previously needed.
- Simpler in-portal recording flow (Video to Replica): A streamlined recording experience in the portal to help you capture high-quality recordings.
New Features
- Image to Replica: Build a replica from a single still. Drop in a photo, illustration, or brand mascot.
New Features
- Voice Activity Detector improvements: Resulting in a smoother conversational experience in noisy environments. This is automatically rolled out to all users.
- Expanded Recording Storage Support: Conversation recordings can now be delivered to Google Cloud Storage (GCP) and Azure Blob Storage, in addition to AWS S3. Learn more
Enhancements
- Tavus Components Library Updates: Improved audio-video sync, plus new chat components and closed captions with streaming support. Learn more
New Features
- Wake Phrase: Personas can now stay silent until they hear a specific phrase, similar to how voice assistants like Siri or Alexa work. Configure it via the
wake_phraseparameter in the Conversational Flow layer. The persona still hears everything that is said and responds with full conversation history once the wake phrase is detected. Learn more
New Features
- Idle Engagement: Replicas can now proactively re-engage by speaking to the user after a period where the user is silent. Eagerness of this feature can be configured via the
idle_engagementparameter in the Conversational Flow layer. Learn more
New Features
- Speaking Events: Two new events —
conversation.started_speakingandconversation.stopped_speaking— fire for both the replica and the user with arolefield ("replica"or"user") identifying the speaker.conversation.stopped_speakingincludes aninterruptedboolean and adurationfield (in seconds). Learn more - Conversation Diagnostics: A new diagnostics surface for inspecting what happened in a conversation — including packet loss, network connection, FPS, and more — designed to make debugging significantly faster. Click on any conversation in the Developer Portal to access its diagnostics page.
Enhancements
- Non-Interruptible Custom Greetings: Custom greetings now finish entirely before users can interrupt speech. Previously, participants could talk over a
custom_greeting; now the replica completes the greeting before it begins listening. Learn more - Improved Turn-Taking Latency: Significant TTS optimizations reduce turn-taking latency, resulting in faster and more natural back-and-forth during conversations.
New Features
- Streaming Utterance Event: A new
conversation.utterance.streamingevent progressively reports what has been said during a conversation turn for both replica and user utterances. Use it to power closed captioning and build accurate transcripts — especially when a user interrupts the replica, since the streaming event reflects only the words actually spoken rather than the full LLM response. Learn more - Pronunciation Dictionaries: Define custom pronunciation rules so your persona says brand names, technical terms, acronyms, and foreign words exactly right. Choose between simple alias substitution (e.g., “Tavus” → “TAH-vus”) or precise IPA phonetic notation. Create a dictionary once and attach it to a persona via the TTS layer — any updates automatically propagate to all linked personas with zero extra latency at conversation time. Learn more
New Features
- Voice Isolation: Filter background noise from participant audio to improve conversation quality. Configure it via the
voice_isolationparameter in the Conversational Flow layer. Learn more
Changes
- Chat Interrupt History: Personas now know when they have been interrupted. This allows the persona to pick back up where it left off, and also improves objectives adherence.
New Features
- Expanded ASR Model Selection: You can now choose from five specialized speech-to-text engines via the
stt_engineparameter. New models includetavus-parakeet,tavus-soniox,tavus-whisper, andtavus-deepgram-medical. Usetavus-autoto automatically route to the best model for each conversation. Learn more
Enhancements
- 30% Faster Phoenix-4 Boot Time: Phoenix-4 conversations now boot 30% faster, significantly reducing the time from conversation creation to readiness.
Changes
conversation.replica_interruptedEvent Removed: Theconversation.replica_interruptedapplication message has been removed from interaction events. This event was deprecated in a previous backend update. Useconversation.replica.stopped_speakingwith theinterrupted: trueproperty to detect interruptions instead.durationandinterruptedFields on Replica Stopped Speaking: Theconversation.replica.stopped_speakingevent now includes adurationfield (how long the replica spoke in seconds) and aninterruptedfield (true/false) indicating whether the replica was interrupted by the user. Learn more
New Features
- Event Ordering and Turn Tracking: All server-broadcasted interaction events now include
seqandturn_idxfields.seqis a globally monotonic sequence number for ordering events that may arrive out of order, andturn_idxgroups related events from the same conversation turn. Learn more
Enhancements
- 30% Faster Phoenix-4 Boot Time: Phoenix-4 conversations now boot 30% faster, significantly reducing the time from conversation creation to readiness.
Enhancements
- EU ElevenLabs BYOK Support: Customers can now bring their own ElevenLabs API key from EU-region accounts.
Enhancements
- Improved Knowledge Base Retrieval: Optimized underlying infrastructure to improve utterance to utterance response times, particularly when
rag_search_qualityis set toquality.
New Features
- Expanded Tavus-Hosted LLM Selection: Added new Tavus-hosted LLM options including models from Gemini, Claude, and GPT families.
tavus-gpt-ossis recommended as the default. Legacy modelstavus-gpt-4.1,tavus-gpt-4o, andtavus-gpt-4o-miniare now deprecated. Learn more → - Visual RAG: CVI now supports visual retrieval-augmented generation. Upload custom image explanations that are matched and queried via vision embeddings, giving your persona richer visual context during conversations.
Changes
- Persona
contextField Deprecated: Thecontextfield has been deprecated in favor of a unifiedsystem_promptfield. Existingcontextvalues have been automatically merged into system prompts. The API remains backward compatible, but we recommend using onlysystem_promptgoing forward.
New Features
- Raven-1 Perception Model: Introduced Raven-1, a multimodal perception model with audio emotion analysis and enhanced visual awareness. Raven-1 captures user emotion from audio in real time (sub-100ms audio perception latency), enabling personas to respond with greater emotional intelligence. The model is now the default for all new personas. Enable it by setting
perception_model_namein your persona configuration. Learn more → - Private Rooms: Require authentication to join conversations for enhanced security. When enabled, we return a JWT meeting token that users must include when entering the room. Learn more
Enhancements
- Upgraded Transcription Engine: Upgraded transcription engine with 3x improvements in word error rates (WER).
New Features
- Website Crawling for Knowledge Base: You can now enable link crawling when creating knowledge base documents. Configure crawl
depthandmax_pagesto automatically discover and ingest content from linked pages. Additionally, existing crawled documents can now be recrawled to keep knowledge base content up to date.
Changes
- PlayHT TTS Removed: PlayHT has been fully removed as a supported TTS engine. All personas previously using PlayHT should migrate to Cartesia or ElevenLabs.
New Features
- Hard Delete for Conversations: Conversations can now be permanently deleted via the API using the
hard=truequery parameter. Use this for GDPR compliance or data cleanup workflows.
Enhancements
- Default TTS Model Updated to Sonic-3: The default text-to-speech model has been updated to Sonic-3 across all new personas, delivering improved voice quality and naturalness.
- LiveKit Connection Stability: Extensive reliability improvements to the LiveKit-based transport layer, including fixes for connection timeouts, track publishing hangs, event loop starvation, and ping timeout issues.
Changes
- Default LLM Migrated to
tavus-gpt-oss: The default LLM for all new personas is nowtavus-gpt-oss. All remainingtavus-llama-4personas have been automatically migrated. Legacy Tavus-Llama model references have been removed.
New Features
- LLM Temperature & Top-P Parameters: You can now configure
temperatureandtop_pparameters for both Tavus-hosted LLMs and custom LLMs via theextra_bodyfield in your persona’s LLM configuration. Learn more →
Enhancements
- Text Echo Language Accuracy: Text echoes now correctly use the input language for conversion, improving accuracy in multilingual conversations.
New Features
- Test Mode for Conversations: You can now start conversations in test mode, where the replica does not join. Validate your setup, integrations, and conversational flows without incurring costs or using concurrency slots. Set
test_mode: truewhen creating a conversation. Learn more →
Enhancements
- Fuzzy Search for Personas: Search now supports fuzzy matching for personas, allowing users to find results based on partial matches of UUIDs or names.
New Features
- Memories: CVI now remembers context across conversations. Every conversation builds on the last with full context and time/date awareness, enabling use cases like adaptive tutoring, mentorship, and recurring consultations. Learn more →
- Knowledge Base (RAG): Bring your own data to conversations instantly. Upload documents or links and get grounded answers with ~30ms retrieval latency. Power AI recruiters, support agents, travel guides, and more with domain-specific knowledge. Learn more →
- Objectives & Guardrails: Define clear goals, branching logic, and measurable outcomes for your personas while keeping conversations safe, compliant, and on-brand. Ideal for complex workflows and regulated industries. Learn more →
- Persona Builder: A guided creation flow in the Developer Portal to shape AI personas with goals, behaviors, and style — then test or launch within minutes.
New Features
- Events Console: A new events console in the Developer Portal lets you monitor everything happening during a conversation in real time — from message flows to system activity.
- Conversation Transcripts & Perception Analysis: View full conversation details directly in the Developer Portal, including transcripts with speaker roles and perception analysis showing how your AI persona sees, hears, and responds.
New Features
- Persona Layer Controls: Enable or disable layers like Sparrow directly within a Persona and adjust sensitivity settings in real time from the Developer Portal side panel.
- Persona Editing in Developer Portal: We’ve added new editing capabilities to help you refine your Personas more efficiently. You can now update system prompt, context, and layers directly in our Developer Portal, plus duplicate existing Personas to quickly create variations or use them as starting points for new projects. Find these new features in your Persona Library at platform.tavus.io.
Enhancements
- Interaction Events Playground Improvements: Major updates to the Interaction Events Playground including correct
properties.contextformat and append vs overwrite toggle.
New Features
- Multilingual Settings in Developer Portal: You can now specify the language of a conversation directly in the Developer Portal, including a new multilingual option for dynamic, real-world interactions.
New Features
- Multilingual Conversation Support: CVI now supports dynamic multilingual conversations through automatic language detection. Set the language parameter to “multilingual” and CVI will automatically detect the user’s spoken language and respond in the same language using ASR technology.
- Audio-Only Mode: CVI now supports audio-only conversations with advanced perception (powered by Raven) and intelligent turn-taking (powered by Sparrow-1). Set
audio_only=truein your create conversation request to enable streamlined voice-first interactions.
Enhancements
- Fixed CVI responsiveness issue: Resolved an issue where CVI would occasionally ignore very brief user utterances. All user inputs, regardless of length, now receive consistent responses.
- Expanded tavus-llama-4 context window: Increased maximum context window to 32,000 tokens. For optimal performance and response times, we recommend staying under 25,000 tokens.
Changes
- Added a new recording requirement to Training from a video : Start the talking segment with a big smile.
Enhancements
Enhancements
- Major Phoenix 3 Enhancements for CVI:
- Increased frame rate from 27fps to 32fps, significantly boosting smoothness.
- Reduced Phoenix step’s warm boot time by 60% (from 5s to 2s).
- Lipsync accuracy improved by ~22% based on AVSR metric.
- Resolved blurriness and choppiness at conversation start.
- Enhanced listening mode with more natural micro expressions (eyebrow movements, subtle gestures).
- Greenscreen mode speed boosted by an additional ~1.5fps.
- Enhanced CVI Audio Quality: Audio clicks significantly attenuated, providing clearer conversational audio.
- Phoenix 3 Visual Artifacts Fix: Resolved visual artifacts in 4K videos on Apple devices, eliminating black spot artifacts in thumbnails.
New Features
- Launched LiveKit Integration : With Tavus video agents now integrated into LiveKit, you can add humanlike video responses to your voice agents in seconds.
- Persona API : Enabled patch updates to personas.
Enhancements
- Resolved TTS (Cartesia) stability issues and addressed hallucination.
- Phoenix 3 Improvements:
- Fixed blinking/jumping issues and black spots in videos.
- FPS optimization to resolve static and audio crackling.

