New Features
- Expanded ASR Model Selection: You can now choose from five specialized speech-to-text engines via the `stt_engine` parameter. New models include `tavus-parakeet`, `tavus-soniox`, `tavus-whisper`, and `tavus-deepgram-medical`. Use `tavus-auto` to automatically route to the best model for each conversation.
- Voice Isolation: Filter background noise from participant audio to improve conversation quality. Configure it via the `voice_isolation` parameter in the Conversational Flow layer.
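A minimal sketch of selecting an engine, assuming a small client-side helper. The engine identifiers and the `stt_engine` name come from the note above; the helper name and where the fragment sits inside a persona payload are assumptions, so check the API reference before relying on this shape.

```python
# Engine identifiers taken from the release note above.
STT_ENGINES = {
    "tavus-auto",
    "tavus-parakeet",
    "tavus-soniox",
    "tavus-whisper",
    "tavus-deepgram-medical",
}


def stt_settings(engine: str = "tavus-auto") -> dict:
    """Return a settings fragment that selects a speech-to-text engine.

    Hypothetical helper: the note does not say where this fragment
    belongs in a persona payload, so only the key/value pair is built.
    """
    if engine not in STT_ENGINES:
        raise ValueError(f"unknown stt_engine: {engine!r}")
    return {"stt_engine": engine}


print(stt_settings("tavus-deepgram-medical"))
```

Defaulting to `tavus-auto` mirrors the note's suggestion to let routing pick the model unless a specific engine is needed.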
Enhancements
- 30% Faster Phoenix-4 Boot Time: Phoenix-4 conversations now boot 30% faster, significantly reducing the time from conversation creation to readiness.
Changes
- `conversation.replica_interrupted` Event Removed: The `conversation.replica_interrupted` application message has been removed from the Interactions Protocol. This event was deprecated in a previous backend update. Use `conversation.replica.stopped_speaking` with the `interrupted: true` property to detect interruptions instead.
- `duration` and `interrupted` Fields on Replica Stopped Speaking: The `conversation.replica.stopped_speaking` event now includes a `duration` field (how long the replica spoke, in seconds) and an `interrupted` field (true/false) indicating whether the replica was interrupted by the user.
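For clients migrating off the removed event, interruption detection might look like the sketch below. The event name and the `duration` and `interrupted` fields come from the notes above; the envelope shape (an `event_type` key with the fields nested under `properties`) and the function name are assumptions for illustration.

```python
STOPPED_SPEAKING = "conversation.replica.stopped_speaking"


def interruption_info(event: dict):
    """Return (interrupted, duration_seconds) for a stopped-speaking event.

    Returns None for any other event. Assumed envelope:
    {"event_type": ..., "properties": {"duration": ..., "interrupted": ...}}
    """
    if event.get("event_type") != STOPPED_SPEAKING:
        return None
    props = event.get("properties", {})
    return bool(props.get("interrupted", False)), props.get("duration")


sample = {
    "event_type": STOPPED_SPEAKING,
    "properties": {"duration": 3.2, "interrupted": True},
}
print(interruption_info(sample))
```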
New Features
- Event Ordering and Turn Tracking: All server-broadcasted interaction events now include `seq` and `turn_idx` fields. `seq` is a globally monotonic sequence number for ordering events that may arrive out of order, and `turn_idx` groups related events from the same conversation turn.
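One way a client could use these fields is to sort buffered events by `seq` and then bucket them by `turn_idx`. This is a sketch only: it assumes both fields sit at the top level of each event dict, and the function name is hypothetical.

```python
from collections import defaultdict


def order_and_group(events):
    """Sort events by `seq`, then group them by `turn_idx`.

    Assumes `seq` and `turn_idx` are top-level keys on every event.
    Returns (ordered_events, {turn_idx: [events in seq order]}).
    """
    ordered = sorted(events, key=lambda e: e["seq"])
    turns = defaultdict(list)
    for event in ordered:
        turns[event["turn_idx"]].append(event)
    return ordered, dict(turns)


# Events received out of order.
received = [
    {"seq": 3, "turn_idx": 1, "name": "c"},
    {"seq": 1, "turn_idx": 0, "name": "a"},
    {"seq": 2, "turn_idx": 0, "name": "b"},
]
ordered, turns = order_and_group(received)
print([e["name"] for e in ordered])
print({idx: [e["name"] for e in evs] for idx, evs in turns.items()})
```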
Enhancements
- EU ElevenLabs BYOK Support: Customers can now bring their own ElevenLabs API key from EU-region accounts.
Enhancements
- Improved Knowledge Base Retrieval: Optimized underlying infrastructure to improve utterance-to-utterance response times, particularly when `rag_search_quality` is set to `quality`.
New Features
- Expanded Tavus-Hosted LLM Selection: Added new Tavus-hosted LLM options including models from Gemini, Claude, and GPT families. `tavus-gpt-oss` is recommended as the default. Legacy models `tavus-gpt-4.1`, `tavus-gpt-4o`, and `tavus-gpt-4o-mini` are now deprecated.
- Visual RAG: CVI now supports visual retrieval-augmented generation. Upload custom image explanations that are matched and queried via vision embeddings, giving your persona richer visual context during conversations.
Changes
- Persona `context` Field Deprecated: The `context` field has been deprecated in favor of a unified `system_prompt` field. Existing `context` values have been automatically merged into system prompts. The API remains backward compatible, but we recommend using only `system_prompt` going forward.
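If your own code still assembles persona payloads with a separate `context` value, a client-side cleanup could look like the sketch below. The function name and the concatenation rule (system prompt first, then a blank line, then the old context) are assumptions made for illustration; the note does not describe how the server-side merge was performed.

```python
def to_system_prompt_only(persona: dict) -> dict:
    """Return a copy of a persona payload that uses only `system_prompt`.

    Hypothetical merge rule: append the old `context` text after the
    existing system prompt, separated by a blank line.
    """
    updated = dict(persona)
    context = (updated.pop("context", "") or "").strip()
    system_prompt = (updated.get("system_prompt", "") or "").strip()
    if context:
        updated["system_prompt"] = f"{system_prompt}\n\n{context}".strip()
    return updated


legacy = {"system_prompt": "You are a tutor.", "context": "The student is in grade 9."}
print(to_system_prompt_only(legacy))
```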
New Features
- Raven-1 Perception Model: Introduced Raven-1, a multimodal perception model with audio emotion analysis and enhanced visual awareness. Raven-1 captures user emotion from audio in real time (sub-100ms audio perception latency), enabling personas to respond with greater emotional intelligence. The model is now the default for all new personas. Enable it by setting `perception_model_name` in your persona configuration.
- Private Rooms: Require authentication to join conversations for enhanced security. When enabled, we return a JWT meeting token that users must include when entering the room.
Enhancements
- Upgraded Transcription Engine: The transcription engine has been upgraded, with a 3x improvement in word error rate (WER).
New Features
- Website Crawling for Knowledge Base: You can now enable link crawling when creating knowledge base documents. Configure crawl `depth` and `max_pages` to automatically discover and ingest content from linked pages. Additionally, existing crawled documents can now be recrawled to keep knowledge base content up to date.
Changes
- PlayHT TTS Removed: PlayHT has been fully removed as a supported TTS engine. All personas previously using PlayHT should migrate to Cartesia or ElevenLabs.
New Features
- Hard Delete for Conversations: Conversations can now be permanently deleted via the API using the `hard=true` query parameter. Use this for GDPR compliance or data cleanup workflows.
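A hard delete request might be assembled as in the sketch below, which only builds the request and does not send it. The `hard=true` query parameter comes from the note above; the base URL, the `/conversations/{id}` path, the `x-api-key` header, and the function name are assumptions to verify against the API reference.

```python
from urllib.parse import quote, urlencode
from urllib.request import Request

API_BASE = "https://tavusapi.com/v2"  # assumed base URL


def build_hard_delete_request(conversation_id: str, api_key: str) -> Request:
    """Build (but do not send) a DELETE request with hard=true."""
    query = urlencode({"hard": "true"})
    # Percent-encode the ID so it is safe to place in the URL path.
    path_id = quote(conversation_id, safe="")
    url = f"{API_BASE}/conversations/{path_id}?{query}"
    # The header name is an assumption; substitute your real auth scheme.
    return Request(url, method="DELETE", headers={"x-api-key": api_key})


req = build_hard_delete_request("c123", "YOUR_API_KEY")
print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would send it; since the deletion is permanent, it is worth logging the conversation ID first.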
Enhancements
- Default TTS Model Updated to Sonic-3: The default text-to-speech model has been updated to Sonic-3 across all new personas, delivering improved voice quality and naturalness.
- LiveKit Connection Stability: Extensive reliability improvements to the LiveKit-based transport layer, including fixes for connection timeouts, track publishing hangs, event loop starvation, and ping timeout issues.
Changes
- Default LLM Migrated to `tavus-gpt-oss`: The default LLM for all new personas is now `tavus-gpt-oss`. All remaining `tavus-llama-4` personas have been automatically migrated. Legacy Tavus-Llama model references have been removed.
New Features
- LLM Temperature & Top-P Parameters: You can now configure `temperature` and `top_p` parameters for both Tavus-hosted LLMs and custom LLMs via the `extra_body` field in your persona’s LLM configuration.
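As an illustration, an LLM configuration fragment carrying these sampling parameters could be built like this. The `extra_body`, `temperature`, and `top_p` names come from the note above; the surrounding `model` key, the range checks, and the function name are assumptions.

```python
def llm_config(model: str, temperature: float, top_p: float) -> dict:
    """Build a hypothetical LLM configuration fragment with sampling settings.

    The checks are conservative client-side guards, not documented limits.
    """
    if temperature < 0:
        raise ValueError("temperature must be non-negative")
    if not 0.0 <= top_p <= 1.0:
        raise ValueError("top_p must be between 0 and 1")
    return {
        "model": model,  # assumed key name
        "extra_body": {"temperature": temperature, "top_p": top_p},
    }


print(llm_config("tavus-gpt-oss", temperature=0.7, top_p=0.9))
```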
Enhancements
- Text Echo Language Accuracy: Text echoes now correctly use the input language for conversion, improving accuracy in multilingual conversations.
New Features
- Test Mode for Conversations: You can now start conversations in test mode, where the replica does not join. Validate your setup, integrations, and conversational flows without incurring costs or using concurrency slots. Set `test_mode: true` when creating a conversation.
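A create-conversation request body with test mode enabled might be serialized as follows. Only `test_mode` comes from the note above; the `persona_id` field and the function name are assumptions used to make the sketch self-contained.

```python
import json


def conversation_body(persona_id: str, test_mode: bool = False) -> str:
    """Serialize a hypothetical create-conversation request body."""
    body = {"persona_id": persona_id}  # assumed field name
    if test_mode:
        # Per the note, the replica does not join in test mode.
        body["test_mode"] = True
    return json.dumps(body)


print(conversation_body("p123", test_mode=True))
```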
Enhancements
- Fuzzy Search for Personas: Search now supports fuzzy matching for personas, allowing users to find results based on partial matches of UUIDs or names.
New Features
- Memories: CVI now remembers context across conversations. Every conversation builds on the last with full context and time/date awareness, enabling use cases like adaptive tutoring, mentorship, and recurring consultations.
- Knowledge Base (RAG): Bring your own data to conversations instantly. Upload documents or links and get grounded answers with ~30ms retrieval latency. Power AI recruiters, support agents, travel guides, and more with domain-specific knowledge.
- Objectives & Guardrails: Define clear goals, branching logic, and measurable outcomes for your personas while keeping conversations safe, compliant, and on-brand. Ideal for complex workflows and regulated industries.
- Persona Builder: A guided creation flow in the Developer Portal to shape AI personas with goals, behaviors, and style — then test or launch within minutes.
New Features
- Events Console: A new events console in the Developer Portal lets you monitor everything happening during a conversation in real time — from message flows to system activity.
- Conversation Transcripts & Perception Analysis: View full conversation details directly in the Developer Portal, including transcripts with speaker roles and perception analysis showing how your AI persona sees, hears, and responds.
New Features
- Persona Layer Controls: Enable or disable layers like Sparrow directly within a Persona and adjust sensitivity settings in real time from the Developer Portal side panel.
- Persona Editing in Developer Portal: We’ve added new editing capabilities to help you refine your Personas more efficiently. You can now update system prompt, context, and layers directly in our Developer Portal, plus duplicate existing Personas to quickly create variations or use them as starting points for new projects. Find these new features in your Persona Library at platform.tavus.io.
Enhancements
- Interactions Protocol Playground Improvements: Major updates to the Interactions Protocol Playground, including the correct `properties.context` format and an append vs. overwrite toggle.
New Features
- Multilingual Settings in Developer Portal: You can now specify the language of a conversation directly in the Developer Portal, including a new multilingual option for dynamic, real-world interactions.
New Features
- Multilingual Conversation Support: CVI now supports dynamic multilingual conversations through automatic language detection. Set the language parameter to “multilingual” and CVI will automatically detect the user’s spoken language and respond in the same language using ASR technology.
- Audio-Only Mode: CVI now supports audio-only conversations with advanced perception (powered by Raven) and intelligent turn-taking (powered by Sparrow-1). Set `audio_only=true` in your create conversation request to enable streamlined voice-first interactions.
Enhancements
- Fixed CVI responsiveness issue: Resolved an issue where CVI would occasionally ignore very brief user utterances. All user inputs, regardless of length, now receive consistent responses.
- Expanded tavus-llama-4 context window: Increased maximum context window to 32,000 tokens. For optimal performance and response times, we recommend staying under 25,000 tokens.
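The two thresholds in the context window note can be expressed as a simple client-side check. This sketch assumes you already have a token count from your own tokenizer; the function name and the status labels are hypothetical.

```python
MAX_CONTEXT_TOKENS = 32_000      # maximum context window from the note
RECOMMENDED_TOKENS = 25_000      # recommended ceiling from the note


def context_budget_status(token_count: int) -> str:
    """Classify a token count against the documented thresholds."""
    if token_count < 0:
        raise ValueError("token_count must be non-negative")
    if token_count > MAX_CONTEXT_TOKENS:
        return "over_limit"
    if token_count >= RECOMMENDED_TOKENS:
        return "above_recommended"
    return "ok"


print(context_budget_status(18_000))
print(context_budget_status(27_500))
```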
Changes
- Added a new recording requirement to Replica Training: Start the talking segment with a big smile.
Enhancements
- Major Phoenix 3 Enhancements for CVI:
  - Increased frame rate from 27fps to 32fps, significantly boosting smoothness.
  - Reduced Phoenix step’s warm boot time by 60% (from 5s to 2s).
  - Lipsync accuracy improved by ~22% based on AVSR metric.
  - Resolved blurriness and choppiness at conversation start.
  - Enhanced listening mode with more natural micro expressions (eyebrow movements, subtle gestures).
  - Greenscreen mode speed boosted by an additional ~1.5fps.
- Enhanced CVI Audio Quality: Audio clicks significantly attenuated, providing clearer conversational audio.
- Phoenix 3 Visual Artifacts Fix: Resolved visual artifacts in 4K videos on Apple devices, eliminating black spot artifacts in thumbnails.
New Features
- Launched LiveKit Integration: With Tavus video agents now integrated into LiveKit, you can add humanlike video responses to your voice agents in seconds.
- Persona API: Enabled patch updates to personas.
Enhancements
- Resolved TTS (Cartesia) stability issues and addressed hallucination.
- Phoenix 3 Improvements:
  - Fixed blinking/jumping issues and black spots in videos.
  - FPS optimization to resolve static and audio crackling.

