Understanding Conversational Flow
Conversational flow encompasses the subtle dynamics that make conversations feel natural:- Turn-taking: How the replica decides when to speak and when to listen
- Turn commitment: How firmly the replica holds its conversational turn once it starts speaking
- Interruptibility: How easily the replica can be interrupted by the user
- Active listening: How the replica provides verbal acknowledgments while listening
All conversational flow parameters are optional. When not explicitly configured, the layer remains inactive. However, if you configure any single parameter, the system will apply sensible defaults to all other parameters to ensure consistent behavior.
Configuring the Conversational Flow Layer
Define the conversational flow layer under thelayers.conversational_flow object. Below are the parameters available:
1. turn_detection_model
Specifies the model used for detecting conversational turns.
-
Options:
sparrow-0: Standard turn detection modelsparrow-1: Advanced turn detection model - faster, more accurate, and more natural thansparrow-0(recommended)time-based: Simple timeout-based turn detection
-
Default:
sparrow-0(when any conversational flow parameter is provided)
2. turn_taking_patience
Controls how eagerly the replica claims conversational turns. This affects both response latency and the likelihood of interrupting during natural pauses.
- Options:
low: Eager and quick to respond. May interrupt natural pauses. Best for rapid-fire exchanges or customer service scenarios where speed is prioritized.medium(default): Balanced behavior. Waits for appropriate conversational cues before responding.high: Patient and waits for clear turn completion. Ideal for thoughtful conversations, interviews, or therapeutic contexts.
low: Fast-paced customer support, quick information lookups, casual chatmedium: General purpose conversations, sales calls, presentationshigh: Medical consultations, legal advice, counseling sessions
3. turn_commitment
Controls how aggressively the replica will barge in and take its turn at the start of speaking. This affects the replica’s willingness to start talking even when the user may still be speaking.
- Options:
low: Less aggressive barge-in. The replica waits more clearly for the user to finish before starting to speak.medium(default): Balanced barge-in behavior. The replica will start speaking when it detects an appropriate opportunity.high: More aggressive barge-in. The replica will more readily start speaking even if the user may still be talking, allowing for more dynamic and overlapping conversation.
low: Formal conversations, interviews, scenarios where the user should complete their thoughtsmedium: General conversations, Q&A sessions, guided interactionshigh: Fast-paced conversations, collaborative brainstorming, scenarios requiring quick back-and-forth exchanges
4. replica_interruptibility
Controls how sensitive the replica is to user speech while the replica is talking. Determines whether the replica stops to listen or keeps speaking when interrupted.
- Options:
low: Less interruptible. The replica keeps talking through minor interruptions.medium(default): Balanced sensitivity. Responds to clear interruption attempts.high: Highly sensitive. Stops easily when the user begins speaking, maximizing user control.
low: Educational content delivery, storytelling, guided onboardingmedium: Standard conversations, interviews, consultationshigh: User-driven conversations, troubleshooting, interactive support
turn_commitment and replica_interruptibility work together to create natural conversational dynamics. turn_commitment controls how aggressively the replica will barge in at the start of its turn, while replica_interruptibility controls how easily the replica can be interrupted once it’s already speaking.5. active_listening
Controls the frequency of backchannel responses (like “yeah”, “mhmm”, “I see”) while the user is speaking. These verbal cues signal attentiveness and engagement.
This feature is currently in English-only beta. Backchannel responses will only be generated for English conversations.
- Options:
off(default): No backchannel responses during user speechlow: Infrequent backchannels, minimal verbal acknowledgmentmedium: Moderate backchannels at natural conversation breakshigh: Frequent backchannels, active engagement signals
off: Formal presentations, legal contexts, recorded sessionslow: Professional consultations, technical supportmedium: Coaching sessions, sales calls, general conversationshigh: Therapy sessions, counseling, empathetic support conversations
Relationship with STT Layer (Sparrow-0)
The Conversational Flow layer provides advanced configuration for Sparrow-1, which supersedes the legacy Sparrow-0 configuration in the STT layer. When you configure the Conversational Flow layer withturn_detection_model set to sparrow-1, these settings override the corresponding Sparrow-0 settings in the STT layer.
Parameter Mapping: Sparrow-0 to Sparrow-1
Here’s how Sparrow-0 (STT layer) parameters map to Sparrow-1 (Conversational Flow layer):| Sparrow-0 (STT Layer) | Sparrow-1 (Conversational Flow Layer) | Notes |
|---|---|---|
participant_pause_sensitivity | turn_taking_patience | Controls how long to wait before responding |
participant_interrupt_sensitivity | replica_interruptibility | Controls how easily the replica can be interrupted |
| N/A | turn_commitment | New in Sparrow-1: Controls barge-in behavior |
| N/A | active_listening | New in Sparrow-1: Controls backchannel responses |
Migration Guide
If you’re currently using Sparrow-0 settings in the STT layer and want to upgrade to Sparrow-1: Before (Sparrow-0):Note the inverted mapping:
participant_pause_sensitivity: "high"(quick response) →turn_taking_patience: "low"(eager)participant_interrupt_sensitivity: "low"(hard to interrupt) →replica_interruptibility: "high"(easy to interrupt)
Default Behavior
When the conversational flow layer is not configured, all parameters default toNone and the layer remains inactive. However, if you configure any single parameter, the system automatically applies the following defaults to ensure consistent behavior:
turn_detection_model:sparrow-0turn_taking_patience:mediumturn_commitment:mediumreplica_interruptibility:mediumactive_listening:off
Example Configurations
Example 1: Customer Support Agent
Fast, responsive, and easily interruptible for customer-driven conversations:Example 2: Medical Consultation
Patient, thoughtful, with engaged listening for sensitive conversations:Example 3: Educational Instructor
Committed to delivering complete information with minimal interruption:Example 4: Minimal Configuration
Configure just one parameter—others will use defaults:turn_detection_model:sparrow-0turn_commitment:mediumreplica_interruptibility:mediumactive_listening:off
Best Practices
Match Flow to Use Case
Choose conversational flow settings that align with your application’s purpose:- Speed-critical applications: Use
lowturn-taking patience - Thoughtful conversations: Use
highturn-taking patience - Important information delivery: Use
highturn commitment andlowinterruptibility - User-controlled interactions: Use
lowturn commitment andhighinterruptibility
Balance Patience and Commitment
The combination ofturn_taking_patience and turn_commitment creates different conversational feels:
| Patience | Commitment | Result |
|---|---|---|
| Low | Low | Rapid, flexible, back-and-forth |
| Low | High | Quick to start, committed to finish |
| High | Low | Thoughtful but flexible |
| High | High | Deliberate, complete responses |
Consider Cultural Context
Conversational norms vary across cultures. Some cultures prefer:- More overlap and interruption (consider lower commitment, higher interruptibility)
- Clear turn-taking with pauses (consider higher patience, lower interruptibility)
Test with Real Users
Conversational flow preferences can be subjective. Test your configuration with representative users to ensure it feels natural for your audience.Refer to the Create Persona API for the complete API specification and additional persona configuration options.

