> ## Documentation Index
>
> Fetch the complete documentation index at: https://docs.tavus.io/llms.txt
>
> Use this file to discover all available pages before exploring further.

# Training from an Image

> Upload a headshot and select a `voice_name` to create a replica without recording a training video.

Use this path when you call [Create Replica](/api-reference/phoenix-replica-model/create-replica) with **`train_image_url`** and **`voice_name`**. The image file must be reachable at a **publicly accessible URL** (for example a presigned S3 GET URL), same as for video uploads.

We recommend using the [Developer Portal](https://platform.tavus.io/dev/replicas/create) to upload an image as it provides real-time validation to ensure it meets all requirements before training.

## Image Requirements

Upload a clear, front-facing headshot that meets the following requirements:

* **Formats:** JPG or PNG
* **Minimum resolution:** 512×512 pixels
* **Only one person** visible in the image
* **Head and shoulders** clearly visible in frame
* **No glasses, hats, or face-covering accessories**
* **Avoid visible jewelry** such as large earrings or necklaces
* **Keep hair behind the shoulders** and away from the face and neck
* **Use even lighting** with minimal shadows across the face
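The format and resolution requirements above can be checked locally before you upload. The following is a minimal sketch using only the Python standard library; the function names `image_size` and `meets_basic_requirements` are hypothetical helpers, not part of any Tavus SDK. It only covers the file format and the 512×512 minimum; the remaining requirements (one person, lighting, accessories) cannot be verified this way.

```python
import struct

MIN_SIDE = 512  # minimum width and height from the requirements above

# JPEG start-of-frame markers, which carry the image dimensions
_SOF_MARKERS = set(range(0xC0, 0xD0)) - {0xC4, 0xC8, 0xCC}


def image_size(data: bytes):
    """Return (format, width, height) for PNG or JPEG bytes; raise ValueError otherwise."""
    if data[:8] == b"\x89PNG\r\n\x1a\n":
        # The first PNG chunk is IHDR: 4-byte length, b"IHDR", then width and height.
        if len(data) < 24 or data[12:16] != b"IHDR":
            raise ValueError("truncated or malformed PNG header")
        width, height = struct.unpack(">II", data[16:24])
        return "PNG", width, height
    if data[:2] == b"\xff\xd8":
        i = 2
        while i + 4 <= len(data):
            if data[i] != 0xFF:
                raise ValueError("malformed JPEG segment")
            marker = data[i + 1]
            if marker == 0xFF:  # fill byte before a marker
                i += 1
                continue
            if marker == 0x01 or 0xD0 <= marker <= 0xD9:  # markers without a length field
                i += 2
                continue
            if marker in _SOF_MARKERS:
                if i + 9 > len(data):
                    raise ValueError("truncated JPEG frame header")
                # Frame header layout: length (2), precision (1), height (2), width (2)
                height, width = struct.unpack(">HH", data[i + 5:i + 9])
                return "JPEG", width, height
            (seg_len,) = struct.unpack(">H", data[i + 2:i + 4])
            i += 2 + seg_len  # skip the marker plus the segment body
        raise ValueError("no JPEG frame header found")
    raise ValueError("unsupported format: expected JPG or PNG")


def meets_basic_requirements(data: bytes) -> bool:
    """True if the bytes are a JPG or PNG of at least MIN_SIDE x MIN_SIDE pixels."""
    try:
        _, width, height = image_size(data)
    except ValueError:
        return False
    return width >= MIN_SIDE and height >= MIN_SIDE
```

A typical use would be `meets_basic_requirements(open("headshot.png", "rb").read())` before generating the public URL. The Developer Portal validation remains the more complete check.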

Image-based training is a faster, simpler way to create a replica without recording a training video, which makes it well suited to quick prototyping or AI-generated characters.

Images will **not** work if they contain multiple people, subjects under 18, non-human characters, visible accessories (such as glasses, headphones, or jewelry), hair in front of shoulders, off-center framing, or unnatural poses such as leaning or lying down.

## AI Image Fixer

If your uploaded image doesn't fully meet the requirements above, set **`auto_fix_training_image`** to `true` when calling [Create Replica](/api-reference/phoenix-replica-model/create-replica). Tavus's AI Image Fixer instantly adjusts the uploaded image to fit the requirements, so you don't need to edit or recapture the photo.

```json theme={null}
{
  "replica_name": "my_image_replica",
  "train_image_url": "https://example.com/headshot.png",
  "voice_name": "anna",
  "auto_fix_training_image": true
}
```

## How `voice_name` works

Image-based training does not create a new voice from your source material. Instead, you must set **`voice_name`** to a **stock voice** identifier slug (for example `anna`). This selects a voice tied to an existing Tavus stock replica so the trained replica has a usable default voice.
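The request body shown in the AI Image Fixer section can be assembled programmatically. The sketch below builds the request but does not send it. Note the assumptions: the endpoint URL `https://tavusapi.com/v2/replicas` and the `x-api-key` header are not stated on this page, and `build_create_replica_request` is a hypothetical helper; confirm both against the Create Replica reference before relying on them.

```python
import json
import urllib.request

# Assumption: endpoint and auth header are not given on this page.
CREATE_REPLICA_URL = "https://tavusapi.com/v2/replicas"


def build_create_replica_request(api_key, replica_name, train_image_url,
                                 voice_name, auto_fix=False):
    """Build (but do not send) a Create Replica request for image-based training."""
    payload = {
        "replica_name": replica_name,
        "train_image_url": train_image_url,  # must be a publicly accessible URL
        "voice_name": voice_name,            # stock voice slug, for example "anna"
    }
    if auto_fix:
        # Ask the AI Image Fixer to adjust an image that misses the requirements.
        payload["auto_fix_training_image"] = True
    return urllib.request.Request(
        CREATE_REPLICA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-api-key": api_key},
        method="POST",
    )


req = build_create_replica_request(
    "YOUR_API_KEY",
    "my_image_replica",
    "https://example.com/headshot.png",
    "anna",
    auto_fix=True,
)
# To actually submit it: urllib.request.urlopen(req)
```

Keeping request construction separate from sending makes it easy to inspect the payload in tests before any training job is started.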

### Example `voice_name` values

Below are **example** `voice_name` slugs:

* `benjamin`
* `james`
* `liam`
* `anna`
* `julia`
* `ivy`

When you run **Conversational Video Interface (CVI)** sessions later, you are **not locked** into that stock voice for every conversation. You can attach a [persona](/api-reference/personas/create-persona) whose TTS layer uses an external voice (from Cartesia or ElevenLabs). See [Text-to-Speech (TTS)](/sections/conversational-video-interface/persona/tts) for how to set `external_voice_id` and related fields.

### Consent, rights, and acceptable use

By using the image training API, you **affirm that you have the rights** to use the image you supply (for example likeness and publicity rights where applicable). Tavus may **reject** images that appear to depict unauthorized or impermissible subjects.

Replica training typically takes **3–4 hours**.