Descript lets you create AI avatars that animate in sync with your speaker's voice. You can upload a photo of a human face or generate an avatar using a text prompt—no camera required. It's a flexible way to personalize your content while working within Descript's text-based editing workflow.
This article covers:
- How to create a custom avatar with an image
- How to create an AI-generated avatar from a text prompt
- How to update or change an avatar in your composition
- Best practices for using custom avatars
Create a custom avatar with an image
- Assign a speaker to your script.
- Click the speaker label and choose Assign avatar.
- Click Upload photo and select a supported image file (
.jpeg
,.png
, or.webp
). - Preview your image and click Assign avatar to apply it.
- Once your script is finalized, click Generate avatar to animate your image and sync it with the speaker.
If your photo doesn't meet formatting or safety requirements, Descript will prompt you to upload a different one.
Create a custom avatar from a text prompt
- Assign a speaker to your script.
- Click the speaker label and choose Assign avatar to [speaker name].
- In the Text prompt tab, enter a description to generate an avatar, or click the Inspire button to use a suggested prompt. For best results, describe a realistic, human-like figure.
- Descript will generate three avatar options. You can:
- Select one of the images to apply, or
- Continue iterating on your prompt. Previous generations remain visible as long as the avatar modal stays open.
- Click Assign avatar to apply your selection.
- Once your script is finalized, click Generate avatar to animate the image and sync it with the speaker.
Generation workflow and timing
- Avatar generation uses avatar minutes, not AI voice minutes.
- Minutes are calculated based on total spoken audio duration.
- You’ll see a modal showing your remaining avatar minutes and estimated usage before confirming.
- The current max length of an avatar generation is 12 minutes.
Avatar generation continues in the background if you close the project. You’ll receive an email when it’s ready.
Managing and updating custom avatars
To update a custom avatar, click the speaker label and choose Update speaker’s avatar. This updates the avatar across your project while preserving its size, position, and visibility in the scene editor.
If the avatar doesn’t appear in the scene:
- Open the Scene panel and click the "show layer" icon
- Drag the avatar layer above other visuals if needed
Or use Replace media in the scene editor to swap it back into view.
Avatar layers aren’t visible in the timeline but can be cropped, styled, or repositioned in the scene editor. You can also apply effects like greenscreen.
Best practices for using custom avatars
Whether you’re uploading a photo or generating an avatar from a text prompt, use a clear, human-like face with even lighting and a relaxed, natural expression. Avoid using images or prompts that include animals, objects, or abstract characters—they won’t animate reliably.
Head position and framing
Attribute | Best Practices |
---|---|
Framing | Use a close-up headshot with shoulders slightly visible. The subject should be centered and squared to the camera. |
Head Position | Face forward with the head upright. Avoid angled or three-quarter views that distort motion. |
Glasses, mouth, and eyes
Attribute | Best Practices |
---|---|
Glasses | Avoid heavy reflections. Eyes must be clearly visible. |
Mouth | Ensure the mouth is unobstructed so the AI can sync lip shapes accurately. |
Eyes | Eyes must be open and clearly visible. Avoid shadows or squinting that obscure the eye shape. |
Background, foreground, and lighting
Attribute | Best Practices |
---|---|
Background | Remove people or animals in the background—they won’t animate and may distract from the avatar. |
Foreground | Keep the subject unobstructed. Avoid props or objects that block the face or shoulders. |
Lighting | Use soft, even lighting to avoid harsh shadows. |
Contrast | Ensure good contrast between the subject and the background for clean separation. |
File upload requirements
Attribute | Best Practices |
---|---|
Image Content | Photos may be rejected if they contain celebrities or inappropriate content, such as nudity. |
File Types | Supported formats: JPEG, PNG, and WEBP. |
File Size | Maximum file size is 10MB. |
Aspect Ratio | Use a 16:9 aspect ratio for best compatibility with Descript’s scene editor. |