Descript lets you create AI avatars that animate in sync with your speaker’s voice. Upload your own photo, generate an avatar from a text prompt, or use an image as a base and customize it with a prompt to create a unique look.
This article covers three ways to create a custom avatar:
- Creating a custom avatar from scratch
- Turning an uploaded image into an avatar
- How to include an image with a prompt
Plus:
- How to save your avatar images
- Avatar generation workflow and timing
- How to update or change an avatar in your composition
- Best practices for using custom avatars
Usage note
On current plans, this feature uses AI Credits. Learn more about tracking your Media Minutes and AI Credits.
Legacy and Sunset plans track usage differently. See our Understanding your Legacy and Sunset plan guide for details.
Create a custom avatar with a text prompt
Create a new avatar by entering a description and Descript will generate a unique image based entirely on your prompt.
- Assign a speaker to your script.
- Click the speaker label and choose Assign avatar to [speaker name].
- Click the Create a custom avatar button.
- In the Text prompt tab, enter a description to generate an avatar, or click the Inspire button to use a suggested prompt. For best results, describe a realistic, human-like figure.
- (Optional) Choose a style for your avatar from the available options in our style library.
- Descript will generate three avatar options. You can:
- Select one of the images to apply, or
- Continue iterating on your prompt. Previous generations remain visible as long as the avatar modal stays open.
- Click Assign avatar to create the avatar and assign it to your speaker.
- Once your script is finalized, click Generate avatar to animate the image and sync it with the speech audio.
Upload a photo to create your custom avatar
Upload a photo to use as your avatar. Be sure to reference the best practices for image guidelines that will yield the best results.
- Assign a speaker to your script.
- Click the speaker label and choose Assign avatar to [speaker name].
- Click the Create a custom avatar button.
- Click the Upload an image button and select a supported image file (
.jpeg,.png, or.webp). - Click the Create button to complete the upload.
- If you just want to use the photo with no additional changes, click the Assign avatar button in the bottom right-hand corner.
If your photo doesn't meet formatting or safety requirements, Descript will prompt you to upload a different one.
Add an image to your prompt
Want to give your avatar a new look while keeping the same base image? In the preview pane on the right side of the avatar window, hover over the image and click Add image to prompt. This adds the image to your text prompt, and Descript will generate your next avatar using both the prompt and the attached image.
To see the entire workflow from start to finish, check out the tutorial video below:
Save your avatar images
To keep a copy of any generated avatar image, click the Download icon. Saved avatar images can be uploaded to use as a base for a new avatar or restyled with a new prompt.
Closing the avatar customization window will clear your avatar history. Make sure to download any avatar images you want to keep before closing the window.
Manage or update custom avatars
To update a custom avatar, click the speaker label and choose Update speaker’s avatar. This updates the avatar across your project while preserving its size, position, and visibility in the scene editor.
Best practices for using custom avatars
Whether you’re uploading a photo or generating an avatar from a text prompt, use a clear, human-like face with even lighting and a relaxed, natural expression. Avoid using images or prompts that include animals, objects, or abstract characters—they won’t animate reliably.
Head position and framing
| Attribute | Best Practices |
|---|---|
| Framing | Use a close-up headshot with shoulders slightly visible. The subject should be centered and squared to the camera. |
| Head Position | Face forward with the head upright. Avoid angled or three-quarter views that distort motion. |
Glasses, mouth, and eyes
| Attribute | Best Practices |
|---|---|
| Glasses | Avoid heavy reflections. Eyes must be clearly visible. |
| Mouth | Ensure the mouth is unobstructed so the AI can sync lip shapes accurately. |
| Eyes | Eyes must be open and clearly visible. Avoid shadows or squinting that obscure the eye shape. |
Background, foreground, and lighting
| Attribute | Best Practices |
|---|---|
| Background | Remove people or animals in the background—they won’t animate and may distract from the avatar. |
| Transparent Background | Avatars created using an image with background transparency may result in a blurred and/or poor quality background. |
| Foreground | Keep the subject unobstructed. Avoid props or objects that block the face or shoulders. |
| Lighting | Use soft, even lighting to avoid harsh shadows. |
| Contrast | Ensure good contrast between the subject and the background for clean separation. |
File upload requirements
| Attribute | Best Practices |
|---|---|
| Image Content | Photos may be rejected if they contain celebrities or inappropriate content, such as nudity. |
| File Types | Supported formats: JPEG, PNG, and WEBP. |
| File Size | Maximum file size is 10MB. |
| Aspect Ratio | For the best results when uploading images for custom avatars, use widescreen horizontal photos (16:9). Other formats, like tall portrait images (9:16) or non-standard ratios, may not display correctly. |