Use generate image and generate video to create AI-generated visuals directly in Descript. You can generate images or video from an empty scene, from AI tools, or from the Scene Editor toolbar.
This article covers generating new AI media. If you need to sync existing visuals to updated audio, see Regenerate video instead.
This article covers:
- Use cases for AI-generated images and video
- Before getting started
- How to generate an image or video
- Model-specific parameters
- Known limitations
- Troubleshooting AI image and video generation
Usage note
On current plans, this feature uses AI Credits. Learn more about tracking your Media Minutes and AI Credits.
Legacy and Sunset plans track usage differently. See our Understanding your Legacy and Sunset plan guide for details.
Use cases for AI-generated images and video
AI-generated visuals can help when you don't have existing footage, need supporting visuals, or want to quickly explore a visual direction.
Before getting started
Before you generate images or video, we recommend familiarizing yourself with scenes and the Scene Editor.
Generate an image or video
When you add a new empty scene, three options appear directly on the canvas. This is the quickest way to start generating media. You can also open the generate media pane from the AI tools panel in the right-hand sidebar.
- Add a new empty scene. There are several ways to do this:
- Create new scene boundaries by typing two backslashes
//in the script. - Place your playhead where you'd like to create a new scene boundary, and select the Split button from the transport.
- Use the + buttons between existing scenes to create a new scene.
- Create new scene boundaries by typing two backslashes
- Click Generate on the empty scene canvas.
- The generate media side pane opens. Select the Image or Video tab.
- Type a prompt describing what you want to create.
- Choose a model, aspect ratio, and duration (for video) or batch size (for images). Adjust settings as needed.
- Click Generate.
- Browse the results in the option grid, then select one to add it to your scene, or use it to continue iterating on your content.
Prefer to use Underlord?
Ask Underlord to generate media. Note that you won't be able to adjust your generative video model parameters when using Underlord.
Model-specific parameters
Each model offers different customization options. The parameters you see update automatically based on your selected model (only supported options are shown). Learn more about available generative media models.
- Attach a file (image only): add a reference file to include as part of your prompt.
- Style: Descript offers preset styles to guide the creation of your generated media.
- Aspect ratio: Choose from available sizes.
- Number/Duration: Number of generated images or duration of your generated video.
- Resolution: Set output quality.
- Audio generation: Available on select models.
- Start and End Frame: Creates smooth transitions between two images. Learn more about using Start and End Frame.
Known limitations
- Prompts don't carry over between tabs. Switching from the Image tab to the Video tab (or vice versa) starts a fresh prompt. Copy your prompt text before switching if you want to reuse it.
- Generation history is separate for each media type. Image generation history only appears in the Image tab, and video generation history only appears in the Video tab.
- Model capabilities vary. Some video models support first frame, last frame, or reference image inputs—others don't. Unsupported options appear as disabled buttons.
Troubleshooting AI image and video generation
If image or video generation fails, the error usually comes from model safety filters, temporary provider issues, or model-specific limitations. Try the steps below, then generate again.
Image generation was rejected based on content
This can happen when a generated video contains a checkered placeholder layer with a message like "Custom prompt required" or "Image generation rejected based on content." It means the image generation provider rejected the prompt Descript sent.
To replace the placeholder layer:
- Select the placeholder layer in the editor.
- Select Add Media in the hover menu.
- Choose a media source, upload a file, or generate a new image with your own prompt.
- Replace the placeholder with the new media.
The content could not be processed because it was flagged by a content checker
This error means your prompt or input image was flagged by the selected model provider's safety filters.
To fix this:
- Remove sensitive or policy-violating language from the prompt.
- Use neutral, descriptive language.
- If you're using a reference image, try a different image or remove it and generate from text only.
An internal server error occurred
This error usually means the selected model could not process the prompt, reference image, or generation request.
To fix this:
- Try generating again after a few minutes.
- Remove any reference images and generate from a text-only prompt.
- Rewrite the prompt using simpler, more descriptive language.
- Switch to a different available model. See Generative image and video models in Descript for model details.
Unable to generate video due to content restrictions
This general error can happen because of rate limiting, a temporary provider issue, or content policy restrictions.
To fix this:
- Wait a few minutes, then try again.
- If you're using AI Video Maker, simplify your script:
- Remove section headers, speaker labels, and bracketed notes.
- Delete extra spaces or special characters.
- Translate non-English scripts to English before generating.
- Rewrite your prompt with simpler, more descriptive language.
- Try generating a still image first, then use that image as a reference input when generating video.