Generate AI images and video

Use generate image and generate video to create AI-generated visuals directly in Descript. You can generate images or video from an empty scene, from AI tools, or from the Scene Editor toolbar.

Before you generate images or video, we recommend familiarizing yourself with scenes and the Scene Editor.

This article covers:

Usage note

On current plans, this feature uses AI Credits. Learn more about tracking your Media Minutes and AI Credits.

Legacy and Sunset plans track usage differently. See our Understanding your Legacy and Sunset plan guide for details.

Generate an image or video

When you add a new empty scene, three options appear directly in the Scene Editor. This is the quickest way to start generating media. You can also open the generate media pane from the AI tools panel in the right-hand sidebar.

  1. Add a new empty scene. There are several ways to do this:
    • Create new scene boundaries by typing two backslashes / / in the script.
    • Place the playhead where you’d like to create a new scene boundary, and select the Split button from the transport.
    • Use the + buttons between scene segments in the timeline to create a new scene.
  2. Three buttons will appear in the Scene Editor. Click Generate on the empty scene canvas.

    Generate button on an empty scene canvas

    Note, this specific UI flow is rolling out gradually and may not be available to all users yet. If you don’t see it in your app, check back soon.
  3. The generate media side pane opens. Select the Image or Video tab.
  4. Type a prompt describing what you want to create.
  5. Choose a model, aspect ratio, and duration (for video) or batch size (for images).
  6. Click Generate. For video, use the dropdown to select Video or Video with audio (supported by select models).
  7. Browse the results in the option grid, then select one to add it to your scene, or use it to continue iterating on your content.

Prefer to use Underlord?

You can also ask Underlord to generate video by selecting an image and typing “turn this into video” or “generate a video of [description]”. You can also specify duration, aspect ratio, and audio preferences in your prompt.

Model-specific parameters

Each model offers different customization options. The parameters you see update automatically based on your selected model (only supported options are shown). Learn more about available generative media models.

  • Attach a file (image only): add a reference file to include as part of your prompt.
  • Style: Descript offers preset styles to choose from to guide the creation of your generated media
  • Aspect Ratio: Choose from available sizes
  • Number/Duration: Number of generated images or duration of your generated video
  • Resolution: Set output quality
  • Audio Generation: Available on select models
  • Start and End Frame: Creates smooth transitions between two images. Learn more about using Start and End Frame

Known limitations

  • Prompts don't carry over between tabs. Switching from the Image tab to the Video tab (or vice versa) starts a fresh prompt. Copy your prompt text before switching if you want to reuse it.
  • Generation history is separate for each media type. Image generation history only appears in the Image tab, and video generation history only appears in the Video tab.
  • Model capabilities vary. Some video models support first frame, last frame, or reference image inputs — others don't. Unsupported options will appear as disabled buttons.

What should I use AI generated video for?

Click any link below to read a full blog post with tips and techniques.

  1. Animated title cards

  2. Looping backgrounds

  3. Social shorts

  4. Bespoke b-roll