Converting AI Speech to Editable Audio in Descript

AI Speakers in Descript is a powerful tool for generating speech from text, but there are important differences between AI voice clips and recorded audio. Converting AI speech to audio unlocks more editing options, allowing you to fine-tune the content in your project.

Why Convert AI Speech to Audio?

AI voice clips and stock voices behave differently than recorded audio in your script. For example:

By converting an AI voice clip to audio, the generated speech becomes an audio layer that behaves like recorded audio. This means you can:

  • Edit the audio in the Timeline.
  • Apply effects, trims, or crossfades.
  • Gain more precise control over playback and alignment.

How to Convert AI Speech to Audio

Convert in the Script

To convert an AI voice clip directly in your script:

  1. Click on the placeholder text of the AI voice clip.
  2. Select the ✔️ check mark to confirm and convert the text to audio.

Tip: Once converted, the audio behaves like a standard recorded audio clip in your project.

Script Conversion Screenshot

Convert in the Timeline

To convert an AI voice clip using the Timeline:

  1. Hover over the AI voice clip in the Timeline.
  2. Select Convert to Audio from the options.

If you don’t see the Timeline in your editor:

  • Click Show Timeline in the bottom-left corner of the editor to reveal it.

Timeline Conversion Screenshot

Frequently Asked Questions

What happens when I convert AI speech to audio?

The AI-generated speech is converted into an audio file, making it fully editable in the Timeline.

Can I undo the conversion?

Yes, you can undo the conversion using version history or undo shortcuts (Cmd + Z on Mac or Ctrl + Z on Windows). If you need to modify the text, delete the audio and regenerate the AI speech. Otherwise, converting to audio is a permanent change.