AI Speakers in Descript is a powerful tool for generating speech from text, but there are important differences between AI voice clips and recorded audio. Converting AI speech to audio unlocks more editing options, allowing you to fine-tune the content in your project.
Why Convert AI Speech to Audio?
AI voice clips and stock voices behave differently than recorded audio in your script. For example:
- You cannot adjust word boundaries in AI voice clips.
- You cannot edit AI voice clips directly in the Timeline.
By converting an AI voice clip to audio, the generated speech becomes an audio layer that behaves like recorded audio. This means you can:
- Edit the audio in the Timeline.
- Apply effects, trims, or crossfades.
- Gain more precise control over playback and alignment.
How to Convert AI Speech to Audio
Convert in the Script
To convert an AI voice clip directly in your script:
- Click on the placeholder text of the AI voice clip.
- Select the ✔️ check mark to confirm and convert the text to audio.
Tip: Once converted, the audio behaves like a standard recorded audio clip in your project.
Convert in the Timeline
To convert an AI voice clip using the Timeline:
- Hover over the AI voice clip in the Timeline.
- Select Convert to Audio from the options.
If you don’t see the Timeline in your editor:
- Click Show Timeline in the bottom-left corner of the editor to reveal it.
Frequently Asked Questions
What happens when I convert AI speech to audio?
The AI-generated speech is converted into an audio file, making it fully editable in the Timeline.
Can I undo the conversion?
Yes, you can undo the conversion using version history or undo shortcuts (Cmd + Z on Mac or Ctrl + Z on Windows). If you need to modify the text, delete the audio and regenerate the AI speech. Otherwise, converting to audio is a permanent change.