Troubleshooting and using AI speakers

This page will guide you through some of the most common questions and issues while using AI speakers.

AI speaker is mispronouncing a word
Sometimes, an AI speaker might mispronounce a word or phrase. We've created a guide that goes over some helpful tips on getting the pronunciation right.
Why is the AI speaker generating more words than I select?
That's normal! AI speakers are designed to generate a little more on either side to give you some editing flexibility in case words don't transition perfectly. You can use the trim tool to line things up and a cross-fade to blend them together.
When I use AI speakers, it inserts black frames into the project
Text-to-speech and Regenerate currently do not work over sequences. When used on a sequence, the video will be removed.

We're currently working on improvements to this (see here for more info). In the meantime, try the following workaround:
  1. Convert the AI voice clip into an audio layer.
  2. Select the AI audio clip and cut (Cmd + X or Ctrl + X).
  3. Use the Trim tool to restore the original audio/video in the Script track.
  4. Paste the AI audio clip as a layer above the original audio.
  5. Use the Blade icon Blade tool to split the Script track, then mute the replaced portion in the Layer panel.
  6. Adjust as needed.
Audio isn't generating for the selected text

If you're typing text and no audio generates, try the following:

  • Ensure speech generation is enabled for your speaker. If not, create a new AI Speaker.
  • Try duplicating the project and check again.
  • Try creating a new project to see if the issue persists.
The AI-generated speech is too fast or too slow
AI speech uses a predefined voice model, so cadence and pacing are consistent. Here are some tips that might help.
AI speaker doesn't match my accent
The current model is based on US English pronunciation. We're exploring broader accent support in the future—upvote this on our feedback board. In the meantime, try using the Translate feature to create a version of your voice in another language.
Unable to adjust the clip speed of AI-generated speech
To adjust the speed of AI-generated clips, you must first click Convert to audio. Once converted, you can adjust speed using the selection toolbar.
The AI-generated speech has unexpected background noise or artifacts
Unexpected audio artifacts are usually caused by issues in the source media. Try to eliminate:
  • Static or sudden loud sounds
  • Background noise (appliances, traffic, music)
  • Excessive mouth noise or breathing
My AI Speaker isn't pronouncing my (non-English) language correctly
AI speakers currently support English only. Support for additional languages is in development—share your feedback here.