Feature: Batch Subtitle Dubbing / Speech Synthesis

Supported subtitle or text formats for dubbing: srt/txt

If you have many SRT or TXT files and want to dub them all in one batch, use this feature.

Upload your SRT files or plain text and batch synthesize them into dubbing files (such as WAV or MP3) using the selected TTS engine. Fine-tune the speaking speed, volume, and pitch with precision.

  • Top large button: Drag and drop or click to import one or more SRT/TXT files.
  • Bottom large text box: Manually input text for dubbing. If you want to dub a large block of text, copy and paste it here. For SRT files, use the top button to import.
  • Subtitle language: The language of your subtitles. This option determines which characters (voices) are available.
  • Dubbing source: The default is EdgeTTS, Microsoft’s free dubbing service, which supports all languages. Other options include paid online APIs and free open-source projects that require local deployment. Choose based on your needs; some sources require an API key, which you enter in Menu → TTS Settings. See the dubbing sources documentation for full details.
  • Select voice: After choosing the subtitle language and dubbing source, select a character/voice you want to use.
  • Preview dubbing: After selecting a voice, a preview button will appear for you to listen to a sample.
  • Speed change percentage: Default is 0. A value greater than 0 increases the speaking speed by that percentage; a value less than 0 decreases it. For example, 10 means 10% faster and -10 means 10% slower.
  • Auto speed up: Different languages and voices have different speaking speeds, so the dubbing duration may not exactly match the original subtitle duration. If this checkbox is selected and a dubbed clip is longer than its subtitle's duration, the clip is sped up so that it fits within the subtitle's time range.
  • Remove silence between subtitles: There is usually a gap between two subtitles. If selected, this gap will be removed, making the audio continuous. Only effective when Auto speed up is not selected.
  • Volume+: Similar logic to Speed change percentage. A value greater than 0 increases the volume by that percentage; a value less than 0 decreases it.
  • Pitch+: Default is 0, ranging from -50 to 50. Negative values lower the pitch (a duller voice); positive values raise it (a sharper voice).
  • Output format: Default output is WAV audio. MP3 and M4A are also available.
  • Save to original location: If selected, the generated dubbing audio will be saved in the same location as the original SRT subtitle file.
  • Open output directory: Click to open the folder where the generated dubbing audio is saved.
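
The percentage settings and the Auto speed up rule described above come down to simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it is an assumption (not confirmed by this page) that the tool formats values as signed percentage strings or computes the speed-up factor in exactly this way.

```python
def signed_percent(value: int) -> str:
    """Format a percentage setting with an explicit sign, e.g. 10 -> '+10%'.

    Assumption: signed strings like this are a common way to pass rate or
    volume offsets to a TTS engine; the tool's internal format may differ.
    """
    return f"{value:+d}%"


def apply_percent(base: float, percent: float) -> float:
    """Scale a base value by a percentage offset (10 -> 10% more, -10 -> 10% less)."""
    return base * (1 + percent / 100)


def fit_factor(dub_ms: int, slot_ms: int) -> float:
    """Return the speed-up factor needed for a dubbed clip to fit its subtitle slot.

    A factor of 1.0 means the clip already fits and is left unchanged.
    Hypothetical helper mirroring the "Auto speed up" description.
    """
    if slot_ms <= 0:
        raise ValueError("subtitle slot must have a positive duration")
    return max(1.0, dub_ms / slot_ms)


if __name__ == "__main__":
    print(signed_percent(10))        # speed setting of 10
    print(signed_percent(-10))       # speed setting of -10
    print(apply_percent(1.0, 10))    # relative speed after a 10% increase
    print(fit_factor(3000, 2000))    # 3 s of dubbing in a 2 s subtitle slot
    print(fit_factor(1500, 2000))    # dubbing shorter than the slot
```

In this sketch a clip is never slowed down: when the dubbing is shorter than the subtitle slot, the factor stays at 1.0 and the remaining time is simply a gap, which is the gap that Remove silence between subtitles would close.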