
Dubbing channels convert your subtitles to speech line by line. You can also adjust the speech rate, volume, and pitch so the audio better suits your needs.
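As an illustration of how rate, volume, and pitch adjustments are often expressed, the open-source edge-tts Python package accepts them as signed strings such as "+10%" or "-5Hz". The sketch below only builds such strings. The helper names are hypothetical, and this is not necessarily how the application passes these values internally.

```python
def signed(value: int, unit: str) -> str:
    """Format an integer offset as a signed string such as '+10%' or '-5Hz'."""
    return f"{value:+d}{unit}"


def voice_adjustments(rate_pct: int = 0, volume_pct: int = 0, pitch_hz: int = 0) -> dict:
    """Build rate/volume/pitch strings (hypothetical helper, for illustration only)."""
    return {
        "rate": signed(rate_pct, "%"),      # speech rate relative to the default
        "volume": signed(volume_pct, "%"),  # loudness relative to the default
        "pitch": signed(pitch_hz, "Hz"),    # pitch offset in hertz
    }
```

For example, `voice_adjustments(10, -20, 5)` describes speech that is 10% faster, 20% quieter, and 5 Hz higher than the default.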

Friendly Tip: Some advanced dubbing services, such as OpenAI, Microsoft Azure, and ByteDance Volcano Engine, require you to provide your own API address or Secret Key (SK). Don't worry, the setup is very simple!

Click "Menu" -> "TTS Settings" at the top of the software window, find the corresponding channel, and enter your information.

We have integrated many kinds of dubbing services, from completely free to professional-grade and from cloud services to local deployment, to meet the needs of different users.

Click on the channel name to view detailed usage instructions for that channel.

I. Out-of-the-Box Type (Free & Convenient)

These services require no complex configuration and are perfect for beginners or quick production.

  • Edge-TTS (Free)
    • Introduction: A free dubbing service provided by Microsoft, with natural-sounding, high-quality voices.
    • Note: Heavy use in a short period can run into Microsoft's request frequency limits, causing requests to fail or produce silent audio segments.
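One common way to cope with rate limits like the one noted above is to retry with an increasing delay. The following is a minimal sketch, not the application's own logic: `synthesize` stands for any hypothetical function that turns text into audio bytes, and an empty result is used as a simple stand-in for a failed segment (detecting truly silent audio would require decoding the file).

```python
import time
from typing import Callable, Optional


def dub_with_retry(
    synthesize: Callable[[str], bytes],  # hypothetical: text in, audio bytes out
    text: str,
    attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bytes:
    """Call synthesize(text), retrying with exponential backoff on failure."""
    last_error: Optional[Exception] = None
    for attempt in range(attempts):
        try:
            audio = synthesize(text)
            if audio:  # treat an empty result as a failed attempt
                return audio
            last_error = RuntimeError("empty audio returned")
        except Exception as exc:  # e.g. a network or rate-limit error
            last_error = exc
        if attempt < attempts - 1:
            # Wait 1x, 2x, 4x ... the base delay before trying again.
            sleep(base_delay * 2 ** attempt)
    raise RuntimeError(f"dubbing failed after {attempts} attempts") from last_error
```

Spacing out requests this way trades speed for reliability, which is usually acceptable when dubbing a long subtitle file.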

II. Professional Cloud Services (API Key Required)

These services are usually provided by large companies, offering stable performance and a wide range of voice options, but some require payment.

  • Microsoft Azure TTS
    • Introduction: Microsoft's professional-grade speech service, offering a wide range of highly realistic voices.
    • Requirements: Activate the speech service on the Microsoft Azure official website.
  • OpenAI-TTS
    • Introduction: Provided by OpenAI, the developer of ChatGPT, with very natural-sounding voices.
    • Requirements: An OpenAI account and a Secret Key (SK).
  • ByteDance Volcano Engine TTS
    • Introduction: Provided by ByteDance (parent company of TikTok), offering authentic Chinese pronunciation and a rich selection of voices.
    • Requirements: Activate the service and create an application on the Volcano Engine official website.
  • Alibaba Qwen-TTS
    • Introduction: A speech synthesis service provided by Alibaba, built on its large model technology.
    • Requirements: Activate the "Bailian" service on Alibaba Cloud and apply for an API Key.
  • Gemini-TTS
    • Introduction: A TTS service provided by Google.
    • Requirements: Access to Google services and a Gemini API Key. Free accounts have very strict request limits and may manage to dub only a few subtitle lines.
  • Elevenlabs.io TTS
    • Introduction: From a popular company focused on AI audio technology, offering diverse and highly expressive voices.
    • Requirements: Apply for an API Key on the official website, Elevenlabs.io. Free accounts have many limitations and may not be sufficient for a complete project.
  • 302.AI Dubbing
    • Requirements: Apply for an App Key on the 302.ai official website.
  • Minimaxi Dubbing
    • Requirements: Apply for an API Key at minimaxi.com and top up your account before use.

III. Local Deployment Type (High Flexibility & Suitable for Advanced Users)

If you want more flexibility, want to clone your own voice, or prefer not to rely on third-party cloud services, local deployment is your best choice. It usually requires some technical skill and a reasonably powerful computer.

  • GPT-SoVITS
    • Introduction: A very popular open-source project that can clone a voice from just a few audio samples, with impressive results.
    • Requirements: Deploy the service on your own computer.
  • CosyVoice
    • Introduction: Another excellent local deployment option, providing high-quality speech synthesis.
    • Requirements: Deploy in a local environment.
  • ChatTTS
    • Introduction: A TTS model designed specifically for dialogue scenarios, capable of generating very natural, emotionally expressive conversational speech.
    • Requirements: Deploy its ChatTTS-UI service locally.
  • Fish-TTS
    • Requirements: Deploy in a local environment.
  • ChatterBox-TTS
    • Requirements: Deploy the service locally and use it together with the api.py file.
  • Kokoro-TTS
    • Requirements: Deploy in a local environment.
  • F5-TTS / Spark / IndexTTS / DiaTTS / VoxCPM
    • Introduction: These five channels are configured through a single shared settings entry.
    • Requirements: Deploy each one locally and enter its API address under Menu -> TTS Settings -> F5TTS.
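Before entering the API address of a locally deployed service, it can help to confirm that the service is actually listening. The sketch below is a generic connectivity check using only the Python standard library. It is not part of the application, and the function name is hypothetical.

```python
import socket
from urllib.parse import urlparse


def is_reachable(api_url: str, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to the URL's host and port succeeds."""
    try:
        parsed = urlparse(api_url)
        if not parsed.hostname:
            return False
        # Fall back to the scheme's default port when the URL omits one.
        port = parsed.port or (443 if parsed.scheme == "https" else 80)
        with socket.create_connection((parsed.hostname, port), timeout=timeout):
            return True
    except (OSError, ValueError):
        # OSError: refused or timed out; ValueError: malformed URL or port.
        return False
```

For example, `is_reachable("http://127.0.0.1:9880")` checks a service on a placeholder port; substitute the address your own deployment prints at startup. A successful connection only shows that something is listening there, not that it is the expected TTS service.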

IV. Ultimate Customization Type

  • Custom TTS-API
    • Introduction: If none of the above channels meets your needs, or if you have developed your own speech service, this option lets you connect it to the software.
    • Requirements: Some programming skill, enough to write or deploy an API interface that meets our data format requirements.
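To give a feel for what such an API interface involves, here is a minimal skeleton of a local HTTP endpoint built with Python's standard library. The required data format is not described on this page, so every field name here (`text`, `status`, `error`), the port, and the response shape are placeholders rather than the application's actual format; consult the Custom TTS-API instructions for the real requirements.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer
from typing import Tuple


def build_response(payload: dict) -> Tuple[int, dict]:
    """Validate a request and return (HTTP status, JSON body).

    All field names are hypothetical placeholders.
    """
    text = str(payload.get("text", "")).strip()
    if not text:
        return 400, {"error": "text is required"}
    # A real service would synthesize audio here and return it in
    # whatever shape the calling application expects.
    return 200, {"status": "ok", "text": text}


class TTSHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        try:
            payload = json.loads(self.rfile.read(length) or b"{}")
        except ValueError:  # malformed JSON or invalid encoding
            payload = {}
        if not isinstance(payload, dict):
            payload = {}
        status, body = build_response(payload)
        data = json.dumps(body).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(data)))
        self.end_headers()
        self.wfile.write(data)


def serve(host: str = "127.0.0.1", port: int = 8000) -> None:
    """Run the placeholder service until interrupted."""
    HTTPServer((host, port), TTSHandler).serve_forever()
```

Calling `serve()` starts the endpoint locally. Keeping validation in `build_response` separate from the HTTP handler makes the request logic easy to test without starting a server.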