Skip to content

ByteDance Volcano Speech Synthesis

Speech synthesis, also known as text-to-speech, has many excellent open-source options, such as GPT-SoVITS, ChatTTS, and the free edge-tts. Of course, there are also commercial services like ByteDance Volcano's speech synthesis service. While free options are attractive, those seeking higher quality might prefer commercial services. Especially with the advancement of large models, the price of commercial APIs is becoming increasingly affordable, making them a viable choice.

Version 2.88 and later adds ByteDance Volcano Engine's speech synthesis service, supporting 8 languages: Chinese, English, Japanese, Portuguese, Spanish, Thai, Vietnamese, and Indonesian. Chinese support also includes various dialects such as Northeastern Mandarin and Sichuanese Mandarin. It offers 20,000 free requests, enough for approximately 10 hours of synthesized speech.

Supported Chinese Voices

Only a portion of Chinese voices are shown here. View the other 7 language voices here: https://www.volcengine.com/docs/6561/97465

Many Chinese voices are supported, including various dialects and popular voices from Douyin, such as Xiao Shuai (Little Handsome) and Xiao Mei (Little Beauty) narration voices.

Voice Namevoice_type
Can Can 2.0BV700_V2_streaming
Yang YangBV705_streaming
Sunny YouthBV123_streaming
Anti-involution YouthBV120_streaming
Generic Son-in-lawBV119_streaming
Ancient Style Young LadyBV115_streaming
Powerful Middle-aged UncleBV107_streaming
Simple YouthBV100_streaming
Gentle LadyBV104_streaming
Cheerful YouthBV004_streaming
Sweet and Spoiling Young LadyBV113_streaming
Elegant YouthBV102_streaming
Sweet Little YuanBV405_streaming
Friendly Female VoiceBV007_streaming
Intellectual Female VoiceBV009_streaming
Cheng ChengBV419_streaming
Tong TongBV415_streaming
Friendly Male VoiceBV008_streaming
Dubbed Film Male VoiceBV408_streaming
Lazy Little SheepBV426_streaming
Fresh Literary Female VoiceBV428_streaming
Chicken Soup Female VoiceBV403_streaming
Wise ElderBV158_streaming
Loving GrandmaBV157_streaming
Rap BrotherBR001_streaming
Energetic Narration MaleBV410_streaming
Film Narration Xiao ShuaiBV411_streaming
Xiao Shuai Narration - Multi-emotionalBV437_streaming
Film Narration Xiao MeiBV412_streaming
Rich Young MasterBV159_streaming
Top Live StreamerBV418_streaming
Anti-involution YouthBV120_streaming
Steady Narration MaleBV142_streaming
Dashing YouthBV143_streaming

How to Activate

  1. Register, log in, and complete identity verification.

https://console.volcengine.com/

Open this address to register, log in, and complete identity verification.

  1. After entering the console, open the Speech Technology page as shown in the image below.

image.png

You can also directly access this address: https://console.volcengine.com/speech/app

Then, create an application as shown in the image below. The name and description can be anything; the key is to select "Speech Synthesis Service," then click "OK" to complete.

image.png

  1. Next, go to the speech synthesis page to activate the free trial.

Go to this address: https://console.volcengine.com/speech/service/8

Select the application you just created at the top, and click "Try it" to activate.

image.png

  1. Copy the three parameters and fill them into your video translation software.

The first is the cluster id, copy the name under cluster id as shown in the image.

image.png

The second is the App id, which can be found by scrolling down on the same page.

image.png

The third is the Access Token, located to the right of the App id. Copy it.

image.png

  1. Fill these into your video translation software. Open the Menu-TTS Settings--ByteDance Volcano Speech Synthesis window to fill them in. Save after testing and ensuring there are no problems.

image.png

Using it in Video Translation Software

After filling in and testing, select the target language in the software, then select ByteDance Volcano Speech Synthesis in the dubbing channel. You can click to preview each voice.

image.png

Select your preferred voice to start the dubbing process.

Special Note

If you have activated the formal version, only the General Male and General Female voices are available by default. Other voices need to be purchased and activated separately in the ByteDance Volcano backend.