ByteDance Volcano Speech Synthesis
Speech synthesis, also known as text-to-speech (TTS), has many excellent open-source options, such as GPT-SoVITS and ChatTTS, as well as the free edge-tts. There are also commercial services, such as ByteDance Volcano's speech synthesis service. Free options are attractive, but users who want higher quality may prefer a commercial service. With the advancement of large models, commercial API prices keep falling, which makes them a viable choice.
Starting with version 2.88, the video translation software supports ByteDance Volcano Engine's speech synthesis service. The service covers 8 languages: Chinese, English, Japanese, Portuguese, Spanish, Thai, Vietnamese, and Indonesian. Chinese support also includes dialects such as Northeastern Mandarin and Sichuanese Mandarin. The service offers 20,000 free requests, enough for approximately 10 hours of synthesized speech.
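To put the free quota in perspective, the two figures above imply an average audio length per request. The short Python calculation below is only a back-of-the-envelope estimate derived from those two numbers; the variable names are illustrative, and actual usage depends on how long each piece of text is.

```python
# Figures taken from the paragraph above.
FREE_REQUESTS = 20_000   # free requests in the trial quota
APPROX_HOURS = 10        # approximate total audio the quota is said to cover

# Average audio length per request implied by those two figures.
seconds_per_request = APPROX_HOURS * 3600 / FREE_REQUESTS

if __name__ == "__main__":
    print(f"Implied average: {seconds_per_request} seconds of audio per request")
```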
Supported Chinese Voices
Only some of the Chinese voices are listed here. For voices in the other 7 languages, see: https://www.volcengine.com/docs/6561/97465
Many Chinese voices are supported, including various dialects and popular voices from Douyin, such as Xiao Shuai (Little Handsome) and Xiao Mei (Little Beauty) narration voices.
Voice Name | voice_type |
---|---|
Can Can 2.0 | BV700_V2_streaming |
Yang Yang | BV705_streaming |
Sunny Youth | BV123_streaming |
Anti-involution Youth | BV120_streaming |
Generic Son-in-law | BV119_streaming |
Ancient Style Young Lady | BV115_streaming |
Powerful Middle-aged Uncle | BV107_streaming |
Simple Youth | BV100_streaming |
Gentle Lady | BV104_streaming |
Cheerful Youth | BV004_streaming |
Sweet and Spoiling Young Lady | BV113_streaming |
Elegant Youth | BV102_streaming |
Sweet Little Yuan | BV405_streaming |
Friendly Female Voice | BV007_streaming |
Intellectual Female Voice | BV009_streaming |
Cheng Cheng | BV419_streaming |
Tong Tong | BV415_streaming |
Friendly Male Voice | BV008_streaming |
Dubbed Film Male Voice | BV408_streaming |
Lazy Little Sheep | BV426_streaming |
Fresh Literary Female Voice | BV428_streaming |
Chicken Soup Female Voice | BV403_streaming |
Wise Elder | BV158_streaming |
Loving Grandma | BV157_streaming |
Rap Brother | BR001_streaming |
Energetic Narration Male | BV410_streaming |
Film Narration Xiao Shuai | BV411_streaming |
Xiao Shuai Narration - Multi-emotional | BV437_streaming |
Film Narration Xiao Mei | BV412_streaming |
Rich Young Master | BV159_streaming |
Top Live Streamer | BV418_streaming |
Steady Narration Male | BV142_streaming |
Dashing Youth | BV143_streaming |
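If you need to refer to these voices from a script, the table can be treated as a simple name-to-identifier lookup. The Python sketch below copies a handful of rows from the table; the `VOICES` dictionary and the `voice_type_for` helper are hypothetical names introduced here for illustration and are not part of the video translation software or the Volcano Engine SDK.

```python
# A few rows copied from the voice table above (display name -> voice_type).
VOICES = {
    "Can Can 2.0": "BV700_V2_streaming",
    "Yang Yang": "BV705_streaming",
    "Film Narration Xiao Shuai": "BV411_streaming",
    "Film Narration Xiao Mei": "BV412_streaming",
    "Rap Brother": "BR001_streaming",
}


def voice_type_for(name: str) -> str:
    """Return the voice_type for a display name, or raise ValueError if unknown."""
    try:
        return VOICES[name]
    except KeyError:
        raise ValueError(f"unknown voice name: {name!r}") from None


if __name__ == "__main__":
    print(voice_type_for("Film Narration Xiao Mei"))
```

Raising an error for an unknown name, rather than silently falling back to a default voice, makes a typo in the voice name visible immediately.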
How to Activate
- Open https://console.volcengine.com/ to register, log in, and complete identity verification.
- After entering the console, open the Speech Technology page as shown in the image below.
You can also directly access this address: https://console.volcengine.com/speech/app
Then, create an application as shown in the image below. The name and description can be anything; the key is to select "Speech Synthesis Service," then click "OK" to complete.
- Next, go to the speech synthesis page to activate the free trial.
Go to this address: https://console.volcengine.com/speech/service/8
Select the application you just created at the top, and click "Try it" to activate.
- Copy the three parameters and fill them into your video translation software.
The first is the cluster id: copy the name shown under cluster id, as shown in the image.
The second is the App id, which can be found by scrolling down on the same page.
The third is the Access Token, located to the right of the App id. Copy it.
- In the video translation software, open Menu > TTS Settings > ByteDance Volcano Speech Synthesis and enter the three values. Run the test, and save once it passes without errors.
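The three values copied in the steps above (cluster id, App id, Access Token) are what a client needs in order to call the service. As a rough sketch of how they might fit into a request, the Python below only builds the headers and JSON body and sends nothing over the network. The endpoint URL, the `Bearer;` header format, and the JSON field names are assumptions based on Volcano Engine's HTTP TTS interface and should be checked against the official documentation; `build_tts_request` and the `demo-user` uid are hypothetical names used for illustration.

```python
import json
import uuid

# Assumed endpoint of the HTTP TTS interface; verify against the official docs.
TTS_URL = "https://openspeech.bytedance.com/api/v1/tts"


def build_tts_request(appid, access_token, cluster, text,
                      voice_type="BV700_V2_streaming"):
    """Build (headers, payload) for one synthesis request without sending it."""
    headers = {
        # Assumed header format: the token follows "Bearer;" with a semicolon.
        "Authorization": f"Bearer;{access_token}",
        "Content-Type": "application/json",
    }
    payload = {
        # The three values copied from the console.
        "app": {"appid": appid, "token": access_token, "cluster": cluster},
        "user": {"uid": "demo-user"},  # hypothetical caller identifier
        # voice_type comes from the voice table on this page.
        "audio": {"voice_type": voice_type, "encoding": "mp3"},
        "request": {
            "reqid": str(uuid.uuid4()),  # unique id for each request
            "text": text,
            "text_type": "plain",
            "operation": "query",
        },
    }
    return headers, payload


if __name__ == "__main__":
    # Placeholder credentials; replace with the values from your console.
    headers, payload = build_tts_request(
        "YOUR_APP_ID", "YOUR_ACCESS_TOKEN", "YOUR_CLUSTER_ID", "Hello, world."
    )
    print("POST", TTS_URL)
    print(json.dumps(payload, indent=2))
```

Keeping request construction separate from sending makes it easy to inspect the payload before any quota is spent.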
Using it in Video Translation Software
After entering and testing the parameters, select the target language in the software, then select ByteDance Volcano Speech Synthesis as the dubbing channel. You can click to preview each voice.
Select your preferred voice to start the dubbing process.
Special Note
If you have activated the formal (non-trial) version, only the General Male and General Female voices are available by default. Other voices must be purchased and activated separately in the ByteDance Volcano console.