The content on this page originates from my open-source project release page. For the latest information, please click GitHub: https://github.com/jianchang512/stt/releases/0.0
faster-whisper Model Downloads, suitable for the stt project and the "pyvideotrans Video Translation & Dubbing" project's faster-whisper mode. For openai-whisper models, please scroll down.
small 415MBsmall Baidu Netdisksmall.en 415MB
large-v1 Baidu Netdisklarge-v1 huggingface
large-v2 huggingfacelarge-v2 Baidu Netdisk
large-v3 huggingfacelarge-v3 Baidu Netdisk
distil-whisper-medium.en 671MBdistil-medium Baidu Netdisk
distil-whisper-large-v2 1.27Gdistil-large-v2 Baidu Netdisk
distil-whisper-large-v3 1.3Gdistil-whisper-large-v3 Baidu Netdisk
After downloading, extract the archive. Copy the "models--Systran--faster-xx" folder from inside the archive to the
modelsdirectory. After extraction and copying, the folder list under themodelsdirectory should look like this:Archive contents
Correct folder list under the
modelsdirectory after placement
openai-whisper Model Downloads, only for the "pyvideotrans Video Translation & Dubbing Software" openai-whisper mode model download and use.
After downloading, place the .pt file into the models folder under the software directory.
FunASR Chinese Model Download
Baidu Netdisk Download (includes speech recognition, punctuation restoration, and noise reduction models): https://pan.baidu.com/s/1v5wagiid6-K7GX9Pif4reA?pwd=y2ef
Huggingface (Download address outside the firewall): https://huggingface.co/spaces/mortimerme/s4/resolve/main/FunASR-Chinese-models.7z?download=true
After downloading and extracting, you will see three folders:
iic,damo, and.__temp. Copy them to themodels/hubfolder of the video translation software, overwriting if necessary.
cuBLASxx.dll and cudnn Download
If you encounter "cublasxxx.dll not found" or the software crashes after enabling CUDA acceleration, please download the corresponding file. Then copy the dll files inside to the C:/Windows/System32 directory OR to the software root directory (where the exe is located).
Open a command prompt by typing cmd in any folder's address bar, then enter the command nvcc -V to check your current CUDA version.
For CUDA 11.x version, click here to download: https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA11_win_v4.7z
For CUDA 12.x version, click here to download: https://github.com/jianchang512/stt/releases/download/0.0/cuBLAS.and.cuDNN_CUDA12_win_v1.7z
uvr5 Model Download
After downloading, extract the archive. You will get a uvr5_weights folder. Copy this folder to the root directory of the video translation & dubbing software.
ffmpeg.exe Download
If you are on Windows and get a prompt saying the
ffmpegcommand is not found, you can download the following two files and place them in the software root directory or in theffmpegfolder under the software root directory.
https://github.com/jianchang512/stt/releases/download/0.0/ffmpeg.exe
https://github.com/jianchang512/stt/releases/download/0.0/ffprobe.exe
