v3.95 Changelog
- Added secondary recognition in the video translation feature. It can be seen by clicking
Set More Parameterson the main interface. - Added
VAD Type Switchunder Menu -> Tools -> Advanced Options. Default isten-vad, which is faster and more accurate. Switch back tosileroif issues are encountered. - Enabled batch inference by default for faster-whisper to increase speed. Modify
faster-whisper Batch Sizeto 1 in Advanced Options to switch to non-batch mode, which is slower but slightly improves accuracy. - Optimized model auto-download: first test
huggingface.co, then fall back to the domestic mirrorhf-mirror.comif it fails. - Added several custom options in Advanced Options to control hallucination issues.
- Added
Transcript Alignmentfeature, simply implemented based on thefaster-whisper local modeland diff algorithm. - Added
supertonicdubbing channel, supporting English, Korean, Spanish, Portuguese, and French. It's compact and fast. - Switched audio denoising to use modelscope.cn by default. Processing time will increase significantly, but denoising effect is better.
- Speaker diarization now supports: built-in Chinese/English model, pyannote3.0 model, Alibaba cam++ Chinese/English model, and reverb-diarization-v1 model. Can be switched in Advanced Options. Default uses the built-in Chinese/English model.
- During single video translation, when the subtitle editing interface pops up, you can open the subtitle folder, then manually or via third-party tools modify the generated subtitle file. Clicking
Continue Without Savingin that interface will then use the modified subtitle file, making it convenient for manual adjustments. - Added support for multi-GPU concurrency to speed up processing. Can be enabled by selecting
Multi-GPU Modein Advanced Options. - Heavy tasks now use a multi-process approach to better utilize multi-core CPUs. Can be sped up by setting a higher number in
Max Process Countin Advanced Options. - Advanced Options -> Subtitle Translation: Added
Include Full Subtitlecheckbox. When selected and using an AI translation channel, the original subtitle will also be sent to the AI, potentially achieving better translation quality. - Added command-line mode. For specific usage, see CLI Command Line.
- Optimized the default translation prompt.
- Added support for the open-source qwen-tts model.
