Important:
Running this directly requires a lot of configuration. The recommended way is to use the UI application, which downloads this automatically: https://github.com/Sharrnah/whispering-ui
Standalone Release File (3.1 GB):
Download Server:
Changelog (v1.3.15.1)
- [FEATURE] Update F5-TTS + add new F5-TTS language models (English, Chinese, French, German, Italian, Japanese, Spanish, Russian, Vietnamese, Malaysian)
- [FEATURE] Add option to change TTS volume
- [BUGFIX] Fix order of model loading
- [BUGFIX] Use configured F5-TTS compute device
- [BUGFIX] Fix BigVGAN loading for F5-TTS
- [BUGFIX] Fix module import order
Full Changelog: v1.3.14.8...v1.3.15.1