Skip to content

Latest commit

 

History

History
95 lines (64 loc) · 3.2 KB

README.md

File metadata and controls

95 lines (64 loc) · 3.2 KB

ComfyUI node to make text to speech audio with your own voice.

Using F5-TTS https://github.com/SWivid/F5-TTS

Instructions

  • Put in ComfyUI's "input" folder a .wav file of an audio of the voice you'd like to use, remove any background music, noise.
  • And a .txt file of the same name with what was said.
  • Press refresh to see it in the node

You can use the examples here...

Other languages / custom models...

You can put the model & vocab txt files into "models/checkpoints/F5-TTS" folder if you have any more models. Name the .txt vocab file and the .pt model file the same names. Press "refresh" and it should appear under the "model" selection.

Custom F5-TTS languages on huggingface

I haven't tried these... Finnish French German Greek Hindi Hungarian Italian Japanese Malaysian Norwegian Polish Portuguese BR Russian Spanish Turkish Vietnamese

Multi voices...

Put your sample voice files into the "input" folder like...

voice.wav
voice.txt
voice.deep.wav
voice.deep.txt
voice.chipmunk.wav
voice.chipmunk.txt

Then you can use prompts for different voices...

{main} Hello World this is the end
{deep} This is the narrator
{chipmunk} Please, I need more helium

BigVGAN models.

To use BigVGAN, you have to add a little dot to make it work with ComfyUI...

In the file custom_nodes/ComfyUI-F5-TTS/F5-TTS/src/third_party/BigVGAN/bigvgan.py

Add a little dot on the line at the top that says...

from utils import init_weights, get_padding

so it's becomes...

from .utils import init_weights, get_padding

Tips...

Install from git

It's best to install from ComfyUI-manager because it will update all your custom_nodes when you click "update all". With git, you will have to update manually.

Clone this repository into custom_nodes and run this to install from git

cd custom_nodes/ComfyUI-F5-TTS
git submodule update --init --recursive
pip install -r requirements.txt