From 92f71e203b3d31baa3fa49e404ccfae36428498f Mon Sep 17 00:00:00 2001
From: Sam Khoze <68170403+SamKhoze@users.noreply.github.com>
Date: Tue, 18 Jun 2024 12:43:52 -0700
Subject: [PATCH] Update README.md

---
 README.md | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index c3c4da8..cfa20a9 100644
--- a/README.md
+++ b/README.md
@@ -190,13 +190,14 @@ This node Swaps, Enhances, and Restores faces from, video, and image. or higher
 ----
 ### DeepFuze_TTS Node (Voice Cloning) 🎙️
 
+![DeepFuze TTS_Node](https://github.com/SamKhoze/ComfyUI-DeepFuze/blob/main/images/DeepFuze_TTS_Node.jpg)
+
 **Languages:**
 
 **DeepFuze_TTS voice cloning supports 17 languages: English (en), Spanish (es), French (fr), German (de), Italian (it), Portuguese (pt), Polish (pl), Turkish (tr), Russian (ru), Dutch (nl), Czech (cs), Arabic (ar), Chinese (zh-cn), Japanese (ja), Hungarian (hu), Korean (ko) Hindi (hi).**
 
-This node is used to clone any voice from typed input. The audio file should be 10-15 seconds long for better results and should not have much noise.
+This node is used to clone any voice from typed input. The audio file should be 10-15 seconds long for better results and should not have much noise. To avoid any sample rate error, load MP3 audio and only work with [AudioScheduler](https://github.com/a1lazydog/ComfyUI-AudioScheduler) node. We are working on developing a converter node to solve this issue.
 
-![DeepFuze TTS_Node](https://github.com/SamKhoze/ComfyUI-DeepFuze/blob/main/images/DeepFuze_TTS_Node.jpg)
 **Input Types:**
 - `audio`: An instance of loaded audio data.
 - `text`: Text to generate the cloned voice audio.