From 7c778bdca143cf82786c22568236364dc277e5d0 Mon Sep 17 00:00:00 2001 From: alexlnkp <79400603+alexlnkp@users.noreply.github.com> Date: Fri, 21 Jul 2023 20:59:59 +0700 Subject: [PATCH] Update README.md --- README.md | 79 +++++++++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 79 insertions(+) diff --git a/README.md b/README.md index 4a57ed3..1457ad3 100644 --- a/README.md +++ b/README.md @@ -1,3 +1,82 @@ +# Features: +- Experimental Formant Shift using StftPitchShift(tried using praat with praatio but to no avail) +- Auto-detect Index path for models selected + Auto-detect paths, no more default values like this: `E:\codes\py39\vits_vc_gpu_train\logs\mi-test-1key\total_fea.npy`, We're getting Root Dir and subfolders using +```python +os.path.abspath(os.getcwd()) +``` +- Audio files dropdown by autodetecting files in `/audios/` folder +- More stable version of Gradio(3.34.0) with support of Themes +- Removed `One Click Training` button in `Training` Tab, since it was glitchy and has confused a lot of users. +- Changed default training settings to be more optimal for newer users. +- Auto-open TensorBoard localhost URL when `tensor-launch.py` is executed +- RMVPE implemented in both inferencing and training (the one in `Training` tab doesn't work properly though, requires some additional work to do) + +# Known bugs: + +- `Unload voice to save GPU memory` button causes a Traceback: + +```python +Traceback (most recent call last): + File "D:\RVC-beta-v2-0618\runtime\lib\site-packages\gradio\routes.py", line 437, in run_predict + output = await app.get_blocks().process_api( + File "D:\RVC-beta-v2-0618\runtime\lib\site-packages\gradio\blocks.py", line 1349, in process_api + data = self.postprocess_data(fn_index, result["prediction"], state) + File "D:\RVC-beta-v2-0618\runtime\lib\site-packages\gradio\blocks.py", line 1252, in postprocess_data + self.validate_outputs(fn_index, predictions) # type: ignore + File "D:\RVC-beta-v2-0618\runtime\lib\site-packages\gradio\blocks.py", line 1227, in validate_outputs + raise ValueError( +ValueError: An event handler (get_vc) didn't receive enough output values (needed: 3, received: 1). +Wanted outputs: + [slider, slider, slider] +Received outputs: + [{'visible': False, '__type__': 'update'}] +``` + +## Installation: + +1. Simply either extract directly or use git clone + +2. Run `installstft.bat`. It'll automatically: + - Upgrade/Downgrade Gradio if its version isn't 3.34.0; + - Install `rmvpe.pt` if it hasn't been already installed; + - Install `StftPitchShift` if it hasn't been already installed; + + + +3. Done! You're good to go and use the RVC-WebUI Tweaked by me for you to use :) + +## Change Gradio Theme: + +- [OPTIONAL] Change Gradio's theme: + 1. Open `infer-web.py` in any code/text editing software (e.g. `notepad++`, `notepad`, `vscode`, etc) + + 2a. Press Ctrl+F and search for `with gr.Blocks(`, select the one that's not fully commented + + 2b. Go to line `1842`, you'll see the `with gr.Blocks(theme='HaleyCH/HaleyCH_Theme') as app:` + + 3. Go to [Gradio Theme Gallery](https://huggingface.co/spaces/gradio/theme-gallery): + + 3.1 Select any theme you like (e.g. [this one](https://huggingface.co/spaces/freddyaboulton/dracula_revamped)) + + 3.2 Look at the top of the page + + ![image](https://github.com/alexlnkp/Mangio-RVC-Tweaks/assets/79400603/59e3e6a9-bdda-4ede-8161-00ee957c1715) + + 3.3 Copy theme variable(in this case, it's `theme='freddyaboulton/dracula_revamped'`) + + 4. Replace `theme='HaleyCH/HaleyCH_Theme'` in `infer-web.py` with any value of a theme from [Gradio Theme Gallery](https://huggingface.co/spaces/gradio/theme-gallery) + +### Current Todo-list: + +- [ ] Fix `Unload voice to save GPU memory` button Traceback +- [ ] Add Accordions so people with Firefox browser get a much more compact GUI rather than [This](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/assets/79400603/67e0cc08-82a2-4dc3-86cf-e23d1dcad9f8). +- [ ] Fix weird way Median Filtering value inputted in a slider is utilized +- [ ] Replace regular refresh buttons with these tiny ones from [AUTOMATIC'S1111 Stable DIffusion](https://github.com/AUTOMATIC1111/stable-diffusion-webui) +![image](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/assets/79400603/fe37051e-4c95-4d30-9254-87d44436bb9e) +- [ ] Add a way to change the Gradio's theme from WebUI itself, like in [AUTOMATIC'S1111 Stable DIffusion](https://github.com/AUTOMATIC1111/stable-diffusion-webui) +![image](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/assets/79400603/7b93e167-564a-41d3-9a76-ded20063cdc3) +- [ ] Implement Praat in the GUI for f0 curve file manipulation and easier usage +

Mangio-RVC-Fork with v2 Support! 💻

A fork of an easy-to-use SVC framework based on VITS with top1 retrieval 💯. In general, this fork provides a CLI interface in addition. And also gives you more f0 methods to use, as well as a personlized 'hybrid' f0 estimation method using nanmedian.