From fe608ae6d3b84b38a68ab14ae542225b192a3e64 Mon Sep 17 00:00:00 2001
From: Mangio621
Date: Fri, 5 May 2023 00:54:08 +1000
Subject: [PATCH] In the processes of debugging extract_f0_feature. Updated readme and makefile instructions.
---
 Makefile     |  4 ++--
 README.md    | 42 ++++++++++++++++++++++++++++++------------
 infer-web.py |  2 ++
 3 files changed, 34 insertions(+), 14 deletions(-)

diff --git a/Makefile b/Makefile
index 0927cac..4a44d2a 100644
--- a/Makefile
+++ b/Makefile
@@ -4,7 +4,7 @@ help: ## Show this help and exit
 	@grep -hE '^[A-Za-z0-9_ \-]*?:.*##.*$$' $(MAKEFILE_LIST) | sort | awk 'BEGIN {FS = ":.*?## "}; {printf "\033[36m%-30s\033[0m %s\n", $$1, $$2}'
 
 
-install: ## Install dependencies
+install: ## Install dependencies (Do everytime you start up a paperspace machine)
 	apt-get -y install build-essential python3-dev ffmpeg
 	pip install --upgrade setuptools wheel
 	pip install --upgrade pip
@@ -14,7 +14,7 @@ install: ## Install dependencies
 	apt-get update
 	apt -y install -qq aria2
 
-base: ## Download base files
+base: ## Download base files (Do only once after cloning the fork)
 	mkdir -p pretrained uvr5_weights
 	git pull
 	aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/pretrained/D32k.pth -d pretrained -o D32k.pth
diff --git a/README.md b/README.md
index 85ac6a7..67cb201 100644
--- a/README.md
+++ b/README.md
@@ -1,10 +1,10 @@
-

Mangio-RVC-Fork (Retrieval-based-Voice-Conversion)

-A fork of an easy-to-use SVC framework based on VITS with top1 retrieval.

+

Mangio-RVC-Fork (Retrieval-based-Voice-Conversion) 💻

+A fork of an easy-to-use SVC framework based on VITS with top1 retrieval 💯.

-> Please support the original [RVC repository](https://www.bilibili.com/video/BV1pm4y1z7Gm/). Without it, obviously this fork wouldn't have been possible. The Mangio-RVC-Fork aims to essentially enhance the features that the original RVC repo has in my own way. Please note that this fork is NOT STABLE and was forked with the intention of experimentation. Do not use this Fork thinking it is a "better" version of the original repo. Think of it more like another "version" of the original repo. Please note that this doesn't have a google colab. If you want to use google colab, go to the original repository.
+> 💓 Please support the original [RVC repository](https://www.bilibili.com/video/BV1pm4y1z7Gm/). Without it, obviously this fork wouldn't have been possible. The Mangio-RVC-Fork aims to essentially enhance the features that the original RVC repo has in my own way. Please note that this fork is NOT STABLE and was forked with the intention of experimentation. Do not use this Fork thinking it is a "better" version of the original repo. Think of it more like another "version" of the original repo. Please note that this doesn't have a google colab. If you want to use google colab, go to the original repository.
 
 [![madewithlove](https://forthebadge.com/images/badges/built-with-love.svg)](https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI)
@@ -27,13 +27,16 @@ A fork of an easy-to-use SVC framework based on VITS with top1 retrieval.
 The dataset for the pre-training model uses nearly 50 hours of high quality VCTK open source dataset.
 
 > High quality licensed song datasets will be added to training-set one after another for your use, without worrying about copyright infringement.
-# Summary
-## Features that this fork (Mangio-RVC-Fork) includes that the original repo doesn't:
+# Summary 📘
+## Features that this fork (Mangio-RVC-Fork) includes that the original repo doesn't ☑️
 + Local inference with the conv2d 'Half' exception fix. apply the argument --use_gfloat to infer-web.py to use this fix.
 + f0 Inference algorithm overhaul:
   + Added pyworld dio f0 method.
   + Added torchcrepe crepe f0 method. (Increases pitch accuracy and stability ALOT)
   + Modifiable crepe_hop_length for the crepe algorithm via the web_gui
++ Paperspace integration
+  + Paperspace argument on infer-web.py (--paperspace) that shares a gradio link
+  + Make file for paperspace users
 
 ## This repository has the following features too:
 + Reduce tone leakage by replacing source feature to training-set feature using top1 retrieval;
@@ -43,24 +46,36 @@ A fork of an easy-to-use SVC framework based on VITS with top1 retrieval.