
Mangio-RVC-Fork (Retrieval-based-Voice-Conversion)

A fork of an easy-to-use SVC framework based on VITS with top1 retrieval.

Please support the original RVC repository; without it, this fork would not have been possible. Mangio-RVC-Fork aims to extend the features of the original RVC repo in my own way. Please note that this fork is NOT STABLE and was created for experimentation. Do not treat it as a "better" version of the original repo; think of it as an alternative version. This fork does not have a Google Colab notebook. If you want to use Google Colab, go to the original repository.


The original RVC Demo Video here!

Realtime Voice Conversion Software using RVC : w-okada/voice-changer

The pre-trained model was trained on nearly 50 hours of the high-quality, open-source VCTK dataset.

High-quality licensed song datasets will be added to the training set over time, so that you can use them without worrying about copyright infringement.

Summary

Features that this fork (Mangio-RVC-Fork) includes that the original repo doesn't:

  • Local inference with the conv2d 'Half' exception fix. Pass the --use_gfloat argument to infer-web.py to enable this fix.
  • f0 inference algorithm overhaul:
    • Added the pyworld dio f0 method.
    • Added the torchcrepe crepe f0 method, which improves pitch accuracy and stability a lot.
    • crepe_hop_length for the crepe algorithm can be adjusted via the web GUI.
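
To give a feel for what crepe_hop_length controls, the sketch below converts a hop length in samples into the spacing between consecutive pitch estimates. The function names, the 16 kHz default sample rate, and the example hop values are assumptions made for illustration; they are not taken from this fork's code.

```python
def hop_to_milliseconds(hop_length: int, sample_rate: int = 16000) -> float:
    """Time between consecutive pitch estimates, in milliseconds.

    The 16 kHz default is an assumption for illustration only.
    """
    if hop_length <= 0 or sample_rate <= 0:
        raise ValueError("hop_length and sample_rate must be positive")
    return 1000.0 * hop_length / sample_rate


def estimates_per_second(hop_length: int, sample_rate: int = 16000) -> float:
    """How many pitch estimates are produced per second of audio."""
    if hop_length <= 0 or sample_rate <= 0:
        raise ValueError("hop_length and sample_rate must be positive")
    return sample_rate / hop_length
```

Under these assumptions, a smaller hop length gives a denser pitch track (more estimates to compute per second of audio), while a larger hop length gives a coarser track that is cheaper to compute.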

This repository has the following features too:

  • Reduces tone leakage by replacing source features with training-set features using top-1 retrieval;
  • Easy and fast training, even on relatively weak graphics cards;
  • Relatively good results even with a small amount of training data (at least 10 minutes of low-noise speech recommended);
  • Model fusion to change timbres (via the ckpt processing tab -> ckpt merge);
  • Easy-to-use web UI;
  • Fast separation of vocals and instruments using the UVR5 model.
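
The top-1 retrieval idea in the first item of the list above can be sketched as a nearest-neighbour lookup: each source feature frame is replaced by the closest frame from the training set. This is a minimal brute-force illustration in pure Python. The function names and the use of squared Euclidean distance are assumptions; the project lists faiss as a dependency, so the real search is presumably indexed rather than a plain loop.

```python
from typing import List, Sequence


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return sum((x - y) ** 2 for x, y in zip(a, b))


def top1_replace(
    source_frames: Sequence[Sequence[float]],
    training_frames: Sequence[Sequence[float]],
) -> List[List[float]]:
    """Replace every source frame with its single nearest training frame.

    Hypothetical helper: a brute-force stand-in for an indexed search.
    """
    if not training_frames:
        raise ValueError("training_frames must not be empty")
    replaced = []
    for frame in source_frames:
        # top-1 retrieval: keep only the closest training-set frame
        nearest = min(training_frames, key=lambda t: squared_distance(frame, t))
        replaced.append(list(nearest))
    return replaced
```

For example, with training frames `[[0.0, 0.0], [1.0, 1.0]]`, the source frame `[0.1, -0.1]` is replaced by `[0.0, 0.0]`, so the output carries only features that occurred in the training set.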

Features planned to be added during the fork's development

  • f0 crepe pitch estimation on training.
  • Improved GUI (More convenience).
  • Makefile for Paperspace users.
  • Google Colab notebook for this fork.

Installing the Dependencies

Using pip (Python 3.9.8 is stable with this fork)

Notice: faiss 1.7.2 raises Segmentation Fault: 11 on macOS. If you install faiss manually with pip, use pip install faiss-cpu==1.7.0 instead.

pip install -r requirements.txt

Preparation of other Pre-models

RVC requires several other pre-models for inference and training.

You need to download them from our Huggingface space.

Here is a list of the pre-models and other files that RVC needs:

hubert_base.pt

./pretrained 

./uvr5_weights

# If you are using Windows, you may also need this file; skip it if FFmpeg is already installed
ffmpeg.exe
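
Before starting the web UI, it can help to confirm that the items listed above are in place. The following is a minimal sketch, assuming they live directly under the repository root exactly as listed. The helper names are hypothetical and not part of this repository, and the check only tests that the paths exist, not that their contents are complete.

```python
import shutil
from pathlib import Path

# Names taken from the list above; their location under the repo root is an assumption.
REQUIRED_FILES = ["hubert_base.pt"]
REQUIRED_DIRS = ["pretrained", "uvr5_weights"]


def missing_premodels(root: str = ".") -> list:
    """Return the required files and directories that are absent under root."""
    base = Path(root)
    missing = [name for name in REQUIRED_FILES if not (base / name).is_file()]
    missing += [name + "/" for name in REQUIRED_DIRS if not (base / name).is_dir()]
    return missing


def ffmpeg_available(root: str = ".") -> bool:
    """True if ffmpeg is on PATH or an ffmpeg.exe sits under root."""
    return shutil.which("ffmpeg") is not None or (Path(root) / "ffmpeg.exe").is_file()
```

Calling `missing_premodels()` from the repository root returns an empty list when everything is present; otherwise it names what still needs to be downloaded.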

Then use this command to start Webui:

python infer-web.py

If you are using Windows, you can instead download and extract RVC-beta.7z to use RVC directly, then run go-web.bat to start the web UI.

There's also a tutorial on RVC in Chinese and you can check it out if needed.

Credits

Thanks to all contributors for their efforts

Description
*CREPE+HYBRID TRAINING* A very experimental fork of the Retrieval-based-Voice-Conversion-WebUI repo that incorporates a variety of other f0 methods, along with a hybrid f0 nanmedian method.
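
One plausible reading of a "hybrid f0 nanmedian method" is a per-frame median over the pitch tracks produced by several f0 estimators, ignoring frames where an estimator returned NaN. The sketch below illustrates that reading in pure Python. The function name and the NaN-for-missing convention are assumptions, and this is not taken from the fork's implementation.

```python
import math
from statistics import median
from typing import List, Sequence


def nanmedian_f0(tracks: Sequence[Sequence[float]]) -> List[float]:
    """Combine equal-length f0 tracks (in Hz) by a per-frame median that skips NaN.

    A frame where every estimator returned NaN stays NaN in the output.
    """
    if not tracks:
        raise ValueError("at least one f0 track is required")
    length = len(tracks[0])
    if any(len(track) != length for track in tracks):
        raise ValueError("all f0 tracks must have the same number of frames")
    combined = []
    for frame_values in zip(*tracks):
        valid = [v for v in frame_values if not math.isnan(v)]
        combined.append(median(valid) if valid else math.nan)
    return combined
```

Taking a median rather than a mean means that a single estimator's outlier (for example, an octave error) does not pull the combined value as long as the other estimators agree.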