Files
voice-cloning-collab/README.md
2022-09-05 17:13:05 +08:00

939 B
Raw Blame History

voice-cloning

Version updates

2022.05.19 I calculated GE2E loss in encoder with cuda rather than originally-configured CPU. It speeds up the encoder training speed.
2022.07.15 I added Loss animation plot for synthesizer and vocoder.
2022.07.19 I added response time and Griffin-Lim vocoder results for demo_toolbox.
2022.07.29 I added model validation for encoder, synthesizer and vocoder.
2022.08.02 I added voxceleb train and dev data for encoder. I added noise reduce method for the output wav from vocoder.
noisereduce reference: https://github.com/timsainb/noisereduce
2022.08.06 I split the long text into short sentences using spacy for input of synthesizer. Make sure to install English dataset en_core_web_sm, say by python -m spacy download en_core_web_sm
2022.09.02 I set prop_decrease=0.6 for male and 0.9 for female in noisereduce function