Commit Graph

3503 Commits

Author SHA1 Message Date
Eren Gölge
505e2db6aa Fix VCTK Tacotron2-DDC recipe 2021-10-30 14:47:49 +02:00
Eren Gölge
9e2befb55c Add vctk tacotron2 recipe 2021-10-30 14:47:35 +02:00
Eren Gölge
7293abada2 Bump up to v0.4.2 2021-10-29 17:57:30 +02:00
Eren Gölge
2df0752e73 Model zoo tests (#900)
* Fix VITS model multi-speaker init

* Remove gdrive support in model manager

* Add model zoo tests
2021-10-29 17:54:16 +02:00
Eren Gölge
aaaa591485 Bump up version to v0.4.1 2021-10-26 19:24:17 +02:00
Eren Gölge
3ea1c2037b Fix model entry in .models.json 2021-10-26 19:14:29 +02:00
Eren Gölge
fa4ec83c6e Bump up version to v0.4.0 (tag: v0.4.0) 2021-10-26 18:27:39 +02:00
Eren Gölge
3c99191e50 Merge pull request #888 from coqui-ai/dev
v0.4.0
2021-10-26 18:25:28 +02:00
Eren Gölge
ff88c72caa Update CONTRIBUTING.md
Fix testing command
2021-10-26 17:52:09 +02:00
Eren Gölge
035ed432bc Doc update (#889)
* Link source files from the docs

* Update glowTTS recipes for docs

* Add dataset downloaders
2021-10-26 17:41:33 +02:00
Eren Gölge
0cac3f330a Enable custom formatter in load_tts_samples 2021-10-26 13:07:11 +02:00
Eren Gölge
7c10574931 Gateway for TTS models 2021-10-26 13:04:51 +02:00
Eren Gölge
00becf2671 Fix import statements 2021-10-25 19:29:16 +02:00
Eren Gölge
4ba42acad7 Update .gitignore 2021-10-25 19:29:16 +02:00
Eren Gölge
027424dda8 Add VCTK fast_pitch and UK glow-tts 2021-10-25 19:29:16 +02:00
Eren Gölge
bdab788de3 Fix ljspeech download 2021-10-25 11:33:51 +02:00
Eren Gölge
44d47fd4cf Merge branch 'trainer_v1' into dev 2021-10-21 18:09:09 +00:00
Eren Gölge
7223e27e0a Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev 2021-10-21 18:08:55 +00:00
Eren Gölge
1e9a97560b Merge pull request #887 from coqui-ai/vctk_recipes
VCTK Recipes
2021-10-21 19:54:44 +02:00
Eren Gölge
a107bda5ec Merge pull request #831 from noranraskin/patch-1
Fix --list_models command in finetuning.md
2021-10-21 19:49:47 +02:00
Eren Gölge
25759d6a61 Split tests 2021-10-21 17:30:15 +00:00
Eren Gölge
d9c291b06c Update .gitignore 2021-10-21 16:29:06 +00:00
Eren Gölge
9e483fb4f0 Update ljspeech download 2021-10-21 16:29:06 +00:00
Eren Gölge
8cca3987aa Update documentation 2021-10-21 16:29:06 +00:00
Eren Gölge
016803beee Update notebooks 2021-10-21 16:29:06 +00:00
Eren Gölge
5e0d0539c5 Remove unmaintained notebooks 2021-10-21 16:29:06 +00:00
Eren Gölge
71180c7962 VCTK recipes (finally 🚀) 2021-10-21 16:29:06 +00:00
Eren Gölge
70e4d0e524 Fix grad_norm handling 2021-10-21 16:29:06 +00:00
Eren Gölge
a409e0f8f8 Update train_tts for multi-speaker 2021-10-21 16:29:06 +00:00
Eren Gölge
2b7d159383 Update BaseTTS for multi-speaker training 2021-10-21 16:29:06 +00:00
Eren Gölge
e62d3c5cf7 Use absolute imports for tts configs and models 2021-10-21 16:29:06 +00:00
Eren Gölge
82fed4add2 Make style 2021-10-21 16:05:51 +00:00
Eren Gölge
3cb07fb6b5 Fix SpeakerManager init with data items 2021-10-21 13:54:39 +00:00
Eren Gölge
aea90e2501 Comment synthesis.py 2021-10-21 13:53:45 +00:00
Eren Gölge
1987aaaaed Update d-vector reshape in synthesizer 2021-10-21 13:53:25 +00:00
Eren Gölge
3ab009ca8d Edit model configs for multi-speaker 2021-10-21 13:51:37 +00:00
Eren Gölge
cea8e1739b Update AlignTTS to use SpeakerManager 2021-10-20 18:22:41 +00:00
Eren Gölge
0e768dd4c5 Update comments 2021-10-20 18:21:26 +00:00
Eren Gölge
7c2cb7cc30 Update BaseTTS 2021-10-20 18:18:22 +00:00
Eren Gölge
330ee7d208 Comment BaseTacotron and remove unused funcs 2021-10-20 18:17:25 +00:00
Eren Gölge
aa25f70b95 Update ForwardTTS for multi-speaker 2021-10-20 18:16:41 +00:00
Eren Gölge
0ebc2a400e Implement _set_speaker_embedding in GlowTTS 2021-10-20 18:15:20 +00:00
Eren Gölge
3da79a4de4 Comment Tacotron2 model 2021-10-20 18:14:04 +00:00
Eren Gölge
92b6d98443 Set pitch frame alignment wrt spec computation 2021-10-20 18:12:38 +00:00
Eren Gölge
0a3d1cc7ee Pass speaker manager to the model in synthesizer 2021-10-20 18:11:36 +00:00
Eren Gölge
588da1a24e Simplify grad_norm handling in trainer 2021-10-19 16:33:04 +00:00
Eren Gölge
3c7848e9b1 Don't OOR values in train console log 2021-10-19 16:32:16 +00:00
Eren Gölge
c514351c0e Refactor multi-speaker init in BaseTTS-Tacotron1-2 2021-10-18 08:55:45 +00:00
Eren Gölge
127571423c Update multi-speaker init in BaseTTS 2021-10-18 08:54:41 +00:00
Eren Gölge
a0a5d580e9 Approximate audio length from file size 2021-10-18 08:54:02 +00:00