Commit Graph

3460 Commits

Author SHA1 Message Date
Eren Gölge
2447f42ca1 Implement tarring datasets 2021-10-01 16:29:45 +00:00
Eren Gölge
42f77e7185 Update librispeech deepspeech recipe 2021-10-01 13:34:00 +00:00
Eren Gölge
3aaf6a28e9 Update .gitignore 2021-10-01 12:42:46 +00:00
Eren Gölge
e50f924003 Update env vars in CI 2021-10-01 12:42:46 +00:00
Eren Gölge
8b4a37d8cb Add DeepSpeech LibriSpeech recipe 2021-10-01 12:42:46 +00:00
Eren Gölge
5ee01e5624 implement DeepSpeechConfig 2021-10-01 12:42:46 +00:00
Eren Gölge
e033b3d7bb Add mfcc to BaseAudioConfig 2021-10-01 12:42:46 +00:00
Eren Gölge
aee483f0a9 Implement STT downloaders 2021-10-01 12:42:46 +00:00
Eren Gölge
f60eea701f Implement STT tokenizer 2021-10-01 12:42:46 +00:00
Eren Gölge
1787b2303a Implement STTDataset 2021-10-01 12:42:46 +00:00
Eren Gölge
0c7a2eb948 Implement BaseSTT 2021-10-01 12:42:46 +00:00
Eren Gölge
d2323f0d98 Implement DeepSpeech 2021-10-01 12:42:46 +00:00
Eren Gölge
89cbfbc829 Add initial data downloaders for stt 2021-10-01 12:42:46 +00:00
Eren Gölge
4157e99d2d Allow custom padding value 2021-10-01 12:42:46 +00:00
Eren Gölge
355dfee98d Add mfcc to AudioProcessor 2021-10-01 12:42:46 +00:00
Eren Gölge
21cc0517a3 Fix WaveRNN test 2021-10-01 10:21:37 +00:00
Eren Gölge
4dbe7ed0de Fix all-zero duration case for GlowTTS 2021-10-01 09:24:26 +00:00
Eren Gölge
37959ad0c7 Make linter 2021-09-30 23:02:16 +00:00
Eren Gölge
0b1986384f Make style 2021-09-30 16:21:18 +00:00
Eren Gölge
7edbe04fe0 Fix WaveRNN config and test 2021-09-30 16:20:12 +00:00
Eren Gölge
55d9209221 Remote STT tokenizer 2021-09-30 14:58:26 +00:00
Eren Gölge
6d3b2d3cdd Update docs 2021-09-30 14:47:56 +00:00
Eren Gölge
f904dd4828 Share some ASCII ❤️ 2021-09-30 14:47:56 +00:00
Eren Gölge
4cacbf0d45 Fix WaveRNN test 2021-09-30 14:47:56 +00:00
Eren Gölge
5fa78ee69f Remove old Tacotron recipes 2021-09-30 14:47:56 +00:00
Eren Gölge
9631aab0e7 Fix imports 2021-09-30 14:47:56 +00:00
Eren Gölge
ba2b8c827f Update train_tts.py and train_vocoder.py 2021-09-30 14:47:56 +00:00
Eren Gölge
2e9b6b4f90 Refactor Speaker Encoder training 2021-09-30 14:47:56 +00:00
Eren Gölge
043dca61b4 Rename load_meta_data as load_tts_data 2021-09-30 14:47:56 +00:00
Eren Gölge
9f23ad6a0f Fix imports 2021-09-30 14:47:56 +00:00
Eren Gölge
16b70be0dd Add _set_model_args to BaseModel 2021-09-30 14:47:56 +00:00
Eren Gölge
9a0d8fa027 Update copy_model_files() 2021-09-30 14:47:56 +00:00
Eren Gölge
4163b4f2e4 Update Tacotron models 2021-09-30 14:47:56 +00:00
Eren Gölge
e27feade38 Fixup wavernn 2021-09-30 14:47:56 +00:00
Eren Gölge
45889804c2 Update VITS 2021-09-30 14:47:56 +00:00
Eren Gölge
4f94f91305 Update WaveRNN 2021-09-30 14:47:56 +00:00
Eren Gölge
3d5205d66f Update WaveGrad 2021-09-30 14:47:56 +00:00
Eren Gölge
fd95926009 Update GlowTTS 2021-09-30 14:47:56 +00:00
Eren Gölge
4baecdf92a Update GAN for Trainer_v2 2021-09-30 14:47:56 +00:00
Eren Gölge
a156a40b47 Update ForwardTTS for Trainer_v2 2021-09-30 14:19:19 +00:00
Eren Gölge
d9df33f837 Update align_tts for trainer_v2 2021-09-30 14:18:10 +00:00
Eren Gölge
8ada870a57 Refactor trainer.py for v2 2021-09-30 14:16:34 +00:00
Eren Gölge
7f388f26e3 Bump up to v0.3.1 v0.3.1 2021-09-17 23:53:22 +00:00
Eren Gölge
0f3d868089 Merge pull request #815 from coqui-ai/dev
v0.3.1
2021-09-18 01:50:37 +02:00
Eren Gölge
2766dd1d6e Fix #813 - GlowTTS training (#814)
* Fix #813
* Update glow_tts recipe
* Fix glow-tts test
* Linter fix
* Run data dep init only in training
2021-09-17 20:06:55 +02:00
Eren Gölge
0592a5805c Merge pull request #803 from coqui-ai/dev
v0.3.0
v0.3.0
2021-09-13 14:13:33 +02:00
Eren Gölge
f563415052 Bump up to v0.3.0 2021-09-13 09:40:38 +00:00
Eren Gölge
a97dc8d09f Fix trainer malformatted print 2021-09-13 08:32:02 +00:00
Eren Gölge
91bebebe18 Add new models to .models.json
SpeedySpeech model using `ForwardTTS`
UnivNet model fine-tuned on TacotronDDC_ph spectrograms
2021-09-13 08:22:14 +00:00
Eren Gölge
aed9a32d52 Merge pull request #800 from coqui-ai/forward_tts
Forward TTS implementation
2021-09-13 09:25:56 +02:00