Commit Graph

352 Commits

Author SHA1 Message Date
Eren Gölge
4dbe7ed0de Fix all-zero duration case for GlowTTS 2021-10-01 09:24:26 +00:00
Eren Gölge
7edbe04fe0 Fix WaveRNN config and test 2021-09-30 16:20:12 +00:00
Eren Gölge
4cacbf0d45 Fix WaveRNN test 2021-09-30 14:47:56 +00:00
Eren Gölge
2766dd1d6e Fix #813 - GlowTTS training (#814)
* Fix #813

* Update glow_tts recipe

* Fix glow-tts test

* Linter fix

* Run data dep init only in training
2021-09-17 20:06:55 +02:00
Eren Gölge
1e7db32e90 Test FastPitch train 2021-09-11 10:19:47 +00:00
Eren Gölge
26f76fce22 Remove SpeedySpeech from .models.json 2021-09-10 17:47:27 +00:00
Eren Gölge
7ec23e69d4 Skip TF tests on GPU 2021-09-10 17:28:58 +00:00
Eren Gölge
1ebf9ec6bf Remove speedy_speech implementation 2021-09-10 17:28:20 +00:00
Eren Gölge
7d8f77385a Use glow-tts in synthesis tests 2021-09-10 17:27:33 +00:00
Eren Gölge
d6e29ef98a Style update 2021-09-10 08:30:33 +00:00
Eren Gölge
3abc3a1d32 Fix GPU init in tests 2021-09-10 08:28:10 +00:00
Eren Gölge
ed4b1d8514 Test TTS.tts.utils.helpers 2021-09-10 08:25:21 +00:00
Eren Gölge
8b7e094bde Implement forward_tts
- Generic API for feed-forward TTS models (FastPitch, SpeedySpeech)

- Tests for `forward-tts`

- Edit  FastPitchConfig and SpeedySpeechConfig to use `forward_tts`
2021-09-10 08:24:33 +00:00
Eren Gölge
4761853c5c Fix imports 2021-09-08 13:34:40 +00:00
Eren Gölge
e72c265cd4 Fix linter issues 2021-09-06 15:16:58 +00:00
Eren Gölge
fd287aa438 Update loader tests for dict return 2021-09-06 15:16:58 +00:00
Eren Gölge
2c4bbbf9b9 Use pyworld for pitch 2021-09-06 15:16:58 +00:00
Eren Gölge
076d0cb258 Add tests for certain FastPitch functions 2021-09-06 15:16:58 +00:00
Eren Gölge
d63a6bb690 Set BaseDatasetConfig for tests 2021-09-06 15:16:58 +00:00
Eren Gölge
fba257104d Compute F0 using librosa 2021-09-06 15:16:58 +00:00
Katsuya Iida
165e5814af Update Japanese phonemizer (#758)
* Update default ja vocoder

* update

* Japanese phonemizer test

* Run make style

Co-authored-by: Eren Gölge <egolge@coqui.ai>
2021-09-01 09:33:15 +02:00
Eren Gölge
f186856e5d Add option to sort input sequnce by audio len 2021-08-30 08:10:35 +00:00
Eren Gölge
c312acac7d Implement VITS model 🚀
VITS model implementation built on Glow TTS and HiFiGAN
layers.
2021-08-09 18:02:36 +00:00
Eren Gölge
003e5579e8 Enable custom_symbols in text processing
Models can define their own custom symbols lists with custom
`make_symbols()`
2021-08-09 18:02:36 +00:00
Eren Gölge
e4648ffef1 Fix multi-speaker init of Tacotron models & tests 2021-08-09 18:02:36 +00:00
Agrin Hilmkil
ced4cfdbbf Allow saving / loading checkpoints from cloud paths (#683)
* Allow saving / loading checkpoints from cloud paths

Allows saving and loading checkpoints directly from cloud paths like
Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec.

Note: The user will have to install the relevant dependency for each
protocol. Otherwise fsspec will fail and specify which dependency is
missing.

* Append suffix _fsspec to save/load function names

* Add a lower bound to the fsspec dependency

Skips the 0 major version.

* Add missing changes from refactor

* Use fsspec for remaining artifacts

* Add test case with path requiring fsspec

* Avoid writing logs to file unless output_path is local

* Document the possibility of using paths supported by fsspec

* Fix style and lint

* Add missing lint fixes

* Add type annotations to new functions

* Use Coqpit method for converting config to dict

* Fix type annotation in semi-new function

* Add return type for load_fsspec

* Fix bug where fs not always created

* Restore the experiment removal functionality
2021-08-09 18:02:36 +00:00
Eren Gölge
75b201c6c1 Merge pull request #673 from coqui-ai/fix_stopnet
Fix stopnet training for Tacotron models
2021-07-24 12:25:38 +02:00
Eren Gölge
fc0c4600bd Fix stopnet training 2021-07-24 11:39:54 +02:00
Eren Gölge
30eed347b6 Merge pull request #581 from Edresson/dev
Compute speaker embeddings in batch for the LSTM  Speaker Encoder and Compute embeddings/ finding chars using config file.
2021-07-23 17:22:51 +02:00
WeberJulian
c79a82ed07 refix linter 2021-07-13 23:12:18 +02:00
Edresson
2e5baffa9c Merge fix and eval split as argparse 2021-07-13 01:47:32 -03:00
Edresson
4eac1c4651 bug fix on train_encoder and unit tests 2021-07-11 12:00:39 -03:00
Eren Gölge
1e9538aaef Add more model tests to test_synthesize 2021-07-04 11:45:49 +02:00
Eren Gölge
47b3b10d6d Bump up to v0.1.0 🚀 2021-06-29 13:07:59 +02:00
Eren Gölge
9790eddada Fix wrong argument name 🛠️ 2021-06-28 17:03:47 +02:00
Eren Gölge
626c9d41e6 Update tests for the new trainer API 2021-06-28 17:03:19 +02:00
Eren Gölge
fcfd95669a Update model test configs 2021-06-28 17:03:19 +02:00
Eren Gölge
00c82c516d rename to 2021-06-28 17:03:19 +02:00
Eren Gölge
fdfb18d230 downsize melgan test model size 2021-06-28 17:03:19 +02:00
Eren Gölge
25238e0658 fix glow-tts inference() 2021-06-28 17:03:19 +02:00
Eren Gölge
82582993cc use one testing sentence in tts tests 2021-06-28 17:03:19 +02:00
Eren Gölge
419735f440 refactor and fix multi-speaker training in Trainer and Tacotron models 2021-06-28 17:03:19 +02:00
Eren Gölge
269e5a734e add max_decoder_steps argument to tacotron models 2021-06-28 17:03:19 +02:00
Eren Gölge
304d60197b reduce multiband melgan test model size 2021-06-28 17:03:19 +02:00
Eren Gölge
db6a97d1a2 rename external speaker embedding arguments as d_vectors 2021-06-28 17:03:19 +02:00
Eren Gölge
877bf66b61 reduce size of the metadata.csv used at testing 2021-06-28 17:03:19 +02:00
Eren Gölge
87c61d210a update test to be less demanding 2021-06-28 17:03:19 +02:00
Eren Gölge
6d6896fd99 reduce fullband-melgan test model size 2021-06-28 17:03:19 +02:00
Eren Gölge
1443d03af1 update test for the new input output API of the tts models 2021-06-28 17:03:19 +02:00
Eren Gölge
ef4ea9e527 update imports for formatters 2021-06-28 17:03:19 +02:00