338 Commits

Author SHA1 Message Date
Eren Gölge
e72c265cd4 Fix linter issues 2021-09-06 15:16:58 +00:00
Eren Gölge
fd287aa438 Update loader tests for dict return 2021-09-06 15:16:58 +00:00
Eren Gölge
2c4bbbf9b9 Use pyworld for pitch 2021-09-06 15:16:58 +00:00
Eren Gölge
076d0cb258 Add tests for certain FastPitch functions 2021-09-06 15:16:58 +00:00
Eren Gölge
d63a6bb690 Set BaseDatasetConfig for tests 2021-09-06 15:16:58 +00:00
Eren Gölge
fba257104d Compute F0 using librosa 2021-09-06 15:16:58 +00:00
Katsuya Iida
165e5814af Update Japanese phonemizer (#758)
* Update default ja vocoder

* update

* Japanese phonemizer test

* Run make style

Co-authored-by: Eren Gölge <egolge@coqui.ai>
2021-09-01 09:33:15 +02:00
Eren Gölge
f186856e5d Add option to sort input sequence by audio len 2021-08-30 08:10:35 +00:00
Eren Gölge
c312acac7d Implement VITS model 🚀
VITS model implementation built on Glow TTS and HiFiGAN
layers.
2021-08-09 18:02:36 +00:00
Eren Gölge
003e5579e8 Enable custom_symbols in text processing
Models can define their own custom symbols lists with custom
`make_symbols()`
2021-08-09 18:02:36 +00:00
Eren Gölge
e4648ffef1 Fix multi-speaker init of Tacotron models & tests 2021-08-09 18:02:36 +00:00
Agrin Hilmkil
ced4cfdbbf Allow saving / loading checkpoints from cloud paths (#683)
* Allow saving / loading checkpoints from cloud paths

Allows saving and loading checkpoints directly from cloud paths like
Amazon S3 (s3://) and Google Cloud Storage (gs://) by using fsspec.

Note: The user will have to install the relevant dependency for each
protocol. Otherwise fsspec will fail and specify which dependency is
missing.

* Append suffix _fsspec to save/load function names

* Add a lower bound to the fsspec dependency

Skips the 0 major version.

* Add missing changes from refactor

* Use fsspec for remaining artifacts

* Add test case with path requiring fsspec

* Avoid writing logs to file unless output_path is local

* Document the possibility of using paths supported by fsspec

* Fix style and lint

* Add missing lint fixes

* Add type annotations to new functions

* Use Coqpit method for converting config to dict

* Fix type annotation in semi-new function

* Add return type for load_fsspec

* Fix bug where fs was not always created

* Restore the experiment removal functionality
2021-08-09 18:02:36 +00:00
Eren Gölge
75b201c6c1 Merge pull request #673 from coqui-ai/fix_stopnet
Fix stopnet training for Tacotron models
2021-07-24 12:25:38 +02:00
Eren Gölge
fc0c4600bd Fix stopnet training 2021-07-24 11:39:54 +02:00
Eren Gölge
30eed347b6 Merge pull request #581 from Edresson/dev
Compute speaker embeddings in batch for the LSTM Speaker Encoder and compute embeddings / finding chars using config file.
2021-07-23 17:22:51 +02:00
WeberJulian
c79a82ed07 refix linter 2021-07-13 23:12:18 +02:00
Edresson
2e5baffa9c Merge fix and eval split as argparse 2021-07-13 01:47:32 -03:00
Edresson
4eac1c4651 bug fix on train_encoder and unit tests 2021-07-11 12:00:39 -03:00
Eren Gölge
1e9538aaef Add more model tests to test_synthesize 2021-07-04 11:45:49 +02:00
Eren Gölge
47b3b10d6d Bump up to v0.1.0 🚀 2021-06-29 13:07:59 +02:00
Eren Gölge
9790eddada Fix wrong argument name 🛠️ 2021-06-28 17:03:47 +02:00
Eren Gölge
626c9d41e6 Update tests for the new trainer API 2021-06-28 17:03:19 +02:00
Eren Gölge
fcfd95669a Update model test configs 2021-06-28 17:03:19 +02:00
Eren Gölge
00c82c516d rename to 2021-06-28 17:03:19 +02:00
Eren Gölge
fdfb18d230 downsize melgan test model size 2021-06-28 17:03:19 +02:00
Eren Gölge
25238e0658 fix glow-tts inference() 2021-06-28 17:03:19 +02:00
Eren Gölge
82582993cc use one testing sentence in tts tests 2021-06-28 17:03:19 +02:00
Eren Gölge
419735f440 refactor and fix multi-speaker training in Trainer and Tacotron models 2021-06-28 17:03:19 +02:00
Eren Gölge
269e5a734e add max_decoder_steps argument to tacotron models 2021-06-28 17:03:19 +02:00
Eren Gölge
304d60197b reduce multiband melgan test model size 2021-06-28 17:03:19 +02:00
Eren Gölge
db6a97d1a2 rename external speaker embedding arguments as d_vectors 2021-06-28 17:03:19 +02:00
Eren Gölge
877bf66b61 reduce size of the metadata.csv used at testing 2021-06-28 17:03:19 +02:00
Eren Gölge
87c61d210a update test to be less demanding 2021-06-28 17:03:19 +02:00
Eren Gölge
6d6896fd99 reduce fullband-melgan test model size 2021-06-28 17:03:19 +02:00
Eren Gölge
1443d03af1 update test for the new input output API of the tts models 2021-06-28 17:03:19 +02:00
Eren Gölge
ef4ea9e527 update imports for formatters 2021-06-28 17:03:19 +02:00
Eren Gölge
6c495c6a6e fix glow-tts inference and forward functions for handling cond_input
and refactor its test
2021-06-28 17:03:19 +02:00
Eren Gölge
b500338faa make style 2021-06-28 17:03:19 +02:00
Eren Gölge
d25f017b42 update setup_model.py imports 2021-06-28 17:03:19 +02:00
Eren Gölge
7dff6be871 update tts training tests to use the trainer 2021-06-28 17:03:19 +02:00
Eren Gölge
9134c7dfb6 update sequence_mask import globally 2021-06-28 17:03:19 +02:00
Eren Gölge
8def3c87af trainer-API updates 2021-06-28 17:03:19 +02:00
Eren Gölge
42554cc711 rename MyDataset -> TTSDataset 2021-06-28 17:03:19 +02:00
Edresson Casanova
eb84bb2bc8 Merge branch 'dev' into dev 2021-06-26 15:32:19 -03:00
Eren Gölge
6c7bbcaef0 Use en-us for testing phoneme models 2021-06-25 16:52:17 +02:00
Michael Hansen
a41f53fe72 Fix silly error in tests 2021-06-25 14:41:35 +02:00
Michael Hansen
3f172b84d8 Fix linting issues 2021-06-25 14:41:31 +02:00
Michael Hansen
4d8426fa0a Use eSpeak IPA lexicons by default for phoneme models 2021-06-25 14:41:05 +02:00
Michael Hansen
47191f3ecc Add tests for gruut phonemization 2021-06-25 14:41:05 +02:00
Edresson
28bec238ca fix Lint checks 2021-06-18 14:33:50 -03:00