Commit Graph

4543 Commits

Author SHA1 Message Date
Edresson Casanova
e8a1a50273 Remove unused vars in Delightful TTS layers tests 2023-10-23 09:26:36 -03:00
Edresson Casanova
ec7f54768a Rebase bug fix and update recipe 2023-10-21 17:37:51 -03:00
Edresson Casanova
affaf11148 Add XTTS training unit test 2023-10-21 13:41:12 -03:00
Edresson Casanova
1f92741d6a Fix issue #2971 2023-10-21 13:37:21 -03:00
Edresson Casanova
94dcf84979 Rename XTTS recipe 2023-10-21 13:37:21 -03:00
Edresson Casanova
5f98dbeec9 Update Ljspeech XTTS recipe 2023-10-21 13:37:21 -03:00
Edresson Casanova
469d624615 Update LJspeech XTTS recipe 2023-10-21 13:37:21 -03:00
Edresson Casanova
9e3598c3b7 Bug Fix on inference using XTTS trainer checkpoint 2023-10-21 13:37:21 -03:00
Edresson Casanova
c4ceaabe2c Add test sentences during the training 2023-10-21 13:33:56 -03:00
Edresson Casanova
2f868dd5c2 Bug fix on reproducible evaluation 2023-10-21 13:33:56 -03:00
Edresson Casanova
bafab049c2 Add prompting masking 2023-10-21 13:33:56 -03:00
Edresson Casanova
47d613df3a Add reproducible evaluation 2023-10-21 13:33:56 -03:00
Edresson Casanova
40a4e631ea Update mel spectrogram for the style encoder 2023-10-21 13:33:56 -03:00
Edresson Casanova
a32961bcb4 Add XTTS base training code 2023-10-21 13:33:56 -03:00
Eren Gölge
1e152692ed Bump up to v0.18.2 v0.18.2 2023-10-21 17:29:53 +02:00
Eren Gölge
420a90ed63 Merge pull request #3096 from coqui-ai/fix-xtts-v1.1
Fix xtts v1.1
2023-10-21 17:28:58 +02:00
Julian Weber
dad6a7b0b6 Preserve [ja] token of the text processing 2023-10-21 11:26:03 +02:00
Julian Weber
c7a16042e3 Remove global cutlet import 2023-10-21 11:18:58 +02:00
Edresson Casanova
414f0de0a1 Bump up to v0.18.1 v0.18.1 2023-10-20 17:30:58 -03:00
Edresson Casanova
59576fc0ec Bug fix on XTTS v1.1 inference (#3093)
* Bug fix on XTTS v1.1 inference

* Update .models.json

---------

Co-authored-by: Julian Weber <julian.weber@hotmail.fr>
2023-10-20 17:29:43 -03:00
Eren Gölge
85e7323739 Bump up to v0.18.0 v0.18.0 2023-10-20 16:03:24 +02:00
Julian Weber
cf97116185 XTTS v1.1 (#3089)
* Add support for ne_hifigan

* Update model.json

* Update hash

* Fix model loading

* Enhance text_normalization

* Add xtts to zoo test exception

* Add model hash check

* Add get_number_tokens
2023-10-20 16:02:08 +02:00
Eren Gölge
747f688dc3 Bump up to v0.17.10 v0.17.10 2023-10-19 12:00:15 +02:00
Eren Gölge
93e6961bb5 Update .models.json 2023-10-19 11:59:49 +02:00
Eren Gölge
bf68848f38 Bump up to v0.17.9 v0.17.9 2023-10-19 11:22:42 +02:00
Eren Gölge
c3b011217d Update .models.json 2023-10-19 11:21:21 +02:00
Julian Weber
d21f15cc85 fix readme (#3071)
* fix readme

* fix inference.md
2023-10-17 10:27:11 +02:00
Julian Weber
dcce1644b7 Fix doc dataset (#3070)
* fix formatting dataset doc

* fix autocomplete
2023-10-16 12:29:52 +02:00
David Garvey
a151d70242 Add stdout option (#3027)
* add cli options for play and speed
--play argument uses simpleaudio to play the tts wav
--speed <float 0.0-2.0> passes speed argument to Coqui Studio models

* remove simpleaudio not referenced in file

* fix simpleaudio dependency version

* add ALSA headers for simpleaudio compilation

* Dockerfile ALSA headers for simpleaudio

* base changes to use stdout instead of play audio
Considering conversion to pipe wav data for audio playback with other program
like aplay.

This is incomplete code. Using to get feedback before proceeding with
implementation.

* remove play for pipe_out arg that suppresses stdout
removed play and simpleaudio dependency in place of pipe
functionality to allow passing wav file data to a program
dedicated to playing audio.

* scipy.io.wavfile.write fails with /dev/null target

* Streaming inference for XTTS 🚀 (#3035)

* v0.17.7

* Redownload XTTS when the local and remote config do not match

* Remove unused method

* Print a message when it is already downloaded

* Try-except to present error when the user doesn't have a connection

* Fix style

* 0.17.8

* v0.17.8

---------

Co-authored-by: Julian Weber <julian.weber@hotmail.fr>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
Co-authored-by: Edresson Casanova <edresson1@gmail.com>
Co-authored-by: ggoknar <ggoknar@coqui.ai>
2023-10-16 12:07:21 +02:00
Eren Gölge
cae185fd16 Update README.md 2023-10-16 12:00:59 +02:00
Subash-Lamichhane
b4666bb75e fixed typo of /docs (#3065) 2023-10-16 11:57:15 +02:00
Subash-Lamichhane
3d146422c2 fixed typo of docs\source\implementing_a_new_model.md (#3066) 2023-10-16 11:57:04 +02:00
Dusty Hagstrom
13cd076a7f Synthesizer skips over embeddings file if model only has one speaker (#2587)
* It looks like the Neon model is special in that it does not have a speaker_name and it wants to get the only item available. This was blocking a valid model with one speaker and a d_vector_file from being executed to get the embedding.

* Update synthesizer.py

oh my how embarrassing
2023-10-16 11:55:45 +02:00
Meryem Sakin
e4b8d71f2b Update AnalyzeDataset.ipynb (#2783) 2023-10-16 11:52:37 +02:00
Eren Gölge
b25d96ecee Merge pull request #3058 from coqui-ai/spkr_enc_3020
fixed bugs in fastpitch tts synthesis
2023-10-14 11:40:31 +02:00
Aya Jafari
ffddf10458 unit test fix 2023-10-13 10:56:47 -03:00
Aya Jafari
6eaecab0ca fixed bugs in fastpitch tts synthesis 2023-10-10 23:02:31 -03:00
ggoknar
99635193f5 v0.17.8 v0.17.8 2023-10-07 01:14:05 +03:00
ggoknar
3bb51b1276 0.17.8 2023-10-07 01:13:02 +03:00
Gorkem
0f46757c47 Merge pull request #3038 from coqui-ai/xtts_redonwload
XTTS redownload if needed
2023-10-07 01:02:44 +03:00
Edresson Casanova
2852404bdf Fix style 2023-10-06 17:42:46 -03:00
Edresson Casanova
99650044a4 Try-except to present error when the user doesn't have a connection 2023-10-06 17:37:05 -03:00
Edresson Casanova
529ea3f67f Print a message when it is already downloaded 2023-10-06 17:26:40 -03:00
Edresson Casanova
ee1ef1c51e Remove unused method 2023-10-06 17:21:22 -03:00
Edresson Casanova
4a6103fec9 Redownload XTTS when the local and remote config do not match 2023-10-06 17:16:30 -03:00
Eren Gölge
0520697b5f v0.17.7 v0.17.7 2023-10-06 18:35:26 +02:00
Julian Weber
e5e0cbffc9 Streaming inference for XTTS 🚀 (#3035) 2023-10-06 18:34:06 +02:00
OPERATOR
2150136210 None is not able to be read for "XTTS", fixes crash if it's set to None. (#3009) 2023-10-02 12:53:36 +02:00
Anupam Maurya
f133b9d2d7 Upgrade and Optimize TTS Code in extractttsspectrogram.ipynb (#3012) 2023-10-02 12:51:55 +02:00
Eren Gölge
155c5fc0bd v0.17.6 v0.17.6 2023-09-29 23:44:09 +02:00