4668 Commits

Author SHA1 Message Date
Edresson Casanova
5f900f156a Add XTTS Fine tuning gradio demo (#3296)
* Add XTTS FT demo data processing pipeline

* Add training and inference columns

* Uses tabs instead of columns

* Fix demo freezing issue

* Update demo

* Convert stereo to mono

* Bug fix on XTTS inference

* Update gradio demo

* Update gradio demo

* Update gradio demo

* Update gradio demo

* Add parameters to be able to set then on colab demo

* Add erros messages

* Add intuitive error messages

* Update

* Add max_audio_length parameter

* Add XTTS fine-tuner docs

* Update XTTS finetuner docs

* Delete trainer to freeze memory

* Delete unused variables

* Add gc.collect()

* Update xtts.md

---------

Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-12-01 23:52:23 +01:00
Vlad Cuciureanu
f5b41674e8 fix: Few typos in Tortoise docs. 2023-12-01 20:42:41 +02:00
Aaron-Li
7b8808186a fix pause problem of Chinese speech 2023-12-01 23:30:03 +08:00
Frederico S. Oliveira
bcd500fa7b Fixing bug
Correction in training the Fastspeech/Fastspeech2/FastPitch/SpeedySpeech model using external speaker embedding.
2023-11-30 17:27:05 -03:00
Frederico S. Oliveira
a26e51b0b4 Merge branch 'coqui-ai:dev' into dev 2023-11-30 14:19:05 -03:00
Eren Gölge
6d1905c2b7 Update to v0.21.2 v0.21.2 2023-11-30 13:05:10 +01:00
Hannes Krumbiegel
e40527b103 Fix link to installation instructions (#3329) 2023-11-30 13:03:33 +01:00
Enno Hermann
39321d02be fix: correctly strip/restore initial punctuation (#3336)
* refactor(punctuation): remove orphan code for handling lone punctuation

The case of lone punctuation is already handled at the top of restore(). The
removed if statement would never be called and would in fact raise an
AttributeError because the _punc_index named tuple doesn't have the attribute
`mark`.

* refactor(punctuation): remove unused argument

* fix(punctuation): correctly handle initial punctuation

Stripping and restoring initial punctuation didn't work correctly because the
string-splitting caused an additional empty string to be inserted in the text
list (because `".A".split(".")` => `["", "A"]`). Now, an initial empty string is
skipped and relevant test cases are added.

Fixes #3333
2023-11-30 13:03:16 +01:00
Eren Gölge
93283385e0 Merge pull request #3318 from coqui-ai/calling_hf_models
Run XTTS models by direct name with versions
2023-11-30 13:02:26 +01:00
Frederico S. Oliveira
77c2155609 Merge pull request #1 from coqui-ai/dev
Update
2023-11-29 17:24:02 -03:00
Eren G??lge
bfbaffc84a Fixup 2023-11-28 13:47:45 +01:00
Eren G??lge
18b7d746cb Updating XTTS docs 2023-11-27 14:54:49 +01:00
Eren G??lge
b75e90ba85 Make text splitting optional 2023-11-27 14:53:11 +01:00
Eren G??lge
3b8894a3dd Make style 2023-11-27 14:15:50 +01:00
Eren G??lge
2fd8cf3d94 Make xtts runnable by version names 2023-11-27 14:15:16 +01:00
Eren G??lge
11ec9f7471 Add hi in config defaults 2023-11-24 15:38:36 +01:00
Eren G??lge
00a870c26a Update to v0.21.1 v0.21.1 2023-11-24 15:15:44 +01:00
Eren G??lge
7e575068c9 Merge branch 'dev' of https://github.com/coqui-ai/TTS into dev 2023-11-24 15:15:19 +01:00
Eren G??lge
32065139e7 Simple text cleaner for "hi" 2023-11-24 15:14:34 +01:00
Eren Gölge
1542a50c3a Update to v0.21.0 v0.21.0 2023-11-24 14:37:05 +01:00
Eren G??lge
6dd43b0ce2 Update to XTTS v2.0.3 2023-11-24 14:36:04 +01:00
Julian Weber
a55755c8df update deepspeed version (#3281) 2023-11-24 12:35:49 +01:00
Kaszanas
1bf5926196 Introducing Development Dockerfile (#3263)
* Moved Dockerfile, COPY at the end

This change should prevent re-installation of the dependencies upon
every change of the repository's contents. Typically if Docker detects
that something changed in a layer, all downstream layers are invalidated
and rebuilt.

* Moved Dockerfile back to main directory

Main dockerfile in a separate directory can cause issues with the
current CI/CD setup. This can be a good change for later.

* Introduced Dockerfile.dev, updated CONTRIBUTING

Dockerfile.dev can be used as a separate development environment for
anyone that does not wish to install the dependencies locally.
2023-11-24 12:30:15 +01:00
TITC
4d0f53d2ee Misjudgment of is_multi_lingual When Loading Multilingual Model via model_path (#3273)
* load multilingual model by path

* use config to assert multi lingual or not
2023-11-24 12:28:31 +01:00
Enno Hermann
8c5227ed84 Fix tts_with_vc (#3275)
* Revert "fix for issue 3067"

This reverts commit 041b4b6723.

Fixes #3143. The original issue (#3067) was people trying to use
tts.tts_with_vc_to_file() with XTTS and was "fixed" in #3109. But XTTS has
integrated VC and you can just do tts.tts_to_file(..., speaker_wav="..."), there
is no point in passing it through FreeVC afterwards. So, reverting this commit
because it breaks tts.tts_with_vc_to_file() for any model that doesn't have
integrated VC, i.e. all models this method is meant for.

* fix: support multi-speaker models in tts_with_vc/tts_with_vc_to_file

* fix: only compute spk embeddings for models that support it

Fixes #1440. Passing a `speaker_wav` argument to regular Vits models failed
because they don't support voice cloning. Now that argument is simply ignored.
2023-11-24 12:26:37 +01:00
Enno Hermann
2af0220996 fix: don't pass quotes to espeak (#3286)
Previously, the text was wrapped in an additional set of quotes that was passed
to Espeak. This could result in different phonemization in certain edges and
caused the insertion of an initial separator "_" that had to be removed.
Compare:
$ espeak-ng -q -b 1 -v en-us --ipa=1 '"A"'
_ˈɐ
$ espeak-ng -q -b 1 -v en-us --ipa=1 'A'
ˈeɪ

Fixes #2619
2023-11-24 12:25:37 +01:00
Enno Hermann
4a2684be34 fix(bin.synthesize): more informative error for wrong --language argument (#3294)
In multilingual models, the target language is specified via the
`--language_idx` argument. However, the `tts` CLI also accepts a `--language`
argument for use with Coqui Studio, so it is easy to choose the wrong one,
resulting in the following confusing error at synthesis time:

```
AssertionError:   Language None is not supported. Supported languages are
['en', 'es', 'fr', 'de', 'it', 'pt', 'pl', 'tr', 'ru', 'nl', 'cs', 'ar',
'zh-cn', 'hu', 'ko', 'ja']
```

This commit adds a better error message when `--language` is passed for a
non-studio model.

Fixes #3270, fixes #3291
2023-11-24 12:24:42 +01:00
Tessa Painter
64f391b583 Made the tqdm progress_bar objects of static download methods a static class variable (#3297) 2023-11-24 12:23:59 +01:00
Eren Gölge
b47d9c6e36 Merge pull request #3243 from idiap/checkpoints
Remove duplicate/unused code
2023-11-22 23:52:06 +01:00
Eren Gölge
29dede20d3 Merge pull request #3249 from coqui-ai/run_ci_for_v0.20.6
Run CI for v0.20.6
v0.20.6
2023-11-17 15:45:26 +01:00
Eren Gölge
c011ab7455 Update to v0.20.6 2023-11-17 15:16:32 +01:00
Eren G??lge
52cb1e2f68 Update model hash for v2.0.2 2023-11-17 15:16:32 +01:00
Edresson Casanova
6075fa208c Ensures that only GPT model is in training mode during XTTS GPT training (#3241)
* Ensures that only GPT model is in training mode during training

* Fix parallel wavegan unit test
2023-11-17 15:15:22 +01:00
Eren G??lge
a3279f9294 Make style 2023-11-17 15:15:22 +01:00
Eren G??lge
f21067a84a Make k_diffusion optional 2023-11-17 15:15:21 +01:00
Eren G??lge
44494daa27 Update CI version 2023-11-17 15:15:21 +01:00
Eren G??lge
c864acf2b7 Update versions 2023-11-17 15:15:21 +01:00
Edresson Casanova
11283fce07 Ensures that only GPT model is in training mode during XTTS GPT training (#3241)
* Ensures that only GPT model is in training mode during training

* Fix parallel wavegan unit test
2023-11-17 15:13:46 +01:00
Eren Gölge
14579a4607 Merge pull request #3248 from coqui-ai/slacker_deps
Update versions
2023-11-17 15:13:19 +01:00
Eren G??lge
44880f09ed Make style 2023-11-17 13:43:34 +01:00
Eren G??lge
26efdf6ee7 Make k_diffusion optional 2023-11-17 13:42:33 +01:00
Eren G??lge
08d11e9198 Update CI version 2023-11-17 13:01:32 +01:00
Eren G??lge
63d7145647 Update versions 2023-11-17 12:10:46 +01:00
Enno Hermann
0fb0d67de7 refactor: use save_checkpoint()/save_best_model() from Trainer 2023-11-17 01:18:23 +01:00
Enno Hermann
96678c7ba2 refactor: use copy_model_files() from Trainer 2023-11-17 01:18:23 +01:00
Enno Hermann
5119e651a1 chore(utils.io): remove unused code
These are all available in Trainer.
2023-11-17 01:18:23 +01:00
Enno Hermann
39fe38bda4 refactor: use save_fsspec() from Trainer 2023-11-17 01:18:23 +01:00
Enno Hermann
fdf0c8b10a chore(encoder): remove unused code 2023-11-17 01:18:23 +01:00
Eren Gölge
7e4375da2b Update to v0.20.6 2023-11-16 17:52:13 +01:00
Julian Weber
fbc18b8c34 Fix zh bug (#3238) 2023-11-16 17:51:37 +01:00