4393 Commits

Author SHA1 Message Date
Eren G??lge
346e3de87c Update docs 2023-09-04 12:24:49 +02:00
Eren Gölge
04a36a727b Bump up to v0.16.4 v0.16.4 2023-08-26 10:39:48 +02:00
Eren Gölge
a96562a750 Update .models.json 2023-08-26 10:36:40 +02:00
Jake Tae
409db505d2 Add device support in TTS and Synthesizer (#2855)
* fix: resolve merge conflicts

* fix: retain backwards compatability in functions

* feature: utilize device for voice transfer

* feature: use device for vocoder

* chore: cleanup vocoder cpu logic

* fix: add necessary vocoder output device check

* fix: add necessary vocoder output device check

* fix: indentation

* fix: check if waveform is pt tensor before cpu conversion

---------

Co-authored-by: Jake Tae <jaketae@Jakes-MacBook-Pro-2.local>
2023-08-14 21:04:44 +02:00
Julian Weber
febcaf710a Add customizable data home path (#2871)
* Add customizable data home path

* Add TTS_HOME as an option
2023-08-14 21:02:48 +02:00
Eren Gölge
c4e5effab9 Bump up to v0.16.3 v0.16.3 2023-08-13 12:22:04 +02:00
Michael New
1f9d600b83 Denote human voices in README.md (#2851) 2023-08-13 12:15:17 +02:00
Eren Gölge
3a104d5c49 Update Studio API for XTTS (#2861)
* Update Studio API for XTTS

* Update the docs

* Update README.md

* Update README.md

Update README
2023-08-13 12:04:12 +02:00
Eren G??lge
37b558ccb9 Make style 2023-08-11 12:55:23 +02:00
Eren G??lge
9a8352b8da Fix import error with Bark 2023-08-11 03:33:59 +02:00
Eren Gölge
c87377b713 Bump up to v0.16.2 v0.16.2 2023-08-07 13:21:14 +02:00
Eren Gölge
4186f42b21 Handle missing JA phonemizer (#2843)
* Handle missing JA phonemizer

* Make style
2023-08-07 13:19:38 +02:00
Eren Gölge
48f8133eae Fix imports (#2845) 2023-08-07 13:19:26 +02:00
Javier
4e7f8cd021 Add fairseq onnx support and strict configuration, fixes some onnx errors (#2831) 2023-08-04 11:02:59 +02:00
ChaseC
52a528cfcf add post functionality to /api/tts (#2836) 2023-08-04 10:54:20 +02:00
Eren Gölge
dc04baa1ee Bump up to v0.16.1 v0.16.1 2023-07-31 15:54:45 +02:00
Eren Gölge
17ddd65741 Please p3.11 2023-07-31 15:53:19 +02:00
Eren Gölge
69f080eb47 Fix DelightfulTTS (#2823)
* Fix tests

* Make style
2023-07-31 13:52:45 +02:00
Eren Gölge
483888b9d8 Add kwargs to ignore extra arguments w/o error (#2822) 2023-07-31 11:37:35 +02:00
AWAS666
9e74b51aa6 Delightful TTS VCTK recipe fixes (#2808)
* fix: wrong import class

* fix: formatter name missing

* feat: get rid of clearml
2023-07-31 10:27:42 +02:00
Aleś Bułojčyk
d124f78430 Recipe for Belarusian TTS (#2756)
* Changes from jhlfrfufyfn <jhlfrfufyfn@gmail.com>

* Recipe for Belarusian TTS

---------

Co-authored-by: jhlfrfufyfn <jhlfrfufyfn@gmail.com>
2023-07-31 10:26:21 +02:00
Javier
c140df5a58 Adds multi-language support for VITS onnx, fixes onnx inference error when speaker_id is None or not passed, fixes onnx exporting for models with init_discriminator=false (#2816) 2023-07-31 10:19:49 +02:00
Eren Gölge
b739326503 Bump up to v0.16.0 v0.16.0 2023-07-24 16:04:10 +02:00
Eren Gölge
8aacb81849 Fix Tortoise load (#2791)
* Remove key prunning in tortoise

* Make lint
2023-07-24 13:42:47 +02:00
Eren Gölge
b3472a739e Update README.md 2023-07-24 13:42:20 +02:00
logan hart
6fdb88f8e2 Add Delightful-TTS implementation (#2095)
* add configs

* Update config file

* Add model configs

* Add model layers

* Add layer files

* Add layer modules

* change config names

* Add emotion manager

* fIX missing ap bug

* Fix missing ap bug

* Add base TTS e2e class

* Fix wrong variable name in load_tts_samples

* Add training script

* Remove range predictor and gaussian upsampling

* Add helper function

* Add vctk recipe

* Add conformer docs

* Fix linting in conformer.py

* Add Docs

* remove duplicate import

* refactor args

* Fix bugs

* Removew emotion embedding

* remove unused arg

* Remove emotion embedding arg

* Remove emotion embedding arg

* fix style issues

* Fix bugs

* Fix bugs

* Add unittests

* make style

* fix formatter bug

* fix test

* Add pyworld compute pitch func

* Update requirments.txt

* Fix dataset Bug

* Chnge layer norm to instance norm

* Add missing import

* Remove emotions.py

* remove ssim loss

* Add init layers func to aligner

* refactor model layers

* remove audio_config arg

* Rename loss func

* Rename to delightful-tts

* Rename loss func

* Remove unused modules

* refactor imports

* replace audio config with audio processor

* Add change sample rate option

* remove broken resample func

* update recipe

* fix style, add config docs

* fix tests and multispeaker embd dim

* remove pyworld

* Make style and fix inference

* Split tts tests

* Fixup

* Fixup

* Fixup

* Add argument names

* Set "random" speaker in the model Tortoise/Bark

* Use a diff f0_cache path for delightfull tts

* Fix delightful speaker handling

* Fix lint

* Make style

---------

Co-authored-by: loganhart420 <loganartpersonal@gmail.com>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-07-24 13:41:26 +02:00
Eren Gölge
f24c5e0276 Update README 2023-07-24 13:30:19 +02:00
Eren Gölge
1652598a33 Test synthesize api separately 2023-07-24 12:38:20 +02:00
Eren Gölge
0de12ec5aa API tests (#2790)
* Separate API tests and only run when uplifted

* Make style
2023-07-24 12:14:21 +02:00
Paul O'Leary McCann
c0aabb8596 Make Japanese-specific dependencies optional (#2776)
* Don't install MeCab by default

* Add optional [ja] deps, like [dev] etc

* Add JA requirements file

* Add JA requirements to requirements_all

This should help the tests run.
2023-07-24 11:28:27 +02:00
Aleś Bułojčyk
e5fb0d9627 Fix share model page URL (#2757) 2023-07-09 12:19:49 +02:00
Eren Gölge
672ec3b35e Fix #2749 (#2750) 2023-07-08 11:40:44 +02:00
Eren Gölge
b5cd644132 Bump up to v0.15.6 v0.15.6 2023-07-08 10:33:09 +02:00
Eren Gölge
a2984fb435 Fix #2745 (#2748) 2023-07-07 20:23:27 +02:00
Eren Gölge
7b5c8422c8 Export multispeaker onnx (#2743) 2023-07-06 13:36:50 +02:00
Eren Gölge
08bc758cad Merge pull request #2741 from coqui-ai/merge_2651
Resolve conflicts
2023-07-06 09:53:48 +02:00
JiangCheng
53938e2d32 Squashed commit of the following:
commit dd612fd72e
Author: JiangCheng <jiangcheng@kezaihui.com>
Date:   Mon Jun 5 16:04:54 2023 +0800

    Failed to download the file and need to delete the created file path
2023-07-05 12:08:05 +02:00
Eren Gölge
e42a72eb79 Fix typo 2023-07-04 12:14:54 +02:00
Eren Gölge
229cfbdf8a Update README.md 2023-07-04 12:09:50 +02:00
Wouter van der Velde
d611067d50 fixed small spelling mistakes (#2551) 2023-07-04 11:42:54 +02:00
ZhouGongZaiShi
d5f16d77c2 delete meaningless print() (#2662) 2023-07-04 11:38:17 +02:00
PiaoYang
630327c4e6 Update compute_embeddings.py (#2668)
* [Typo] Fix variable name. More readable description.

Update train_yourtts.py

Reformat.

Reformat using black again.

* Add `old_append`. Fix bool argparse.

* Reformat.
2023-07-04 11:37:47 +02:00
ChaseC
8957799e45 fix loading of model and vocoder configs (#2698) 2023-07-04 11:32:00 +02:00
Eren Gölge
505ac1aa8f Bump up to v0.15.5 v0.15.5 2023-07-03 11:18:06 +02:00
Eren Gölge
453d04836b Merge pull request #2733 from coqui-ai/update_docs
Update docs and credits
2023-07-03 11:17:15 +02:00
Eren Gölge
9b041f958b Update docs and credits 2023-07-02 13:09:40 +02:00
Eren G??lge
21a3f280de Bump up to v0.15.4 v0.15.4 2023-06-30 15:05:00 +02:00
Eren G??lge
90cf712bb4 Update docs 2023-06-30 14:58:15 +02:00
Eren Gölge
588fe21310 Update docs 2023-06-30 14:40:54 +02:00
Eren Gölge
f9cde7bb1b Bump up to v0.15.3 2023-06-30 14:30:18 +02:00