4668 Commits

Author SHA1 Message Date
Reuben Morais
f829bf50f8 Bump version to v0.17.4 (really) v0.17.4 2023-09-15 16:40:34 +02:00
Eren G??lge
aa8fa4756e Bump up to v0.17.4 v0.17.3 2023-09-14 17:52:44 +02:00
Eren G??lge
9d0b76ce23 Check env var for COQUI_TOS_AGREED 2023-09-14 17:51:40 +02:00
Eren G??lge
13dd7c4c9e Bump up to v0.17.2 v0.17.2 2023-09-14 15:24:05 +02:00
Eren G??lge
ded7fd4fb2 Make style 2023-09-14 15:23:37 +02:00
Eren G??lge
44b61d2b92 Fixup 2023-09-14 15:22:54 +02:00
Eren Gölge
623ea41634 Fix model tests (#2943) 2023-09-14 15:21:48 +02:00
Eren G??lge
af62613c86 Bump up to v0.17.1 v0.17.1 2023-09-13 18:23:39 +02:00
Eren G??lge
ee7cee0e35 Fixup 2023-09-13 18:21:44 +02:00
Eren G??lge
5dcf9ae311 Bump up v0.17.0 v0.17.0 2023-09-13 18:04:26 +02:00
Eren Gölge
4033db5f4b 🔥 XTTS implementation 2023-09-13 17:51:24 +02:00
Edresson Casanova
4d3f23b5d3 Add CML-TTS dataset YourTTS training recipe (#2934) 2023-09-12 11:49:14 +02:00
Eren Gölge
9533f8656c Make style 2023-09-04 13:58:37 +02:00
Eren Gölge
562a9509f2 Add BE model 2023-09-04 13:57:03 +02:00
Eren Gölge
b4c82685a7 Add model entries 2023-09-04 13:04:58 +02:00
T145
cdc971ff74 Fixed spectrogram checking on librosa 0.10.x (#2899) 2023-09-04 12:58:27 +02:00
Cohee
b3b1555d82 Fix exception handling in manage.py (#2912) 2023-09-04 12:54:30 +02:00
Eren G??lge
40b527345f Bump up to v0.16.6 v0.16.6 2023-09-04 12:51:53 +02:00
Eren Gölge
d1d95707bd Update docs (#2919) 2023-09-04 12:28:36 +02:00
Unik
32b8ebb633 Updated scipy version (#2914) 2023-09-04 11:39:19 +02:00
Aleś Bułojčyk
fead04f779 Add phonemizer for Belarusian language (#2856) 2023-08-28 11:20:45 +02:00
Jake Tae
b79b6f0762 feature: add device flag to tts cli (#2875) 2023-08-28 11:20:12 +02:00
Jake Tae
fa0cbd78fe Update README with new device API (#2876)
* docs: update readme w/ .to(device) api

* docs: add .to(device) in python quickstart

* docs: move section header out of comment

* chore: use device instead of hard-coded string

* docs: update inference.md
2023-08-28 11:19:00 +02:00
Eren Gölge
c0b5e61749 Bump up to v0.16.5 v0.16.5 2023-08-26 12:00:25 +02:00
Eren Gölge
a7a96d08dd Fix loading Bark (#2893)
* Fixup hubert path

* Make style
2023-08-26 11:59:00 +02:00
Eren Gölge
04a36a727b Bump up to v0.16.4 v0.16.4 2023-08-26 10:39:48 +02:00
Eren Gölge
a96562a750 Update .models.json 2023-08-26 10:36:40 +02:00
Jake Tae
409db505d2 Add device support in TTS and Synthesizer (#2855)
* fix: resolve merge conflicts

* fix: retain backwards compatability in functions

* feature: utilize device for voice transfer

* feature: use device for vocoder

* chore: cleanup vocoder cpu logic

* fix: add necessary vocoder output device check

* fix: add necessary vocoder output device check

* fix: indentation

* fix: check if waveform is pt tensor before cpu conversion

---------

Co-authored-by: Jake Tae <jaketae@Jakes-MacBook-Pro-2.local>
2023-08-14 21:04:44 +02:00
Julian Weber
febcaf710a Add customizable data home path (#2871)
* Add customizable data home path

* Add TTS_HOME as an option
2023-08-14 21:02:48 +02:00
Eren Gölge
c4e5effab9 Bump up to v0.16.3 v0.16.3 2023-08-13 12:22:04 +02:00
Michael New
1f9d600b83 Denote human voices in README.md (#2851) 2023-08-13 12:15:17 +02:00
Eren Gölge
3a104d5c49 Update Studio API for XTTS (#2861)
* Update Studio API for XTTS

* Update the docs

* Update README.md

* Update README.md

Update README
2023-08-13 12:04:12 +02:00
Eren G??lge
37b558ccb9 Make style 2023-08-11 12:55:23 +02:00
Eren G??lge
9a8352b8da Fix import error with Bark 2023-08-11 03:33:59 +02:00
Eren Gölge
c87377b713 Bump up to v0.16.2 v0.16.2 2023-08-07 13:21:14 +02:00
Eren Gölge
4186f42b21 Handle missing JA phonemizer (#2843)
* Handle missing JA phonemizer

* Make style
2023-08-07 13:19:38 +02:00
Eren Gölge
48f8133eae Fix imports (#2845) 2023-08-07 13:19:26 +02:00
Javier
4e7f8cd021 Add fairseq onnx support and strict configuration, fixes some onnx errors (#2831) 2023-08-04 11:02:59 +02:00
ChaseC
52a528cfcf add post functionality to /api/tts (#2836) 2023-08-04 10:54:20 +02:00
Eren Gölge
dc04baa1ee Bump up to v0.16.1 v0.16.1 2023-07-31 15:54:45 +02:00
Eren Gölge
17ddd65741 Please p3.11 2023-07-31 15:53:19 +02:00
Eren Gölge
69f080eb47 Fix DelightfulTTS (#2823)
* Fix tests

* Make style
2023-07-31 13:52:45 +02:00
Eren Gölge
483888b9d8 Add kwargs to ignore extra arguments w/o error (#2822) 2023-07-31 11:37:35 +02:00
AWAS666
9e74b51aa6 Delightful TTS VCTK recipe fixes (#2808)
* fix: wrong import class

* fix: formatter name missing

* feat: get rid of clearml
2023-07-31 10:27:42 +02:00
Aleś Bułojčyk
d124f78430 Recipe for Belarusian TTS (#2756)
* Changes from jhlfrfufyfn <jhlfrfufyfn@gmail.com>

* Recipe for Belarusian TTS

---------

Co-authored-by: jhlfrfufyfn <jhlfrfufyfn@gmail.com>
2023-07-31 10:26:21 +02:00
Javier
c140df5a58 Adds multi-language support for VITS onnx, fixes onnx inference error when speaker_id is None or not passed, fixes onnx exporting for models with init_discriminator=false (#2816) 2023-07-31 10:19:49 +02:00
Eren Gölge
b739326503 Bump up to v0.16.0 v0.16.0 2023-07-24 16:04:10 +02:00
Eren Gölge
8aacb81849 Fix Tortoise load (#2791)
* Remove key prunning in tortoise

* Make lint
2023-07-24 13:42:47 +02:00
Eren Gölge
b3472a739e Update README.md 2023-07-24 13:42:20 +02:00
logan hart
6fdb88f8e2 Add Delightful-TTS implementation (#2095)
* add configs

* Update config file

* Add model configs

* Add model layers

* Add layer files

* Add layer modules

* change config names

* Add emotion manager

* fIX missing ap bug

* Fix missing ap bug

* Add base TTS e2e class

* Fix wrong variable name in load_tts_samples

* Add training script

* Remove range predictor and gaussian upsampling

* Add helper function

* Add vctk recipe

* Add conformer docs

* Fix linting in conformer.py

* Add Docs

* remove duplicate import

* refactor args

* Fix bugs

* Removew emotion embedding

* remove unused arg

* Remove emotion embedding arg

* Remove emotion embedding arg

* fix style issues

* Fix bugs

* Fix bugs

* Add unittests

* make style

* fix formatter bug

* fix test

* Add pyworld compute pitch func

* Update requirments.txt

* Fix dataset Bug

* Chnge layer norm to instance norm

* Add missing import

* Remove emotions.py

* remove ssim loss

* Add init layers func to aligner

* refactor model layers

* remove audio_config arg

* Rename loss func

* Rename to delightful-tts

* Rename loss func

* Remove unused modules

* refactor imports

* replace audio config with audio processor

* Add change sample rate option

* remove broken resample func

* update recipe

* fix style, add config docs

* fix tests and multispeaker embd dim

* remove pyworld

* Make style and fix inference

* Split tts tests

* Fixup

* Fixup

* Fixup

* Add argument names

* Set "random" speaker in the model Tortoise/Bark

* Use a diff f0_cache path for delightfull tts

* Fix delightful speaker handling

* Fix lint

* Make style

---------

Co-authored-by: loganhart420 <loganartpersonal@gmail.com>
Co-authored-by: Eren Gölge <erogol@hotmail.com>
2023-07-24 13:41:26 +02:00