mirror of
https://github.com/coqui-ai/TTS.git
synced 2026-02-24 20:19:54 +01:00
* Implement most similar ref training approach * Use non-enhanced hifigan for test samples * Add Perceiver * Update GPT Trainer for perceiver support * Update XTTS docs * Bug fix masking with XTTS perceiver * Bug fix on gpt forward * Bug Fix on XTTS v2.0 training * Add XTTS v2.0 unit tests * Add XTTS v2.0 inference unit tests * Bug Fix on diffusion inference * Add XTTS v2.0 training recipe * Placeholder model entry * Add cloning params to config * Make prompt embedding configurable * Make cloning configurable * Cheap fix for a cheaper fix * Prevent resampling * Update model entry * Update docs * Update requirements * Code linting * Add xtts v2 to sep tests * Bug fix on XTTS get_gpt_cond_latents * Bug fix on rebase * Make style * Bug fix in Japenese tokenizer * Add num2words to deps * Remove unused kwarg and added num_beams=1 as default --------- Co-authored-by: Eren G??lge <egolge@coqui.ai>
57 lines
960 B
Plaintext
57 lines
960 B
Plaintext
# core deps
|
|
numpy==1.22.0;python_version<="3.10"
|
|
numpy==1.24.3;python_version>"3.10"
|
|
cython==0.29.30
|
|
scipy>=1.11.2
|
|
torch>=1.7
|
|
torchaudio
|
|
soundfile==0.12.*
|
|
librosa==0.10.*
|
|
scikit-learn==1.3.0
|
|
numba==0.55.1;python_version<"3.9"
|
|
numba==0.57.0;python_version>="3.9"
|
|
inflect==5.6.*
|
|
tqdm==4.64.*
|
|
anyascii==0.3.*
|
|
pyyaml==6.*
|
|
fsspec==2023.6.0 # <= 2023.9.1 makes aux tests fail
|
|
aiohttp==3.8.*
|
|
packaging==23.1
|
|
# deps for examples
|
|
flask==2.*
|
|
# deps for inference
|
|
pysbd==0.3.4
|
|
# deps for notebooks
|
|
umap-learn==0.5.*
|
|
pandas>=1.4,<2.0
|
|
# deps for training
|
|
matplotlib==3.7.*
|
|
# coqui stack
|
|
trainer
|
|
# config management
|
|
coqpit>=0.0.16
|
|
# chinese g2p deps
|
|
jieba
|
|
pypinyin
|
|
# korean
|
|
hangul_romanize
|
|
# gruut+supported langs
|
|
gruut[de,es,fr]==2.2.3
|
|
# deps for korean
|
|
jamo
|
|
nltk
|
|
g2pkk>=0.1.1
|
|
# deps for bangla
|
|
bangla
|
|
bnnumerizer
|
|
bnunicodenormalizer
|
|
#deps for tortoise
|
|
k_diffusion
|
|
einops==0.6.*
|
|
transformers==4.33.*
|
|
#deps for bark
|
|
encodec==0.1.*
|
|
# deps for XTTS
|
|
unidecode==1.3.*
|
|
num2words
|