Commit Graph - TTS - Code by DblK

Mirrors/TTS

Fork 0

mirror of https://github.com/coqui-ai/TTS.git synced 2026-07-10 20:41:10 +02:00

Commit Graph

Select branches

Hide Pull Requests

1316-rebase

Edresson-patch-1

Fix-typo-workflow-text

Fix_#1380

VITS-debug-dp

VITS-upsample

add-synpaflex-formatter

add_lang_code

add_neural_hmm_model

api_model_path

attribute-extraction

audio_len_sampler

calling_hf_models

cml-recipe

cml-tts-recipe

conflict_resolve

cpu-only-docker-image

d_vector_serialization

dev

dev-emotion

dev-emotion-rebased

dev-fix-scl

dev-managers

dev-update-readme

dev-yourtts-rec

dev_bark

docs_fix

emotion-dataset

emotion_cond_module

env_tos

fairseq_port

faster-linting

finetune_bark

fix-1423

fix-asserts+mas

fix-publish

fix-xtts-download

fix_#2797

fix_aux_tests

fix_bce_loss

fix_comp_emb

fix_condition_dp_on_speaker

fix_delightfulltts

fix_doc_dataset

fix_style

fix_warn

fix_xtts_load

fix_xtts_v1.1

fix_xtts_v1.1_

fix_zoo_test

force_phonemizer_definition

hifigan

init_stt_v2

main

mp3_len_fix

p3_11

patch-print-license

print_license_info

remove_v1_doc

revert-3038-xtts_redonwload

run_ci_for_v0.20.6

thorsten_models

timeout-check-config

tortoise_fixup

tortoise_pr_fixup

unbind_trainer_ver

unified_api_forwardtts2

update-coqpit-req

update-workflows

update_checkpoint_name

update_model_save

update_trainer

update_xtts_cloning

vits-pitch-pred

vits-pitch-pred-new

xtts

xtts_demo

xtts_redonwload

xtts_trainer

yourtts_adaptive_weight

#1

#10

#1000

#1007

#1009

#1020

#1021

#1022

#1026

#1027

#1028

#1031

#1032

#1043

#1044

#1047

#1048

#1049

#1050

#1051

#1054

#1055

#1056

#1060

#1069

#1078

#1079

#1080

#11

#1125

#1162

#1162

#1190

#1191

#1195

#1207

#1216

#1224

#1225

#1227

#1228

#1238

#1251

#1265

#1274

#1276

#1279

#1280

#1282

#1288

#1303

#1315

#1316

#1324

#1337

#1346

#1347

#1348

#1349

#1353

#1354

#1363

#1365

#1367

#1370

#1374

#1386

#1391

#1395

#1397

#1399

#1402

#1403

#1404

#1405

#1406

#1409

#1411

#1418

#1419

#1422

#1424

#1431

#1434

#1435

#1436

#1437

#1441

#1451

#1452

#1456

#1459

#1461

#1463

#1469

#1470

#1478

#1498

#15

#1510

#1511

#1512

#1514

#1524

#1526

#1527

#1532

#1535

#1537

#1538

#1539

#1540

#1541

#1542

#1544

#1545

#1550

#1555

#1556

#1557

#1558

#1560

#1561

#1562

#1564

#1565

#1567

#1570

#1572

#1573

#1574

#1581

#1583

#1584

#1587

#1589

#1597

#1599

#16

#1616

#1618

#1620

#1623

#1629

#1630

#1638

#1640

#1641

#1653

#1661

#1664

#1666

#1675

#1676

#1679

#1679

#1682

#1686

#1694

#1705

#1717

#1718

#1726

#1739

#1749

#1753

#1760

#1765

#1767

#1768

#1776

#1780

#1791

#1792

#1796

#1797

#1801

#1807

#1809

#1810

#1821

#1822

#1825

#1835

#1844

#1853

#1871

#1872

#1873

#1882

#1898

#1905

#1912

#1914

#1928

#1933

#1938

#1940

#1942

#1945

#1946

#1967

#1970

#1971

#1977

#1978

#1991

#1994

#2

#2001

#2019

#2024

#2048

#2054

#2065

#2066

#2071

#2073

#2077

#2079

#2086

#2095

#2102

#2103

#2114

#2116

#2132

#2135

#2136

#2138

#2140

#2144

#2146

#2149

#2150

#2153

#2154

#2161

#2162

#2169

#2178

#2183

#2187

#2189

#2194

#2195

#2198

#2203

#2204

#2205

#2206

#2211

#2218

#2223

#2226

#2228

#2229

#2234

#2242

#2244

#2245

#2248

#2249

#2253

#2257

#2271

#2272

#2276

#2277

#2284

#2295

#2303

#2305

#2310

#2314

#2316

#2325

#2326

#2328

#2329

#2337

#2339

#2349

#2352

#2357

#2364

#2377

#2379

#2380

#2389

#2390

#2393

#2399

#2407

#2418

#2427

#2435

#2445

#2445

#2451

#2460

#2462

#2463

#2463

#2470

#2475

#2478

#2480

#2484

#2485

#2489

#2495

#2499

#2508

#2509

#2515

#2518

#2519

#2524

#2526

#2527

#2532

#2533

#2540

#2543

#2547

#2549

#2551

#2563

#2571

#2572

#2576

#2577

#2582

#2587

#2592

#2595

#2596

#2600

#2603

#2604

#2616

#2617

#2626

#2628

#2629

#2638

#2647

#2650

#2651

#2653

#2655

#2659

#2661

#2662

#2663

#2666

#2666

#2667

#2668

#2671

#2682

#2685

#2695

#2697

#2698

#2700

#2718

#2725

#2733

#2735

#2741

#2743

#2748

#2750

#2751

#2754

#2756

#2757

#2776

#2783

#2790

#2791

#2806

#2808

#2816

#2822

#2823

#2826

#2831

#2836

#2838

#2840

#2843

#2845

#2846

#2851

#2854

#2855

#2856

#2861

#2870

#2871

#2875

#2876

#2893

#2894

#2899

#2909

#2912

#2914

#2919

#2922

#2925

#2926

#2934

#2939

#2943

#2945

#2949

#2950

#2951

#2954

#2956

#2961

#2963

#2970

#2981

#2983

#2990

#2992

#2993

#2999

#3

#3001

#3003

#3004

#3009

#3010

#3011

#3012

#3021

#3027

#3035

#3036

#3038

#3048

#3057

#3058

#3061

#3062

#3065

#3066

#3070

#3071

#3081

#3082

#3086

#3089

#3090

#3092

#3093

#3094

#3096

#3103

#3105

#3108

#3109

#3115

#3117

#3120

#3126

#3127

#3128

#3129

#3130

#3133

#3137

#3138

#3149

#3150

#3151

#3154

#3156

#3158

#3159

#3160

#3168

#3169

#3170

#3172

#3173

#3176

#3179

#3182

#3183

#3195

#3201

#3203

#3207

#3208

#3210

#3214

#3215

#3216

#3226

#3227

#3230

#3238

#3239

#3241

#3242

#3243

#3247

#3248

#3249

#3251

#3263

#3273

#3275

#3279

#3281

#3286

#3294

#3296

#3297

#3300

#3318

#3319

#3329

#3336

#3341

#3349

#3351

#3352

#3353

#3355

#3368

#3373

#3381

#3385

#3390

#3391

#3392

#3404

#3405

#3412

#3414

#3422

#3423

#3437

#3442

#3446

#3450

#3469

#3471

#3476

#3476

#3487

#3492

#3497

#3500

#3502

#3504

#3509

#3511

#3523

#3528

#3547

#3547

#3561

#3582

#3589

#3594

#3597

#3615

#3622

#3625

#3625

#3632

#3633

#3634

#3651

#3652

#3660

#3666

#367

#3672

#3690

#3696

#3708

#3719

#3720

#3724

#373

#3730

#3747

#375

#3755

#3765

#3769

#3788

#3789

#379

#3792

#3809

#3832

#384

#3852

#3853

#393

#394

#395

#3963

#3963

#3969

#3973

#3977

#398

#3994

#3997

#4

#4000

#4021

#4023

#4052

#4088

#4088

#4090

#4090

#4091

#4091

#4115

#4120

#4166

#4173

#4174

#4175

#4175

#4182

#4188

#4194

#420

#4214

#422

#423

#4298

#430

#4300

#4306

#4320

#4330

#4336

#4337

#435

#4351

#4352

#4355

#4358

#4362

#4362

#4367

#4368

#4380

#4384

#4384

#4390

#4397

#4399

#4401

#4402

#4403

#4404

#4407

#441

#4413

#4414

#4421

#4423

#445

#446

#453

#454

#457

#462

#468

#476

#477

#479

#481

#487

#488

#495

#5

#501

#502

#506

#508

#510

#518

#519

#520

#523

#524

#526

#527

#532

#535

#543

#544

#545

#546

#550

#551

#552

#557

#558

#559

#561

#562

#572

#581

#586

#6

#602

#606

#609

#611

#613

#617

#619

#620

#628

#629

#630

#642

#645

#656

#658

#664

#667

#673

#674

#683

#685

#689

#694

#696

#699

#7

#701

#702

#706

#708

#713

#716

#717

#718

#719

#722

#724

#725

#726

#727

#731

#736

#742

#751

#758

#762

#766

#777

#784

#785

#787

#789

#790

#791

#792

#793

#800

#802

#803

#814

#815

#831

#847

#848

#882

#887

#888

#889

#891

#893

#898

#899

#900

#901

#914

#922

#931

#937

#943

#977

#983

#984

#995

speaker_encoder_model

v0.0.10

v0.0.11

v0.0.12

v0.0.13

v0.0.14

v0.0.15

v0.0.15.1

v0.0.9

v0.1.0

v0.1.1

v0.1.2

v0.1.3

v0.10.0

v0.10.0_models

v0.10.1

v0.10.1_models

v0.10.2

v0.11.0

v0.11.0_models

v0.11.1

v0.12.0

v0.13.0

v0.13.0_models

v0.13.1

v0.13.2

v0.13.3

v0.13.3_models

v0.14.0

v0.14.0_models

v0.14.1

v0.14.1_models

v0.14.2

v0.14.3

v0.15.0

v0.15.1

v0.15.2

v0.15.4

v0.15.5

v0.15.6

v0.16.0

v0.16.1

v0.16.2

v0.16.3

v0.16.4

v0.16.5

v0.16.6

v0.17.0

v0.17.1

v0.17.10

v0.17.2

v0.17.3

v0.17.4

v0.17.5

v0.17.6

v0.17.7

v0.17.8

v0.17.9

v0.18.0

v0.18.1

v0.18.2

v0.19.0

v0.19.1

v0.2.0

v0.2.1

v0.2.2

v0.20.0

v0.20.1

v0.20.2

v0.20.3

v0.20.4

v0.20.5

v0.20.6

v0.21.0

v0.21.1

v0.21.2

v0.21.3

v0.22.0

v0.3.0

v0.3.1

v0.4.0

v0.4.1

v0.4.2

v0.5.0

v0.5.0_models

v0.6.0

v0.6.0_models

v0.6.1

v0.6.1_models

v0.6.2

v0.6.2_models

v0.7.0

v0.7.0_models

v0.7.1

v0.7.1_models

v0.8.0

v0.8.0_models

v0.9.0

d9452d7038 Add end2end VITS loss Edresson Casanova 2022-06-02 13:50:08 -03:00
4a5143679d Make style add-synpaflex-formatter WeberJulian 2022-06-01 18:36:58 +02:00
68cef28a88 Adding TTS Tutorials (#1584) Aya-AlJafari 2022-06-02 13:23:00 +03:00
1f23c6f106 Fix formatter WeberJulian 2022-05-23 23:06:02 +02:00
5857941043 Add synpaflex formatter WeberJulian 2022-05-23 18:43:47 +02:00
f70e82cd19 Use fsspec and torch for embedding file IO (#1581) Eren Gölge 2022-06-01 13:49:42 +02:00
d7de111b76 Change default speakers file extension d_vector_serialization WeberJulian 2022-06-01 10:31:26 +02:00
f7b75e880d Add Thorsten VITS model thorsten_models Eren Gölge 2022-05-31 15:10:04 +02:00
b6bd74a9a9 fix invalid json (#1599) Ryan Le-Nguyen 2022-05-31 18:20:10 +10:00
39f31e3d39 Make style WeberJulian 2022-05-30 21:52:40 +02:00
9f1670010b Add dummy speakers.pth file WeberJulian 2022-05-30 20:54:44 +02:00
41d86e0fe1 Set use_cuda to true if available WeberJulian 2022-05-30 18:58:38 +02:00
fbb450616c Fix compute embedding script WeberJulian 2022-05-30 18:55:11 +02:00
01115124a7 Fix load and save files WeberJulian 2022-05-30 18:52:41 +02:00
7eceead0d7 Fixup Eren Gölge 2022-05-19 10:35:32 +02:00
15fb20c7c0 Use fsspec and torch for embedding file Eren Gölge 2022-05-19 10:32:19 +02:00
a790df4e94 Training recipes for thorsten dataset (#1020) speaker_encoder_model Noran Raskin 2022-05-30 12:07:31 +02:00
71111d14e4 Merge pull request #1587 from ribeiromiranda/patch-1 Eren Gölge 2022-05-29 14:51:08 +02:00
b7080f0f64 Recreate the prior distribution of Capacitron VAE on the right device Edresson Casanova 2022-05-27 16:41:13 -03:00
bd35371944 Add prosody encoder inference support Edresson Casanova 2022-05-27 16:00:41 -03:00
2568b722dd Add an option to detach the prosody encoder input Edresson Casanova 2022-05-27 13:19:06 -03:00
a2aecea8f3 Add VAE prosody encoder Edresson Casanova 2022-05-27 12:53:56 -03:00
312789edbf Condition the prosody encoder on z_p Edresson Casanova 2022-05-26 15:41:24 -03:00
e667fcb057 Support prosody conditional model on decoder input Edresson Casanova 2022-05-25 17:03:38 -03:00
6c23398518 Add emotion classifier loss Edresson Casanova 2022-05-25 10:05:52 -03:00
5b641ff0e3 Fix compute embeddings issue Edresson Casanova 2022-05-25 10:05:28 -03:00
d699416735 Add conditional module Edresson Casanova 2022-05-23 14:05:17 -03:00
6e4b13c6cc Fix unit tests Edresson Casanova 2022-05-23 10:26:45 -03:00
749b217884 Fix rebase issues Edresson Casanova 2022-05-20 18:29:39 -03:00
1a88191a5a Disable the reversal prosody encoder speaker loss Edresson Casanova 2022-05-19 13:55:14 +00:00
8505cd09e8 Add text encoder reversal speaker classifier loss Edresson Casanova 2022-05-17 13:14:15 +00:00
024e567849 Clean up old code Edresson Casanova 2022-05-16 13:09:12 +00:00
dbaa71c944 Add prosody encoder params on config Edresson Casanova 2022-05-16 09:45:28 -03:00
4107a1ef85 Add Speech style balancer Edresson Casanova 2022-04-19 15:51:15 -03:00
d49c6ab72f Add reversal classifier loss Edresson Casanova 2022-04-18 21:09:59 -03:00
004862a79b Add prosody encoder training support Edresson Casanova 2022-04-18 17:01:44 -03:00
d9d9415513 Add emotion embedding in the encoder Edresson Casanova 2022-03-31 19:14:41 -03:00
b2b54668bc Add formatter for the Emotional Speech Dataset Edresson Casanova 2022-03-31 17:27:30 +00:00
d2b5db84f0 Remove useless encoder weights reload Edresson Casanova 2022-03-31 11:05:58 -03:00
721a81b1d8 Fix emotion unit test Edresson Casanova 2022-03-31 08:34:08 -03:00
01dd4e4051 Fix Style tests Edresson Casanova 2022-03-30 16:51:39 -03:00
46762ccf35 Fix style tests Edresson Casanova 2022-03-23 15:31:33 -03:00
d8d1775273 Fix the Bug in Synthesizer Edresson Casanova 2022-03-30 15:32:35 -03:00
39575f2937 Bug fix in single speaker emotion embedding training Edresson Casanova 2022-03-16 20:57:14 +00:00
d1ab3298ba Fix unit tests Edresson Casanova 2022-03-15 19:40:07 +00:00
a3ecaf3bdd Add emotion external embeddings training unit test Edresson Casanova 2022-03-15 13:09:58 +00:00
5e0286c534 Add emotion consistency loss Edresson Casanova 2022-03-15 12:35:00 +00:00
e5c7ae9f1b Fix the bug in sythesizer Edresson Casanova 2022-03-15 12:33:36 +00:00
6f95522edf Add Emotion Support for the VITS model Edresson Casanova 2022-03-15 01:16:48 +00:00
d8f5cb2674 Add emotion manager Edresson Casanova 2022-03-14 14:26:40 +00:00
3b84ef9524 Fixed use_cuda issue in compute_embeddings.py André R. de Miranda 2022-05-20 12:46:46 -03:00
8be21ec387 Capacitron (#977) a-froghyar 2022-05-20 16:17:11 +02:00
ee99a6c1e2 Fix voice conversion inference (#1583) Edresson Casanova 2022-05-20 08:16:01 -03:00
63d27bc8d4 Disable the reversal prosody encoder speaker loss dev-emotion Edresson Casanova 2022-05-19 13:55:14 +00:00
bdefc43d96 Bug fix on pre-compute F0 vits-pitch-pred Edresson Casanova 2022-05-19 13:48:02 +00:00
7b85703b28 Add text encoder reversal speaker classifier loss Edresson Casanova 2022-05-17 13:14:15 +00:00
8adcd1de8e Rename g as spk_emb unified_api_forwardtts2 Eren G??lge 2022-05-17 13:37:05 +02:00
2d29e8219d Fix up Eren G??lge 2022-05-17 13:36:40 +02:00
8e915b70e0 Make hifigan discriminator configurable Eren G??lge 2022-05-17 13:34:57 +02:00
c437db15fd Fix dirt Eren G??lge 2022-05-17 13:34:38 +02:00
a05c82f9ef Fix audio_config handling Eren Gölge 2022-04-22 12:50:10 +02:00
b3fb0e19e8 Implement get_state_dict Eren Gölge 2022-04-22 12:39:46 +02:00
ce4f96292a Remove remaned trainer functions Eren Gölge 2022-04-22 12:37:35 +02:00
96779e75ba Return duration by ForwardTTS inference Eren Gölge 2022-04-19 11:00:15 +02:00
9291d13c69 Make style Eren Gölge 2022-04-19 10:59:59 +02:00
edd59c81e8 Update ForwardTTSe2e tests Eren Gölge 2022-04-19 10:58:52 +02:00
0b585b46c1 Refactor TTSDataset to use numpy transforms Eren Gölge 2022-04-19 09:23:18 +02:00
4171f4e9c6 Update ForwardTTSE2eLoss Eren Gölge 2022-04-19 09:22:50 +02:00
dbe5eb992e Make AP optional in BaseTTS Eren Gölge 2022-04-19 09:22:08 +02:00
6a53b77a95 Add numpy and torch transforms Eren Gölge 2022-04-19 09:21:46 +02:00
c3fb49bf76 Refactor ForwardTTS to skip decoder Eren Gölge 2022-04-19 09:21:31 +02:00
cc57c20162 Make plot results more general Eren Gölge 2022-04-19 09:20:31 +02:00
e7c5db0d97 Add missing kernel size attr to transformer layer Eren Gölge 2022-04-19 09:19:57 +02:00
231c69b12e Remove AP from FastPitchE2e Eren Gölge 2022-04-19 09:19:07 +02:00
4556c61902 Update fastpitche2e recipe Eren Gölge 2022-04-19 09:18:49 +02:00
5f9d559419 Update import statements Eren Gölge 2022-04-19 09:16:03 +02:00
9f8d86b716 Remove redundancy Eren Gölge 2022-04-04 09:46:30 +02:00
0738cb0efe Fix Vocoder logging Eren Gölge 2022-04-04 09:46:10 +02:00
760f045aaa Rename vars in VITS Eren Gölge 2022-04-04 09:45:46 +02:00
775a6ab6ee Add cond layer in decoder Eren Gölge 2022-04-04 09:44:20 +02:00
28a53c7462 Refactor multi-speaker init in ForwardTTS Eren Gölge 2022-04-04 09:43:46 +02:00
c125024da0 Implement BaseTTSE2E Eren Gölge 2022-04-04 09:43:15 +02:00
b16613c5ad Implement ForwardTTSE2E Loss Eren Gölge 2022-04-04 09:42:50 +02:00
aea8cb7668 Implement FastPitchE2E LJSpeech recipe Eren Gölge 2022-04-04 09:41:46 +02:00
2a61b8fdaf Implement ForwardTTSE2E tests Eren Gölge 2022-04-04 09:41:25 +02:00
85731482e1 Implement FastPitchE2EConfig Eren Gölge 2022-04-04 09:41:05 +02:00
fccda5ae7b Implement ForwardTTSE2Eg Eren Gölge 2022-04-04 09:40:36 +02:00
d94b8bac02 Add pitch predictor Edresson Casanova 2022-05-16 21:53:49 +00:00
dcd0d1f6a1 Clean up old code Edresson Casanova 2022-05-16 13:09:12 +00:00
3a524b0597 Add prosody encoder params on config Edresson Casanova 2022-05-16 09:45:28 -03:00
f237e4ccd9 Merge pull request #1574 from coqui-ai/update_badge v0.7.0_models Eren Gölge 2022-05-13 14:58:05 +02:00
e282da5161 Update CI badges Eren Gölge 2022-05-13 14:56:49 +02:00
e5d8ec2402 Change the VITS upsampling interpolation trick to linear (#1564) Edresson Casanova 2022-05-13 05:52:39 -03:00
c6008e5235 Add audio length sampler balancer (#1561) Edresson Casanova 2022-05-12 14:59:19 -03:00
6e460b7e42 Add an assert for the upsampling trick (#1538) Eren Gölge 2022-05-12 19:55:24 +02:00
6048959e24 Add CPU only Docker image (#1573) Eren Gölge 2022-05-12 19:33:27 +02:00
002f826a1c Add unit tests audio_len_sampler Edresson Casanova 2022-05-07 15:56:39 -03:00
e86e3d2e87 Add audio length sampler balancer Edresson Casanova 2022-05-07 14:42:58 -03:00
5e4bd9bfe8 Merge branch 'cpu-only-docker-image' of https://github.com/coqui-ai/TTS into cpu-only-docker-image cpu-only-docker-image Eren Gölge 2022-05-12 18:50:06 +02:00
085517b79a Fix Dockerfile Eren Gölge 2022-05-12 12:55:27 +02:00

... 8 9 10 11 12 ...