
<div align="center">
  <div>&nbsp;</div>
  <img src="resources/openvoicelogo.jpg" width="400"/> 

[Paper](https://arxiv.org/abs/2312.01479) |
[Website](https://research.myshell.ai/open-voice) <br> <br>
<a href="https://trendshift.io/repositories/6161" target="_blank"><img src="https://trendshift.io/api/badge/repositories/6161" alt="myshell-ai%2FOpenVoice | Trendshift" style="width: 250px; height: 55px;" width="250" height="55"/></a>
</div>

## Introduction

### OpenVoice V1

As we detailed in our [paper](https://arxiv.org/abs/2312.01479) and [website](https://research.myshell.ai/open-voice), the advantages of OpenVoice are three-fold:

**1. Accurate Tone Color Cloning.**
OpenVoice can accurately clone the reference tone color and generate speech in multiple languages and accents.

**2. Flexible Voice Style Control.**
OpenVoice enables granular control over voice styles, such as emotion and accent, as well as other style parameters including rhythm, pauses, and intonation. 

**3. Zero-shot Cross-lingual Voice Cloning.**
Neither the language of the generated speech nor the language of the reference speech needs to be present in the massive-speaker multi-lingual training dataset.

### OpenVoice V2

In April 2024, we released OpenVoice V2, which includes all V1 features and adds:

**1. Better Audio Quality.**
OpenVoice V2 adopts a different training strategy that delivers better audio quality.

**2. Native Multi-lingual Support.**
English, Spanish, French, Chinese, Japanese and Korean are natively supported in OpenVoice V2.

**3. Free Commercial Use.**
Since April 2024, both V2 and V1 have been released under the MIT License and are free for commercial use.

[Video](https://github.com/myshell-ai/OpenVoice/assets/40556743/3cba936f-82bf-476c-9e52-09f0f417bb2f)

OpenVoice has been powering the instant voice cloning capability of [myshell.ai](https://app.myshell.ai/explore) since May 2023. By Nov 2023, the voice cloning model had been used tens of millions of times by users worldwide and had witnessed explosive user growth on the platform.

## Main Contributors

- [Zengyi Qin](https://www.qinzy.tech) at MIT
- [Wenliang Zhao](https://wl-zhao.github.io) at Tsinghua University
- [Xumin Yu](https://yuxumin.github.io) at Tsinghua University
- [Ethan Sun](https://twitter.com/ethan_myshell) at MyShell

## How to Use
Please see [usage](docs/USAGE.md) for detailed instructions.

## Common Issues

Please see [QA](docs/QA.md) for common questions and answers. We will regularly update the question and answer list.

## Citation
```
@article{qin2023openvoice,
  title={OpenVoice: Versatile Instant Voice Cloning},
  author={Qin, Zengyi and Zhao, Wenliang and Yu, Xumin and Sun, Xin},
  journal={arXiv preprint arXiv:2312.01479},
  year={2023}
}
```

## License
OpenVoice V1 and V2 are MIT licensed and free for both commercial and research use.

## Acknowledgements
This implementation is based on several excellent projects: [TTS](https://github.com/coqui-ai/TTS), [VITS](https://github.com/jaywalnut310/vits), and [VITS2](https://github.com/daniilrobnikov/vits2). Thanks for their awesome work!