mirror of
https://github.com/AIGC-Audio/AudioGPT.git
synced 2025-12-16 11:57:58 +01:00
Added available links to models.
This commit is contained in:
21
README.md
21
README.md
@@ -18,16 +18,17 @@ Please refer to [run.md](run.md)
|
|||||||
|
|
||||||
Here we list the capability of AudioGPT at this time. More supported models and tasks are coming soon. For prompt examples, refer to [asset](assets/README.md).
|
Here we list the capability of AudioGPT at this time. More supported models and tasks are coming soon. For prompt examples, refer to [asset](assets/README.md).
|
||||||
|
|
||||||
|
Currently not every model has repository.
|
||||||
### Speech
|
### Speech
|
||||||
| Task | Supported Foundation Models | Status |
|
| Task | Supported Foundation Models | Status |
|
||||||
|:--------------------------:|:-------------------------------:|:------:|
|
|:--------------------------:|:-------------------------------:|:------:|
|
||||||
| Text-to-Speech | [FastSpeech](), [SyntaSpeech](), [VITS]() | Yes (WIP) |
|
| Text-to-Speech | [FastSpeech](https://github.com/ming024/FastSpeech2), [SyntaSpeech](https://github.com/yerfor/SyntaSpeech), [VITS](https://github.com/jaywalnut310/vits) | Yes (WIP) |
|
||||||
| Style Transfer | [GenerSpeech]() | Yes |
|
| Style Transfer | [GenerSpeech](https://github.com/Rongjiehuang/GenerSpeech) | Yes |
|
||||||
| Speech Recognition | [whisper](), [Conformer]() | Yes |
|
| Speech Recognition | [whisper](https://github.com/openai/whisper), [Conformer](https://github.com/sooftware/conformer) | Yes |
|
||||||
| Speech Enhancement | [ConvTasNet]() | Yes (WIP) |
|
| Speech Enhancement | [ConvTasNet]() | Yes (WIP) |
|
||||||
| Speech Separation | [TF-GridNet]() | Yes (WIP) |
|
| Speech Separation | [TF-GridNet](https://arxiv.org/pdf/2211.12433.pdf) | Yes (WIP) |
|
||||||
| Speech Translation | [Multi-decoder]() | WIP |
|
| Speech Translation | [Multi-decoder](https://arxiv.org/pdf/2109.12804.pdf) | WIP |
|
||||||
| Mono-to-Binaural | [NeuralWarp]() | Yes |
|
| Mono-to-Binaural | [NeuralWarp](https://github.com/fdarmon/NeuralWarp) | Yes |
|
||||||
|
|
||||||
### Sing
|
### Sing
|
||||||
|
|
||||||
@@ -41,16 +42,16 @@ Here we list the capability of AudioGPT at this time. More supported models and
|
|||||||
| Text-to-Audio | [Make-An-Audio]() | Yes |
|
| Text-to-Audio | [Make-An-Audio]() | Yes |
|
||||||
| Audio Inpainting | [Make-An-Audio]() | Yes |
|
| Audio Inpainting | [Make-An-Audio]() | Yes |
|
||||||
| Image-to-Audio | [Make-An-Audio]() | Yes |
|
| Image-to-Audio | [Make-An-Audio]() | Yes |
|
||||||
| Sound Detection | [Audio-transformer]() | Yes |
|
| Sound Detection | [Audio-transformer](https://github.com/RetroCirce/HTS-Audio-Transformer) | Yes |
|
||||||
| Target Sound Detection | [TSDNet]() | Yes |
|
| Target Sound Detection | [TSDNet](https://github.com/gy65896/TSDNet) | Yes |
|
||||||
| Sound Extraction | [LASSNet]() | Yes |
|
| Sound Extraction | [LASSNet](https://github.com/liuxubo717/LASS) | Yes |
|
||||||
|
|
||||||
|
|
||||||
### Talking Head
|
### Talking Head
|
||||||
|
|
||||||
| Task | Supported Foundation Models | Status |
|
| Task | Supported Foundation Models | Status |
|
||||||
|:-------------------------:|:-------------------------------:|:----------:|
|
|:-------------------------:|:-------------------------------:|:----------:|
|
||||||
| Talking Head Synthesis | [GeneFace]() | Yes (WIP) |
|
| Talking Head Synthesis | [GeneFace](https://github.com/yerfor/GeneFace) | Yes (WIP) |
|
||||||
|
|
||||||
|
|
||||||
## Acknowledgement
|
## Acknowledgement
|
||||||
|
|||||||
Reference in New Issue
Block a user