Sounakray2003/issue91 #92

Open
wants to merge 78 commits into
base: dongchao
Changes from 1 commit
Commits
609d4bb
Add WIP items for the project
ftshijt Mar 28, 2023
8d0203b
Merge pull request #3 from ftshijt/main
Rongjiehuang Mar 28, 2023
f28a3c5
update
Rongjiehuang Mar 28, 2023
fb52b59
update
lmzjms Mar 28, 2023
0b56c58
Merge pull request #4 from lmzjms/main
Rongjiehuang Mar 28, 2023
7099521
Revert "update"
Rongjiehuang Mar 28, 2023
f89ee96
Merge pull request #5 from AIGC-Audio/revert-4-main
Rongjiehuang Mar 28, 2023
03f41ec
update
lmzjms Mar 28, 2023
fd6a33c
update
lmzjms Mar 28, 2023
c97c3f7
Merge pull request #6 from lmzjms/main
Rongjiehuang Mar 28, 2023
a869dce
Update README.md
yangdongchao Mar 29, 2023
7492e97
update
lmzjms Mar 30, 2023
819b3c4
update
lmzjms Mar 30, 2023
2d1dd31
update
lmzjms Mar 31, 2023
cc0c224
merge tts and t2s into NeuralSeq
PeppaPiggeee Mar 31, 2023
6c6e077
update
PeppaPiggeee Mar 31, 2023
3fbfbf9
update
PeppaPiggeee Mar 31, 2023
7094f94
update
PeppaPiggeee Mar 31, 2023
c7e6dbe
update
PeppaPiggeee Mar 31, 2023
3131ec4
Merge pull request #8 from lmzjms/main
Rongjiehuang Mar 31, 2023
f6c8cbe
Update README.md
Rongjiehuang Mar 31, 2023
7a315a8
update
PeppaPiggeee Apr 1, 2023
a23e0f3
Merge branch 'main' into hzq
PeppaPiggeee Apr 1, 2023
3be5167
update
PeppaPiggeee Apr 1, 2023
7a709a6
update
PeppaPiggeee Apr 2, 2023
8569aa0
Merge pull request #9 from AIGC-Audio/hzq
Rongjiehuang Apr 2, 2023
9e2a24b
delect cache
Rongjiehuang Apr 2, 2023
ff82f7a
delect cache
Rongjiehuang Apr 2, 2023
9b4a830
Merge pull request #10 from Rongjiehuang/main
Rongjiehuang Apr 2, 2023
1a69271
cleaning
Rongjiehuang Apr 3, 2023
7ee017c
Update README.md
yangdongchao Apr 4, 2023
322ed8c
detection and extraction
yangdongchao Apr 5, 2023
5cfa061
Merge pull request #12 from AIGC-Audio/ydc
Rongjiehuang Apr 6, 2023
e3a7194
fix e
yangdongchao Apr 6, 2023
3c47c2c
Merge pull request #13 from AIGC-Audio/ydc
Rongjiehuang Apr 6, 2023
112d87b
Merge branch 'main' of github.com:Rongjiehuang/AudioGPT
Rongjiehuang Apr 6, 2023
2da3ccd
update huggingface
Rongjiehuang Apr 6, 2023
e8fdbbf
update huggingface
Rongjiehuang Apr 6, 2023
514f233
add assets
yangdongchao Apr 9, 2023
0dff745
Merge pull request #14 from AIGC-Audio/ydc
yangdongchao Apr 9, 2023
f3cf2be
update
Rongjiehuang Apr 9, 2023
028bf0c
Merge branch 'main' of github.com:Rongjiehuang/AudioGPT
Rongjiehuang Apr 9, 2023
69aca79
update
Rongjiehuang Apr 9, 2023
236a2aa
Merge pull request #15 from Rongjiehuang/main
Rongjiehuang Apr 9, 2023
963fb77
Add Visinger
A-Quarter-Mile Apr 10, 2023
f5c6c4c
update
lmzjms Apr 11, 2023
4d14f89
update
lmzjms Apr 11, 2023
e2b06d3
Merge pull request #16 from A-Quarter-Mile/main
Rongjiehuang Apr 11, 2023
181bcee
add enh / ss
simpleoier Apr 11, 2023
9d9ad78
Merge pull request #18 from simpleoier/enh_ss
Rongjiehuang Apr 11, 2023
84a5493
update
lmzjms Apr 11, 2023
ea246e7
update
lmzjms Apr 11, 2023
e03a456
update
lmzjms Apr 11, 2023
70d54b5
update
lmzjms Apr 11, 2023
cb62a28
Merge pull request #20 from lmzjms/main
Rongjiehuang Apr 12, 2023
aab80e0
clean some codes
Rongjiehuang Apr 12, 2023
8975378
Merge pull request #21 from Rongjiehuang/main
Rongjiehuang Apr 12, 2023
7c6f83a
update
lmzjms Apr 13, 2023
34d0365
Merge branch 'main' of github.com:lmzjms/AudioGPT into main
lmzjms Apr 13, 2023
209995f
update
lmzjms Apr 13, 2023
46e0dbe
update
lmzjms Apr 13, 2023
d218ef7
Merge pull request #22 from lmzjms/main
Rongjiehuang Apr 13, 2023
89d47f0
clean
Rongjiehuang Apr 16, 2023
a06c041
Merge branch 'main' of github.com:Rongjiehuang/AudioGPT
Rongjiehuang Apr 16, 2023
b7ef7f0
Update README.md
MoonInTheRiver Apr 18, 2023
1c4b42f
Update README.md
RayeRen Apr 21, 2023
7ecef2b
update
Rongjiehuang Apr 26, 2023
4a0a02e
Merge branch 'AIGC-Audio:main' into main
Rongjiehuang Apr 26, 2023
36b86ad
Merge pull request #23 from Rongjiehuang/main
Rongjiehuang Apr 26, 2023
ed28e06
update
Rongjiehuang Apr 26, 2023
d1c2e98
Merge branch 'main' of github.com:Rongjiehuang/AudioGPT
Rongjiehuang Apr 26, 2023
afbc05e
Merge branch 'main' of github.com:Rongjiehuang/AudioGPT
Rongjiehuang Apr 26, 2023
97a9a2f
Refine readme
Rongjiehuang Apr 26, 2023
9b6d51d
update
lmzjms Apr 30, 2023
79fe509
update
lmzjms Apr 30, 2023
526a05a
update
lmzjms Apr 30, 2023
f61a97c
update
lmzjms Apr 30, 2023
148737e
Merge pull request #40 from lmzjms/main
lmzjms Apr 30, 2023
update
Rongjiehuang committed Mar 28, 2023
commit f28a3c5f91a712c5b8c75689beebf501d56c999b
27 changes: 21 additions & 6 deletions README.md
@@ -1,27 +1,42 @@
# AudioGPT

**AudioGPT** connects ChatGPT and a series of Audio Foundation Models to enable **sending** and **receiving** speech, sing, and audio during chatting.
**AudioGPT** connects ChatGPT and a series of Audio Foundation Models to enable **sending** and **receiving** speech, sing, audio, and talking head during chatting.


## Capabilities

Up-to-date link: https://eac422a9e2289d6b.gradio.app/

Here we list the capabilities of AudioGPT at this time. More supported models and tasks are coming soon. For prompt examples, refer to [asset](assets/README.md).

### Speech
| Task | Supported Foundation Models | Status |
|:-------------------------:|:-------------------------------:|:------:|
| ----------Speech--------- | / | / |
| Text-to-Speech | [FastSpeech](), [SyntaSpeech](), [VITS]() | Yes (WIP) |
| Style Transfer | [GenerSpeech]() | WIP |
| Style Transfer | [GenerSpeech]() | Yes |
| Speech Recognition | [whisper](), [Conformer]() | Yes |
| Speech Enhancement | [ConvTasNet]() | WIP |
| Speech Separation | [TF-GridNet]() | WIP |
| Speech Translation | [Multi-decoder]() | WIP |
| ----------Sing--------- | / | |
| Mono-to-Binaural Speech | []() | WIP |

### Sing

| Task | Supported Foundation Models | Status |
|:-------------------------:|:-------------------------------:|:------:|
| Text-to-Sing | [DiffSinger](), [VISinger]() | Yes (WIP) |
| ----------Audio--------- | / | |

### Audio
| Task | Supported Foundation Models | Status |
|:-------------------------:|:-------------------------------:|:------:|
| Text-to-Audio | [Make-An-Audio]() | Yes |
| Audio Inpainting | [Make-An-Audio]() | WIP |
| Image-to-Audio | [Make-An-Audio]() | Yes |
| ----------Face--------- |

### Talking Head

| Task | Supported Foundation Models | Status |
|:-------------------------:|:-------------------------------:|:------:|
| Talking Head Synthesis | [GeneFace]() | WIP |

## Internal Version Updates