Update Readme
fatchord committed Apr 30, 2019
1 parent e70650f commit 2ada83a
Showing 2 changed files with 13 additions and 3 deletions.
16 changes: 13 additions & 3 deletions README.md
@@ -2,6 +2,8 @@

##### (Update: Vanilla Tacotron One TTS system just implemented - more coming soon!)

![Tacotron with WaveRNN diagrams](assets/tacotron_wavernn.png)

Pytorch implementation of Deepmind's WaveRNN model from [Efficient Neural Audio Synthesis](https://arxiv.org/abs/1802.08435v1)

# Installation
@@ -84,11 +86,19 @@ Both are trained on LJSpeech
* WaveRNN trained to 800k steps (400k normal mels / 400k gta finetuned)
* Tacotron(r=1) trained to 196k steps

# Acknowledgments
____

### References

* [Efficient Neural Audio Synthesis](https://arxiv.org/abs/1802.08435v1)
* [keithito tacotron](https://github.com/keithito/tacotron)
* Special thanks to github users [G-Wang](https://github.com/G-Wang), [geneing](https://github.com/geneing)
* [Tacotron: Towards End-to-End Speech Synthesis](https://arxiv.org/abs/1703.10135)
* [Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions](https://arxiv.org/abs/1712.05884)

### Acknowlegements

* [https://github.com/keithito/tacotron](https://github.com/keithito/tacotron)
* [https://github.com/r9y9/wavenet_vocoder](https://github.com/r9y9/wavenet_vocoder)
* Special thanks to github users [G-Wang](https://github.com/G-Wang), [geneing](https://github.com/geneing) & [erogol](https://github.com/erogol)



Binary file added assets/tacotron_wavernn.png
