πŸ€– πŸ’¬ Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
ΠŸΠ΅Ρ€Π΅ΠΉΡ‚ΠΈ ΠΊ Ρ„Π°ΠΉΠ»Ρƒ
Eren Golge bcfd492bf0 README update 2018-01-23 14:18:09 +01:00
datasets Change config to json 3 2018-01-22 08:29:27 -08:00
layers New files 2018-01-22 06:59:41 -08:00
models New files 2018-01-22 06:59:41 -08:00
png Beginning 2018-01-22 01:48:59 -08:00
samples Beginning 2018-01-22 01:48:59 -08:00
utils Change config to json 3 2018-01-22 08:29:27 -08:00
.gitignore new files 2018-01-22 06:59:21 -08:00
README.md README update 2018-01-23 14:18:09 +01:00
__init__.py Beginning 2018-01-22 01:48:59 -08:00
config.json Change config to json 3 2018-01-22 08:29:27 -08:00
module.py Beginning 2018-01-22 01:48:59 -08:00
requirements.txt Beginning 2018-01-22 01:48:59 -08:00
synthesis.py Beginning 2018-01-22 01:48:59 -08:00
train.py Change config to json 3 2018-01-22 08:29:27 -08:00

README.md

Tacotron (Work in Progress...)

Here we have pytorch implementation of: - Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model. - Tacotron2 (TODO): Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions

At the end, it should be easy to add new models and try different architectures.

You can find here a brief note about possible TTS architectures and their comparisons.

Requirements

Highly recommended to use miniconda for easier installation.

  • python 3.6
  • pytorch > 0.2.0
  • TODO

Data

TODO

File description

TODO

Training the network

TODO

Generate TTS wav file

TODO