DeepSpeech/BIBLIOGRAPHY.md

3.0 KiB

This file contains a list of papers in chronological order that have been published using Mozilla's DeepSpeech.

To appear

  • Raghuveer Peri, Haoqi Li, Krishna Somandepalli, Arindam Jati, Shrikanth Narayanan (2020) "An empirical analysis of information encoded in disentangled neural speaker representations".
  • Rosana Ardila, Megan Branson, Kelly Davis, Michael Henretty, Michael Kohler, Josh Meyer, Reuben Morais, Lindsay Saunders, Francis M. Tyers, and Gregor Weber (2020) "Common Voice: A Massively-Multilingual Speech Corpus".

Published

2020

  • Nils Hjortnaes, Niko Partanen, Michael Rießler and Francis M. Tyers (2020) "Towards a Speech Recognizer for Komi, an Endangered and Low-Resource Uralic Language". Proceedings of the 6th International Workshop on Computational Linguistics of Uralic Languages.
@inproceedings{hjortnaes:2020,
    author = {Nils Hjortnaes and Niko Partanen and Michael Rießler and Francis M. Tyers},
    title = {Towards a Speech Recognizer for Komi, an Endangered and Low-Resource Uralic Language},
    booktitle = {Proceedings of the 6th International Workshop on Computational Linguistics of Uralic Languages},
    year = 2020
}

2019

  • Aashish Agarwal and Torsten Zesch (2019) "German End-to-end Speech Recognition based on DeepSpeech". Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019)
@inproceedings{agarwal:2019,
    author = {Aashish Agarwal and Torsten Zesch},
    title = {German End-to-end Speech Recognition based on DeepSpeech},
    booktitle = {Proceedings of the 15th Conference on Natural Language Processing (KONVENS 2019)},
    year = 2019
  • Yihong Theis (2019) "Learning to detect named entities in bilingual code-mixed open speech corpora". MA Thesis. Kansas State University.
@mastersthesis{theis:2019,
    author = {Yihong Theis},
    title = {Learning to detect named entities in bilingual code-mixed open speech corpora},
    school = {Kansas State University},
    year = 2019
}
  • Ruswan Efendi (2019) "Automatic Speech Recognition Bahasa Indonesia Menggunakan Bidirectional Long Short-Term Memory dan Connectionist Temporal Classification". MA Thesis. Universitas Sumatera Utara.
@mastersthesis{theis:2019,
    author = {Ruswan Efendi},
    title = {Automatic Speech Recognition Bahasa Indonesia Menggunakan Bidirectional Long Short-Term Memory dan Connectionist Temporal Classification},
    school = {Universitas Sumatera Utara},
    year = 2019
}

2018

  • Deepthi Karkada and Vikram A. Saletore (2018) "Training Speech Recognition Models on HPC Infrastructure". 2018 IEEE/ACM Machine Learning in HPC Environments (MLHPC), Dallas, TX, USA, pp. 124-132.
@inproceedings{karkada:2018,
    author = {Deepthi Karkada and Vikram A. Saletore},
    title = {Training Speech Recognition Models on HPC Infrastructure},
    booktitle = {2018 IEEE/ACM Machine Learning in HPC Environments (MLHPC)},
    doi = {https://doi.org/10.1109/MLHPC.2018.8638637}
    year = 2018
}