State-of-the-art speech recognition with sequence-to-sequence models CC Chiu, TN Sainath, Y Wu, R Prabhavalkar, P Nguyen, Z Chen, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 980 | 2018 |
Generation of large-scale simulated utterances in virtual rooms to train deep-neural networks for far-field speech recognition in Google Home C Kim, A Misra, K Chin, T Hughes, A Narayanan, T Sainath, M Bacchiani | 199 | 2017 |
Unsupervised language model adaptation M Bacchiani, B Roark 2003 IEEE International Conference on Acoustics, Speech, and Signal …, 2003 | 199 | 2003 |
Multichannel signal processing with deep neural networks for automatic speech recognition TN Sainath, RJ Weiss, KW Wilson, B Li, A Narayanan, E Variani, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (5), 965-979, 2017 | 193 | 2017 |
Acoustic Modeling for Google Home. B Li, TN Sainath, A Narayanan, J Caroselli, M Bacchiani, A Misra, ... Interspeech, 399-403, 2017 | 152 | 2017 |
SCANMail: a voicemail interface that makes speech browsable, readable and searchable S Whittaker, J Hirschberg, B Amento, L Stark, M Bacchiani, P Isenhour, ... Proceedings of the SIGCHI conference on Human factors in computing systems …, 2002 | 152 | 2002 |
Restoring punctuation and capitalization in transcribed speech A Gravano, M Jansche, M Bacchiani 2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009 | 141 | 2009 |
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 135 | 2019 |
Processing multi-channel audio waveforms TN Sainath, RJ Weiss, KW Wilson, AW Senior, A Narayanan, Y Hoshen, ... US Patent 9,697,826, 2017 | 132 | 2017 |
Supervised and unsupervised PCFG adaptation to novel domains B Roark, M Bacchiani Proceedings of the 2003 Human Language Technology Conference of the North …, 2003 | 114 | 2003 |
Neural network adaptive beamforming for robust multichannel speech recognition B Li, TN Sainath, RJ Weiss, KW Wilson, M Bacchiani | 113 | 2016 |
From audio to semantics: Approaches to end-to-end spoken language understanding P Haghani, A Narayanan, M Bacchiani, G Chuang, N Gaur, P Moreno, ... 2018 IEEE Spoken Language Technology Workshop (SLT), 720-726, 2018 | 105 | 2018 |
Large vocabulary automatic speech recognition for children H Liao, G Pundak, O Siohan, M Carroll, N Coccaro, QM Jiang, TN Sainath, ... | 104 | 2015 |
Multi-dialect speech recognition with a single sequence-to-sequence model B Li, TN Sainath, KC Sim, M Bacchiani, E Weinstein, P Nguyen, Z Chen, ... 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 92 | 2018 |
MAP adaptation of stochastic grammars M Bacchiani, M Riley, B Roark, R Sproat Computer Speech & Language 20 (1), 41-68, 2006 | 89 | 2006 |
Fast vocabulary-independent audio search using path-based graph indexing O Siohan, M Bacchiani Proc. Interspeech, 2005 | 86 | 2005 |
Speech processing for digital home assistants: Combining signal processing with deep-learning techniques R Haeb-Umbach, S Watanabe, T Nakatani, M Bacchiani, B Hoffmeister, ... IEEE Signal Processing Magazine 36 (6), 111-124, 2019 | 84 | 2019 |
An audio indexing system for election video material C Alberti, M Bacchiani, A Bezman, C Chelba, A Drofa, H Liao, P Moreno, ... 2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009 | 83 | 2009 |
AT&T at TREC-8. A Singhal, SP Abney, M Bacchiani, M Collins, D Hindle, FCN Pereira TREC 8, 317-330, 1999 | 83 | 1999 |
Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms TN Sainath, RJ Weiss, KW Wilson, A Narayanan, M Bacchiani, A Senior 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015 | 80 | 2015 |