Deep speech 2: End-to-end speech recognition in english and mandarin D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ... International conference on machine learning, 173-182, 2016 | 2735 | 2016 |
Deep speech: Scaling up end-to-end speech recognition A Hannun, C Case, J Casper, B Catanzaro, G Diamos, E Elsen, ... arXiv preprint arXiv:1412.5567, 2014 | 1930 | 2014 |
Mixed precision training P Micikevicius, S Narang, J Alben, G Diamos, E Elsen, D Garcia, ... arXiv preprint arXiv:1710.03740, 2017 | 863 | 2017 |
Deep voice: Real-time neural text-to-speech SÖ Arık, M Chrzanowski, A Coates, G Diamos, A Gibiansky, Y Kang, X Li, ... International Conference on Machine Learning, 195-204, 2017 | 552 | 2017 |
Ocelot: a dynamic optimization framework for bulk-synchronous applications in heterogeneous systems G Diamos, A Kerr, S Yalamanchili, N Clark 2010 19th International Conference on Parallel Architectures and Compilation …, 2010 | 321 | 2010 |
Deep learning scaling is predictable, empirically J Hestness, S Narang, N Ardalani, G Diamos, H Jun, H Kianinejad, ... arXiv preprint arXiv:1712.00409, 2017 | 311 | 2017 |
Deep voice 2: Multi-speaker neural text-to-speech A Gibiansky, S Arik, G Diamos, J Miller, K Peng, W Ping, J Raiman, ... Advances in neural information processing systems 30, 2017 | 269 | 2017 |
Exploring sparsity in recurrent neural networks S Narang, E Elsen, G Diamos, S Sengupta arXiv preprint arXiv:1704.05119, 2017 | 252 | 2017 |
Harmony: an execution model and runtime for heterogeneous many core systems GF Diamos, S Yalamanchili Proceedings of the 17th international symposium on High performance …, 2008 | 251 | 2008 |
Mlperf inference benchmark VJ Reddi, C Cheng, D Kanter, P Mattson, G Schmuelling, CJ Wu, ... 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020 | 212 | 2020 |
Mlperf training benchmark P Mattson, C Cheng, G Diamos, C Coleman, P Micikevicius, D Patterson, ... Proceedings of Machine Learning and Systems 2, 336-349, 2020 | 173 | 2020 |
A characterization and analysis of ptx kernels A Kerr, G Diamos, S Yalamanchili 2009 IEEE international symposium on workload characterization (IISWC), 3-12, 2009 | 167 | 2009 |
Deep voice 2: Multi-speaker neural text-to-speech S Arik, G Diamos, A Gibiansky, J Miller, K Peng, W Ping, J Raiman, ... arXiv preprint arXiv:1705.08947, 2017 | 163 | 2017 |
Modeling GPU-CPU workloads and systems A Kerr, G Diamos, S Yalamanchili Proceedings of the 3rd workshop on general-purpose computation on graphics …, 2010 | 147 | 2010 |
Simultaneous branch and warp interweaving for sustained GPU performance N Brunie, C Collange, G Diamos 2012 39th Annual International Symposium on Computer Architecture (ISCA), 49-60, 2012 | 125 | 2012 |
Kernel weaver: Automatically fusing database primitives for efficient gpu computation H Wu, G Diamos, S Cadambi, S Yalamanchili 2012 45th Annual IEEE/ACM International Symposium on Microarchitecture, 107-118, 2012 | 116 | 2012 |
SIMD re-convergence at thread frontiers G Diamos, B Ashbaugh, S Maiyuran, A Kerr, H Wu, S Yalamanchili 2011 44th Annual IEEE/ACM International Symposium on Microarchitecture …, 2011 | 104 | 2011 |
Block-sparse recurrent neural networks S Narang, E Undersander, G Diamos arXiv preprint arXiv:1711.02782, 2017 | 101 | 2017 |
Red fox: An execution environment for relational query processing on gpus H Wu, G Diamos, T Sheard, M Aref, S Baxter, M Garland, S Yalamanchili Proceedings of Annual IEEE/ACM International Symposium on Code Generation …, 2014 | 94 | 2014 |
Persistent rnns: Stashing recurrent weights on-chip G Diamos, S Sengupta, B Catanzaro, M Chrzanowski, A Coates, E Elsen, ... International Conference on Machine Learning, 2024-2033, 2016 | 89 | 2016 |