Follow
Jeremy Fowers
Jeremy Fowers
Groq
Verified email at groq.com
Title
Cited by
Cited by
Year
A reconfigurable fabric for accelerating large-scale datacenter services
A Putnam, AM Caulfield, ES Chung, D Chiou, K Constantinides, J Demme, ...
2014 ACM/IEEE 41st International Symposium on Computer Architecture (ISCA …, 2014
12782014
A cloud-scale acceleration architecture
AM Caulfield, ES Chung, A Putnam, H Angepat, J Fowers, M Haselman, ...
2016 49th Annual IEEE/ACM international symposium on microarchitecture …, 2016
6482016
Accelerating deep convolutional neural networks using specialized hardware
K Ovtcharov, O Ruwase, JY Kim, J Fowers, K Strauss, ES Chung
Microsoft Research Whitepaper 2 (11), 1-4, 2015
4442015
A configurable cloud-scale DNN processor for real-time AI
J Fowers, K Ovtcharov, M Papamichael, T Massengill, M Liu, D Lo, ...
2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018
4122018
A performance and energy comparison of FPGAs, GPUs, and multicores for sliding-window applications
J Fowers, G Brown, P Cooke, G Stitt
Proceedings of the ACM/SIGDA international symposium on Field Programmable …, 2012
3032012
Serving dnns in real time at datacenter scale with project brainwave
E Chung, J Fowers, K Ovtcharov, M Papamichael, A Caulfield, ...
iEEE Micro 38 (2), 8-20, 2018
2592018
A high memory bandwidth fpga accelerator for sparse matrix-vector multiplication
J Fowers, K Ovtcharov, K Strauss, ES Chung, G Stitt
2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom …, 2014
1302014
A reconfigurable fabric for accelerating large-scale datacenter services
A Putnam, AM Caulfield, ES Chung, D Chiou, K Constantinides, J Demme, ...
IEEE Micro 35 (3), 10-22, 2015
1152015
A scalable high-bandwidth architecture for lossless compression on fpgas
J Fowers, JY Kim, D Burger, S Hauck
2015 IEEE 23rd Annual International Symposium on Field-Programmable Custom …, 2015
992015
Toward accelerating deep learning at scale using specialized hardware in the datacenter
K Ovtcharov, O Ruwase, JY Kim, J Fowers, K Strauss, ES Chung
2015 IEEE Hot Chips 27 Symposium (HCS), 1-38, 2015
852015
Accelerating persistent neural networks at datacenter scale
E Chung, J Fowers, K Ovtcharov, M Papamichael, A Caulfield, ...
Hot Chips 29, 2017
652017
Configurable clouds
AM Caulfield, ES Chung, A Putnam, H Angepat, D Firestone, J Fowers, ...
IEEE Micro 37 (3), 52-61, 2017
452017
A performance and energy comparison of convolution on GPUs, FPGAs, and multicore processors
J Fowers, G Brown, J Wernsing, G Stitt
ACM Transactions on Architecture and Code Optimization (TACO) 9 (4), 1-21, 2013
412013
A reconfigurable fabric for accelerating large-scale datacenter services
A Putnam, AM Caulfield, ES Chung, D Chiou, K Constantinides, J Demme, ...
Communications of the ACM 59 (11), 114-122, 2016
402016
A tradeoff analysis of FPGAs, GPUs, and multicores for sliding-window applications
P Cooke, J Fowers, G Brown, G Stitt
ACM Transactions on Reconfigurable Technology and Systems (TRETS) 8 (1), 1-24, 2015
302015
Pushing the limits of narrow precision inferencing at cloud scale with microsoft floating point
B Darvish Rouhani, D Lo, R Zhao, M Liu, J Fowers, K Ovtcharov, ...
Advances in Neural Information Processing Systems 33, 10271-10281, 2020
292020
Sparse matrix data structure
K Strauss, J Fowers, K Ovtcharov
US Patent 9,367,519, 2016
282016
Hardware node with matrix-vector multiply tiles for neural network processing
J Fowers, ES Chung
US Patent 10,140,252, 2018
172018
Neural network processor based on application specific synthesis specialization parameters
J Fowers, K Ovtcharov, ES Chung, TM Massengill, MG Liu, GL Weisz
US Patent App. 15/959,206, 2019
122019
The RACECAR heuristic for automatic function specialization on multi-core heterogeneous systems
JR Wernsing, G Stitt, J Fowers
Proceedings of the 2012 international conference on Compilers, architectures …, 2012
112012
The system can't perform the operation now. Try again later.
Articles 1–20