Srinivas Sridharan, Phd

Cited by

	All	Since 2019
Citations	1618	1323
h-index	22	19
i10-index	31	25

400

200

100

300

2005200620072008200920102011201220132014201520162017201820192020202120222023202411 9 11 6 7 12 8 9 14 11 16 33 48 83 142 175 209 286 396 103

Public access

View all

5 articles

1 article

available

not available

Based on funding mandates

Co-authors

Tushar KrishnaAssociate Professor, Georgia TechVerified email at ece.gatech.edu
Mikhail SmelyanskiyFacebookVerified email at intel.com
Jeffrey S. VetterOak Ridge National LaboratoryVerified email at acm.org
Aditya AkellaProfessor, Computer Science, UT AustinVerified email at cs.utexas.edu
Louis FengUC DavisVerified email at ucdavis.edu

Srinivas Sridharan, Phd

Distinguished Engineer, NVIDIA

Verified email at nvidia.com - Homepage

AI SW/HW Systems Co-design AI Networking ML for Systems


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Distributed deep learning using synchronous stochastic gradient descent D Das, S Avancha, D Mudigere, K Vaidynathan, S Sridharan, D Kalamkar, ... arXiv preprint arXiv:1602.06709, 2016	207	2016
Mixed precision training of convolutional neural networks using integer operations D Das, N Mellempudi, D Mudigere, D Kalamkar, S Avancha, K Banerjee, ... arXiv preprint arXiv:1802.00930, 2018	187	2018
Deep learning at 15pf: supervised and semi-supervised classification for scientific data T Kurth, J Zhang, N Satish, E Racah, I Mitliagkas, MMA Patwary, T Malas, ... Proceedings of the International Conference for High Performance Computing …, 2017	95	2017
Machine learning accelerator mechanism A Bleiweiss, A Ramesh, A Mishra, D Marr, J Cook, S Sridharan, ... US Patent 11,373,088, 2022	82	2022
Deep learning training in facebook data centers: Design of scale-up and scale-out systems M Naumov, J Kim, D Mudigere, S Sridharan, X Wang, W Zhao, S Yilmaz, ... arXiv preprint arXiv:2003.09518, 2020	81	2020
Software-hardware co-design for fast and scalable training of deep learning recommendation models D Mudigere, Y Hao, J Huang, Z Jia, A Tulloch, S Sridharan, X Liu, ... Proceedings of the 49th Annual International Symposium on Computer …, 2022	71	2022
Abstraction layers for scalable distributed machine learning DD Kalamkar, K Vaidyanathan, S Sridharan, D Das US Patent 11,094,029, 2021	66	2021
Fine-grain compute communication execution for deep learning frameworks S Sridharan, D Mudigere US Patent App. 15/869,502, 2018	66	2018
Enabling efficient multithreaded MPI communication through a library-based implementation of MPI endpoints S Sridharan, J Dinan, DD Kalamkar SC'14: Proceedings of the International Conference for High Performance …, 2014	56	2014
Communication optimizations for distributed machine learning S Sridharan, K Vaidyanathan, D Das, C Sakthivel, ME Smorkalov US Patent 11,270,201, 2022	55	2022
Dynamic precision management for integer deep learning primitives N Mellempudi, D Mudigere, D Das, S Sridharan US Patent 10,643,297, 2020	47	2020
Hardware implemented point to point communication primitives for machine learning S Sridharan, K Vaidyanathan, D Das US Patent 11,488,008, 2022	46	2022
Thread migration to improve synchronization performance S Sridharan, B Keck, R Murphy, S Chandra, P Kogge Workshop on Operating System Interference in High Performance Applications, 2006	39	2006
M. khorashadi, P D Mudigere, Y Hao, J Huang, Z Jia, A Tulloch, S Sridharan, X Liu, ... Bhattacharya, P. Lapukhov, M. Naumov, L. Qiao, M. Smelyanskiy, B. Jia, and V …, 2021	37	2021
Astra-sim: Enabling sw/hw co-design exploration for distributed dl training platforms S Rashidi, S Sridharan, S Srinivasan, T Krishna 2020 IEEE International Symposium on Performance Analysis of Systems and …, 2020	36	2020
On scale-out deep learning training for cloud and hpc S Sridharan, K Vaidyanathan, D Kalamkar, D Das, ME Smorkalov, ... arXiv preprint arXiv:1801.08030, 2018	34	2018
Data parallelism and halo exchange for distributed machine learning D Das, K Vaidyanathan, S Sridharan US Patent 11,373,266, 2022	33	2022
High-performance, distributed training of large-scale deep learning recommendation models D Mudigere, Y Hao, J Huang, A Tulloch, S Sridharan, X Liu, M Ozdal, ... arXiv preprint arXiv:2104.05158, 2021	30	2021
Enabling compute-communication overlap in distributed deep learning training platforms S Rashidi, M Denton, S Sridharan, S Srinivasan, A Suresh, J Nie, ... 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021	28	2021
Comparing runtime systems with exascale ambitions using the parallel research kernels RF Van der Wijngaart, A Kayi, JR Hammond, G Jost, T St. John, ... High Performance Computing: 31st International Conference, ISC High …, 2016	26	2016

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors