Di He

Cited by

	All	Since 2019
Citations	8097	7619
h-index	38	37
i10-index	63	60

2400

1200

600

1800

2014201520162017201820192020202120222023202425 34 34 109 239 496 848 1214 1658 2360 1034

Public access

View all

27 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Tie-Yan LiuDistinguished Scientist, Microsoft Research AI4Science | IEEE Fellow | ACM Fellow | AAIA FellowVerified email at microsoft.com
Liwei WangProfessor, Peking UniversityVerified email at cis.pku.edu.cn
Tao QinSenior Principal Research Manager, Microsoft ResearchVerified email at microsoft.com
Guolin KeDP TechnologyVerified email at dp.tech
Shuxin ZhengPrincipal Researcher, Microsoft ResearchVerified email at microsoft.com
Shengjie LuoPhD Student, Peking UniversityVerified email at stu.pku.edu.cn
Tianle CaiPhD Student, Princeton UniversityVerified email at princeton.edu
Yingce XiaPrincipal Researcher, Microsoft Research AI4ScienceVerified email at microsoft.com
Bohang ZhangPeking UniversityVerified email at pku.edu.cn
Zhuohan LiUC BerkeleyVerified email at berkeley.edu
Fei TianFacebookVerified email at fb.com
Lijun WuMicrosoft ResearchVerified email at microsoft.com
Runtian ZhaiPhD Student, Carnegie Mellon UniversityVerified email at cmu.edu
Shanda LiCarnegie Mellon UniversityVerified email at cs.cmu.edu
Zhiqing SunCarnegie Mellon University | Language Technologies InstituteVerified email at cs.cmu.edu
Chengyue Gong 龚成玥University of Texas at AustinVerified email at cs.utexas.edu

Di He

Peking University

Verified email at pku.edu.cn

Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Dual learning for machine translation D He, Y Xia, T Qin, L Wang, N Yu, TY Liu, WY Ma Advances in neural information processing systems 29, 2016	1084	2016
Do Transformers Really Perform Bad for Graph Representation? C Ying, T Cai, S Luo, S Zheng, G Ke, D He, Y Shen, TY Liu NeurIPS 2021, 2021	930	2021
On layer normalization in the transformer architecture R Xiong, Y Yang, D He, K Zheng, S Zheng, C Xing, H Zhang, Y Lan, ... International Conference on Machine Learning, 10524-10533, 2020	767	2020
A theoretical analysis of NDCG ranking measures Y Wang, L Wang, Y Li, D He, W Chen, TY Liu Proceedings of the 26th Annual Conference on Learning Theory (COLT 2013) 8, 6, 2013	670*	2013
Incorporating bert into neural machine translation J Zhu, Y Xia, L Wu, D He, T Qin, W Zhou, H Li, TY Liu ICLR 2020, 2020	433	2020
Rethinking positional encoding in language pre-training G Ke, D He, TY Liu ICLR 2020, 2020	256	2020
Multilingual neural machine translation with knowledge distillation X Tan, Y Ren, D He, T Qin, Z Zhao, TY Liu ICLR 2019, 2019	246	2019
Invertible image rescaling M Xiao, S Zheng, C Liu, Y Wang, D He, G Ke, J Bian, Z Lin, TY Liu Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020	210	2020
Representation degeneration problem in training natural language generation models J Gao, D He, X Tan, T Qin, L Wang, T Liu ICLR 2019, 2018	210	2018
Frage: Frequency-agnostic word representation C Gong, D He, X Tan, T Qin, L Wang, TY Liu Advances in Neural Information Processing Systems, 1334-1345, 2018	172	2018
Macer: Attack-free and scalable robust training via maximizing certified radius R Zhai, C Dan, D He, H Zhang, B Gong, P Ravikumar, CJ Hsieh, L Wang ICLR 2020, 2020	167	2020
Understanding and improving transformer from a multi-particle dynamic system point of view Y Lu, Z Li, D He, Z Sun, B Dong, T Qin, L Wang, TY Liu arXiv preprint arXiv:1906.02762, 2019	156	2019
Non-autoregressive machine translation with auxiliary regularization Y Wang, F Tian, D He, T Qin, CX Zhai, TY Liu AAAI 2019, 2019	149	2019
Adversarially robust generalization just requires more unlabeled data R Zhai, T Cai, D He, C Dan, K He, J Hopcroft, L Wang arXiv preprint arXiv:1906.00555, 2019	148	2019
Graphnorm: A principled approach to accelerating graph neural network training T Cai, S Luo, K Xu, D He, T Liu, L Wang International Conference on Machine Learning, 1204-1215, 2021	137	2021
Layer-wise coordination between encoder and decoder for neural machine translation T He, X Tan, Y Xia, D He, T Qin, Z Chen, TY Liu Advances in Neural Information Processing Systems 31, 2018	129	2018
Non-autoregressive neural machine translation with enhanced decoder input J Guo, X Tan, D He, T Qin, L Xu, TY Liu Proceedings of the AAAI conference on artificial intelligence 33 (01), 3723-3730, 2019	127	2019
Efficient training of bert by progressively stacking L Gong, D He, Z Li, T Qin, L Wang, T Liu International conference on machine learning, 2337-2346, 2019	127	2019
Towards a deep and unified understanding of deep neural models in nlp C Guan, X Wang, Q Zhang, R Chen, D He, X Xie International conference on machine learning, 2454-2463, 2019	115	2019
Fast structured decoding for sequence models Z Sun, Z Li, H Wang, D He, Z Lin, Z Deng Advances in Neural Information Processing Systems 32, 2019	112	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors