Yi Wu

Cited by

	All	Since 2019
Citations	9322	8846
h-index	24	23
i10-index	32	31

Citations per year
Year	2017	2018	2019	2020	2021	2022	2023	2024
Citations	114	317	665	1029	1489	2054	2722	879

Public access

12 articles available, 0 articles not available, based on funding mandates

Co-authors

Aviv Tamar - Technion - Verified email at technion.ac.il
Stuart Russell - Professor of Computer Science, University of California, Berkeley - Verified email at cs.berkeley.edu
Yuandong Tian - Research Scientist, Meta AI (FAIR) - Verified email at fb.com
Yu Wang (汪玉) - Department of Electronic Engineering, Tsinghua University, China - Verified email at mail.tsinghua.edu.cn
Fei Fang - Carnegie Mellon University - Verified email at cmu.edu
Igor Mordatch - Google DeepMind - Verified email at google.com
Pieter Abbeel - UC Berkeley | Covariant - Verified email at cs.berkeley.edu
Huazhe Xu - Tsinghua University - Verified email at berkeley.edu
Xiaolong Wang - Assistant Professor, UC San Diego - Verified email at ucsd.edu
Ryan Lowe - OpenAI - Verified email at openai.com
Jean Harb - OpenAI - Verified email at openai.com
Chao Yu（于超） - Tsinghua University - Verified email at mail.tsinghua.edu.cn
Akash Velu - Student, Stanford University - Verified email at stanford.edu
Eugene Vinitsky - Assistant Professor, NYU - Verified email at nyu.edu
Georgia Gkioxari - Caltech - Verified email at caltech.edu
Yunfei Li - Tsinghua University - Verified email at mails.tsinghua.edu.cn
Shusheng Xu - IIIS, Tsinghua University - Verified email at mails.tsinghua.edu.cn
Alexandre Bayen - Professor, Electrical Engineering and Computer Science, UC Berkeley - Verified email at berkeley.edu
Yuxin Wu - Verified email at google.com
Ingmar Kanitscheider - OpenAI - Verified email at openai.com

Yi Wu

Institute for Interdisciplinary Information Sciences, Tsinghua University

Verified email at mail.tsinghua.edu.cn

Reinforcement Learning, Human-AI Interaction, Multi-Agent Learning, Robot Learning


Title	Authors	Venue	Cited by	Year
Multi-agent actor-critic for mixed cooperative-competitive environments	R Lowe, Y Wu, A Tamar, J Harb, P Abbeel, I Mordatch	Advances in Neural Information Processing Systems 30, 2017	4604	2017
The surprising effectiveness of PPO in cooperative multi-agent games	C Yu, A Velu, E Vinitsky, J Gao, Y Wang, A Bayen, Y Wu	Advances in Neural Information Processing Systems 35, 24611-24624, 2022	895	2022
Emergent tool use from multi-agent autocurricula	B Baker, I Kanitscheider, T Markov, Y Wu, G Powell, B McGrew, ...	arXiv preprint arXiv:1909.07528, 2019	795	2019
Value iteration networks	A Tamar, Y Wu, G Thomas, S Levine, P Abbeel	Advances in Neural Information Processing Systems 29, 2016	726	2016
Building generalizable agents with a realistic and rich 3D environment	Y Wu, Y Wu, G Gkioxari, Y Tian	arXiv preprint arXiv:1801.02209, 2018	369	2018
Robust multi-agent reinforcement learning via minimax deep deterministic policy gradient	S Li, Y Wu, X Cui, H Dong, F Fang, S Russell	Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4213-4220, 2019	296	2019
Adversarial training for relation extraction	Y Wu, D Bamman, S Russell	Proceedings of the 2017 Conference on Empirical Methods in Natural Language …, 2017	242	2017
Multi-task reinforcement learning with soft modularization	R Yang, H Xu, Y Wu, X Wang	Advances in Neural Information Processing Systems 33, 4767-4777, 2020	167	2020
Influence-based multi-agent exploration	T Wang, J Wang, Y Wu, C Zhang	arXiv preprint arXiv:1910.05512, 2019	130	2019
Bayesian relational memory for semantic visual navigation	Y Wu, Y Wu, A Tamar, S Russell, G Gkioxari, Y Tian	Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	121*	2019
Evolutionary population curriculum for scaling multi-agent reinforcement learning	Q Long, Z Zhou, A Gupta, F Fang, Y Wu, X Wang	arXiv preprint arXiv:2003.10423, 2020	104	2020
NovelD: A simple yet effective exploration criterion	T Zhang, H Xu, X Wang, Y Wu, K Keutzer, JE Gonzalez, Y Tian	Advances in Neural Information Processing Systems 34, 25217-25230, 2021	96*	2021
Deep reinforcement learning for green security games with real-time information	Y Wang, ZR Shi, L Yu, Y Wu, R Singh, L Joppa, F Fang	Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 1401-1408, 2019	87	2019
Sequence level contrastive learning for text summarization	S Xu, X Zhang, Y Wu, F Wei	Proceedings of the AAAI Conference on Artificial Intelligence 36 (10), 11556 …, 2022	73	2022
Unsupervised extractive summarization by pre-training hierarchical transformers	S Xu, X Zhang, Y Wu, F Wei, M Zhou	arXiv preprint arXiv:2010.08242, 2020	53	2020
Discovering diverse multi-agent strategic behavior via reward randomization	Z Tang, C Yu, B Chen, H Xu, X Wang, F Fang, S Du, Y Wang, Y Wu	arXiv preprint arXiv:2103.04564, 2021	48	2021
Swift: Compiled inference for probabilistic programming languages	Y Wu, L Li, S Russell, R Bodik	arXiv preprint arXiv:1606.09242, 2016	40*	2016
Meta-learning MCMC proposals	T Wang, Y Wu, D Moore, SJ Russell	Advances in Neural Information Processing Systems 31, 2018	38	2018
Maximum entropy population-based training for zero-shot human-AI coordination	R Zhao, J Song, Y Yuan, H Hu, Y Gao, Y Wu, Z Sun, W Yang	Proceedings of the AAAI Conference on Artificial Intelligence 37 (5), 6145-6153, 2023	36	2023
Revisiting some common practices in cooperative multi-agent reinforcement learning	W Fu, C Yu, Z Xu, J Yang, Y Wu	arXiv preprint arXiv:2206.07505, 2022	36	2022
