Zheng Wen

Cited by

	All	Since 2019
Citations	5452	4640
h-index	31	30
i10-index	59	54

1000

500

250

750

2014201520162017201820192020202120222023202428 67 146 179 328 505 739 847 992 899 656

Public access

View all

8 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Branislav KvetonAmazonVerified email at amazon.com
Benjamin Van RoyStanford UniversityVerified email at stanford.edu
Ian OsbandOpenAIVerified email at openai.com
Csaba SzepesvariDeepMind & University of AlbertaVerified email at cs.ualberta.ca
Azin AshkanGoogleVerified email at uwaterloo.ca
Xiuyuan LuGoogle DeepMindVerified email at google.com
Yasin Abbasi YadkoriGoogle DeepMindVerified email at google.com
Vikranth DwaracherlaDeepMindVerified email at google.com
Morteza IbrahimiStanford UniversityVerified email at stanford.edu
Mohammad GhavamzadehAmazonVerified email at amazon.com
Sharan VaswaniSimon Fraser UniversityVerified email at sfu.ca
Daniel RussoColumbia UniversityVerified email at gsb.columbia.edu
Botao HaoOpenAIVerified email at openai.com
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindVerified email at meta.com
Seyed Mohammad AsghariResearch Engineer, DeepMindVerified email at google.com
Brian ErikssonAdobeVerified email at adobe.com
S MuthukrishnanRutgers UnivVerified email at cs.rutgers.edu
Sumeet KatariyaAmazonVerified email at wisc.edu
Shlomo BerkovskyMacquarie UniversityVerified email at mq.edu.au
Abbas KazerouniStanford UniversityVerified email at stanford.edu

Zheng Wen

Google DeepMind

Verified email at google.com - Homepage

Artificial Intelligence Reinforcement Learning Operations Research Large Language Models


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A Tutorial on Thompson Sampling D Russo, B Van Roy, A Kazerouni, I Osband, Z Wen arXiv, https://arxiv.org/pdf/1707.02038.pdf, 0	1133*
Generalization and exploration via randomized value functions I Osband, B Van Roy, Z Wen International Conference on Machine Learning, 2377-2386, 2016	340	2016
Deep exploration via randomized value functions I Osband, B Van Roy, DJ Russo, Z Wen Journal of Machine Learning Research 20 (124), 1-62, 2019	332	2019
Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits B Kveton, Z Wen, A Ashkan, C Szepesvari International Conference on Artificial Intelligence and Statistics (AISTATS …, 2014	322	2014
Cascading bandits: Learning to rank in the cascade model B Kveton, C Szepesvári, Z Wen, A Ashkan ICML, 2015	317	2015
Optimal demand response using device based reinforcement learning Z Wen, D O'Neill, HR Maei IEEE Transactions on Smart Grid, 2014	308	2014
Online influence maximization under independent cascade model with semi-bandit feedback Z Wen, B Kveton, M Valko, S Vaswani Advances in neural information processing systems 30, 2017	148*	2017
Nearly optimal adaptive procedure with change detection for piecewise-stationary bandit Y Cao, Z Wen, B Kveton, Y Xie The 22nd International Conference on Artificial Intelligence and Statistics …, 2019	138*	2019
Matroid bandits: Fast combinatorial optimization with learning B Kveton, Z Wen, A Ashkan, H Eydgahi, B Eriksson UAI 2014, 2014	129	2014
Cascading bandits for large-scale recommendation problems S Zong, H Ni, K Sung, NR Ke, Z Wen, B Kveton arXiv preprint arXiv:1603.05359, 2016	128	2016
Combinatorial cascading bandits B Kveton, Z Wen, A Ashkan, C Szepesvari Advances in Neural Information Processing Systems 28, 2015	128	2015
Optimal Greedy Diversity for Recommendation A Ashkan, B Kveton, S Berkovsky, Z Wen	113	2015
Efficient learning in large-scale combinatorial semi-bandits Z Wen, B Kveton, A Ashkan http://jmlr.org/proceedings/papers/v37/wen15.html, 2014	109	2014
Online learning to rank in stochastic click models M Zoghi, T Tunys, M Ghavamzadeh, B Kveton, C Szepesvari, Z Wen International conference on machine learning, 4199-4208, 2017	107	2017
Efficient Exploration and Value Function Generalization in Deterministic Systems Z Wen, B Van Roy Advances in Neural Information Processing Systems, 3021--3029, 2013	91	2013
Epistemic neural networks I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ... Advances in Neural Information Processing Systems 36, 2024	90	2024
DCM Bandits: Learning to Rank with Multiple Clicks S Katariya, B Kveton, C Szepesvári, Z Wen arXiv, 2016	90	2016
Model-independent online learning for influence maximization S Vaswani, B Kveton, Z Wen, M Ghavamzadeh, LVS Lakshmanan, ... International conference on machine learning, 3530-3539, 2017	82*	2017
Garbage in, reward out: Bootstrapping exploration in multi-armed bandits B Kveton, C Szepesvari, S Vaswani, Z Wen, T Lattimore, M Ghavamzadeh International Conference on Machine Learning, 3601-3610, 2019	78	2019
Stochastic rank-1 bandits S Katariya, B Kveton, C Szepesvari, C Vernade, Z Wen Artificial Intelligence and Statistics, 392-401, 2017	74	2017

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors