Aviv Rosenberg

Cited by

	All	Since 2019
Citations	562	562
h-index	10	10
i10-index	10	10

Citations per year
Year	2019	2020	2021	2022	2023	2024
Citations	4	57	104	106	144	147

Public access

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Yishay Mansour - Tel Aviv University - Verified email at tauex.tau.ac.il
Tal Lancewicki - Tel Aviv University - Verified email at mail.tau.ac.il
Yonathan Efroni - Meta, New York - Verified email at fb.com
Haipeng Luo - Associate Professor, University of Southern California - Verified email at usc.edu
Shie Mannor - Professor of Electrical Engineering @ Technion & Researcher @ Nvidia - Verified email at technion.ac.il
Lior Shani - Google Research - Verified email at google.com
Alon Cohen - Tel-Aviv University and Google - Verified email at google.com
Haim Kaplan - School of Computer Science, Tel Aviv University - Verified email at post.tau.ac.il
Tiancheng Jin - Ph.D. student, University of Southern California - Verified email at usc.edu
Bilal Piot - Google Deepmind - Verified email at google.com
Daniele Calandriello - Research Scientist, DeepMind - Verified email at google.com
Asaf Cassel - School of Computer Science, Tel Aviv University - Verified email at mail.tau.ac.il
Dmitry Sotnikov - Amazon - Verified email at amazon.com
Liyu Chen - University of Southern California - Verified email at usc.edu
Assaf Hallak - NVIDIA Research - Verified email at nvidia.com
Gal Chechik - NVIDIA, Bar Ilan University - Verified email at biu.ac.il
Gal Dalal - Sr. Research Scientist, Nvidia - Verified email at nvidia.com
Wei Xiong - Computer Science, University of Illinois Urbana-Champaign - Verified email at illinois.edu
Jiaming Shen - Google DeepMind - Verified email at google.com
Rishabh Joshi - Google Deepmind, ex Brain Team - Verified email at google.com

Aviv Rosenberg

Google Research

Verified email at google.com

Machine Learning, Reinforcement Learning, Online Learning, Bandits and Algorithms


Title	Authors	Venue	Cited by	Year
Online convex optimization in adversarial Markov decision processes	A Rosenberg, Y Mansour	International Conference on Machine Learning, 5478-5486, 2019	155	2019
Optimistic policy optimization with bandit feedback	L Shani, Y Efroni, A Rosenberg, S Mannor	International Conference on Machine Learning, 8604-8613, 2020	98	2020
Online stochastic shortest path with bandit feedback and unknown transition function	A Rosenberg, Y Mansour	Advances in Neural Information Processing Systems, 2212-2221, 2019	75	2019
Near-optimal regret bounds for stochastic shortest path	A Cohen, H Kaplan, Y Mansour, A Rosenberg	International Conference on Machine Learning, 8210-8219, 2020	57	2020
Stochastic Shortest Path with Adversarially Changing Costs	A Rosenberg, Y Mansour	Thirtieth International Joint Conference on Artificial Intelligence (IJCAI …, 2021	35	2021
Minimax regret for stochastic shortest path	A Cohen, Y Efroni, Y Mansour, A Rosenberg	Thirty-Fifth Conference on Neural Information Processing Systems, 2021	31	2021
Learning adversarial Markov decision processes with delayed feedback	T Lancewicki, A Rosenberg, Y Mansour	Proceedings of the AAAI Conference on Artificial Intelligence 36 (7), 7281-7289, 2022	27	2022
Near-optimal regret for adversarial MDP with delayed bandit feedback	T Jin, T Lancewicki, H Luo, Y Mansour, A Rosenberg	Advances in Neural Information Processing Systems 35, 33469-33481, 2022	24	2022
Oracle-efficient regret minimization in factored MDPs with unknown structure	A Rosenberg, Y Mansour	Advances in Neural Information Processing Systems 34, 11148-11159, 2021	18*	2021
Policy optimization for stochastic shortest path	L Chen, H Luo, A Rosenberg	Conference on Learning Theory, 982-1046, 2022	14	2022
Building math agents with multi-turn iterative preference learning	W Xiong, C Shi, J Shen, A Rosenberg, Z Qin, D Calandriello, M Khalman, ...	arXiv preprint arXiv:2409.02392, 2024	7	2024
Planning and learning with adaptive lookahead	A Rosenberg, A Hallak, S Mannor, G Chechik, G Dalal	Proceedings of the AAAI Conference on Artificial Intelligence 37 (8), 9606-9613, 2023	7	2023
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback	T Lancewicki, A Rosenberg, D Sotnikov	International Conference on Machine Learning, 18482-18534, 2023	4	2023
Multi-turn Reinforcement Learning from Preference Human Feedback	L Shani, A Rosenberg, A Cassel, O Lang, D Calandriello, A Zipori, ...	arXiv preprint arXiv:2405.14655, 2024	3	2024
Cooperative online learning in stochastic and adversarial MDPs	T Lancewicki, A Rosenberg, Y Mansour	International Conference on Machine Learning, 11918-11968, 2022	3	2022
Near-optimal regret in linear MDPs with aggregate bandit feedback	A Cassel, H Luo, A Rosenberg, D Sotnikov	arXiv preprint arXiv:2405.07637, 2024	2	2024
A Unified Analysis of Nonstochastic Delayed Feedback for Combinatorial Semi-Bandits, Linear Bandits, and MDPs	D van der Hoeven, L Zierahn, T Lancewicki, A Rosenberg, ...	Conference on Learning Theory, 1285-1321, 2023	2	2023
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes	A Cassel, A Rosenberg	arXiv preprint arXiv:2407.03065, 2024		2024
