Tom Zahavy

Cited by

	All	Since 2019
Citations	2123	1878
h-index	20	20
i10-index	31	31

460

230

115

345

20162017201820192020202120222023202434 59 142 173 254 312 392 456 289

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ NvidiaVerified email at technion.ac.il
Daniel J. MankowitzGoogle DeepmindVerified email at google.com
Satinder SinghGoogle DeepMind / U. of MichiganVerified email at umich.edu
Sebastian FlennerhagResearch Scientist at DeepMindVerified email at google.com
Chen TesslerResearch Scientist, NVIDIA ResearchVerified email at nvidia.com
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLVerified email at google.com
Brendan O'DonoghueStanford University, Google DeepMindVerified email at alumni.stanford.edu
Mordechai SegevSolid State Institute, Physics Department and Electrical Engineering Department Technion - IsraelVerified email at technion.ac.il
Alex DikopoltsevQuantum Optoelectronics Group, Department of Physics, ETHVerified email at phys.ethz.ch
Zhongwen XuTencentVerified email at tencent.com
Oren CohenProfessor of Physics, Technion, IsraelVerified email at technion.ac.il
Vivek VeeriahGoogle DeepMindVerified email at google.com
David SilverDeepMind, UCLVerified email at google.com
Matteo HesselResearch Engineer, Google DeepMindVerified email at google.com
Junhyuk OhResearch Scientist, DeepMindVerified email at google.com
Robert Tjarko LangeSakana AI, TU BerlinVerified email at tu-berlin.de
Tom SchaulSenior Staff Scientist, DeepMindVerified email at nyu.edu
Valentin DalibardUniversity of CambridgeVerified email at cl.cam.ac.uk
Nadav MerlisPostdoctoral Fellow @ CREST, ENSAE ParisVerified email at ensae.fr
Alessandro MagnaniWalmartlabsVerified email at walmartlabs.com

Tom Zahavy

Other namesTom Ben Zion Zahavy

Staff Research Scientist, Google DeepMind

Verified email at deepmind.com - Homepage

Reinforcement Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
A deep hierarchical approach to lifelong learning in minecraft C Tessler, S Givony, T Zahavy, D Mankowitz, S Mannor Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	452	2017
Graying the black box: Understanding dqns T Zahavy, N Ben-Zrihem, S Mannor International conference on machine learning (ICML), 1899-1908, 2016	332	2016
Learn what not to learn: Action elimination with deep reinforcement learning T Zahavy, M Haroush, N Merlis, DJ Mankowitz, S Mannor Advances in neural information processing systems 31, 2018	244	2018
Deep learning reconstruction of ultrashort pulses T Zahavy, A Dikopoltsev, D Moss, GI Haham, O Cohen, S Mannor, ... Optica 5 (5), 666-673, 2018	171	2018
Is a picture worth a thousand words? A deep multi-modal architecture for product classification in e-commerce T Zahavy, A Krishnan, A Magnani, S Mannor Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	104*	2018
A self-tuning actor-critic algorithm T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, HP van Hasselt, D Silver, ... Advances in neural information processing systems 33, 20913-20924, 2020	83	2020
Bootstrapped meta-learning S Flennerhag, Y Schroecker, T Zahavy, H van Hasselt, D Silver, S Singh International Conference on Learning Representations (ICLR) 2022, 2021	70	2021
Reward is enough for convex mdps T Zahavy, B O'Donoghue, G Desjardins, S Singh Advances in Neural Information Processing Systems 34, 25746-25759, 2021	64	2021
Shallow updates for deep reinforcement learning N Levine, T Zahavy, DJ Mankowitz, A Tamar, S Mannor Advances in Neural Information Processing Systems 30, 2017	51	2017
Online limited memory neural-linear bandits with likelihood matching O Nabati, T Zahavy, S Mannor International Conference on Machine Learning, 7905-7915, 2021	40*	2021
Discovering Evolution Strategies via Meta-Black-Box Optimization R Tjarko Lange, T Schaul, Y Chen, T Zahavy, V Dallibard, C Lu, S Singh, ... International Conference on Learning Representations (ICLR) 2023, 2022	39*	2022
Discovery of options via meta-learned subgoals V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, HP van Hasselt, ... Advances in Neural Information Processing Systems 34, 29861-29873, 2021	35	2021
Ensemble robustness and generalization of stochastic deep learning algorithms T Zahavy, B Kang, A Sivak, J Feng, H Xu, S Mannor arXiv preprint arXiv:1602.02389, 2016	34*	2016
Discovering Policies with DOMiNO: Diversity Optimization Maintaining Near Optimality T Zahavy, Y Schroecker, F Behbahani, K Baumli, S Flennerhag, S Hou, ... International Conference on Learning Representations (ICLR) 2023, 2022	33	2022
Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot R Ziv, A Dikopoltsev, T Zahavy, I Rubinstein, P Sidorenko, O Cohen, ... Optics express 28 (5), 7528-7538, 2020	28	2020
Discovering attention-based genetic algorithms via meta-black-box optimization R Lange, T Schaul, Y Chen, C Lu, T Zahavy, V Dalibard, S Flennerhag Proceedings of the Genetic and Evolutionary Computation Conference, 929-937, 2023	26	2023
Online Apprenticeship Learning L Shani, T Zahavy, S Mannor Proceedings of the AAAI Conference on Artificial Intelligence, 2021	26	2021
Emphatic algorithms for deep reinforcement learning R Jiang, T Zahavy, Z Xu, A White, M Hessel, C Blundell, H Van Hasselt International Conference on Machine Learning (ICML), 5023-5033, 2021	23	2021
Discovering a set of policies for the worst case reward T Zahavy, A Barreto, DJ Mankowitz, S Hou, B O'Donoghue, I Kemaev, ... International Conference on Learning Representations (ICLR) 2021, 2021	22	2021
Balancing constraints and rewards with meta-gradient d4pg DA Calian, DJ Mankowitz, T Zahavy, Z Xu, J Oh, N Levine, T Mann International Conference on Learning Representations (ICLR) 2021, 2020	22	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors