Follow
Tom Zahavy
Tom Zahavy
Senior Research Scientist, DeepMind
Verified email at deepmind.com - Homepage
Title
Cited by
Cited by
Year
A deep hierarchical approach to lifelong learning in minecraft
C Tessler, S Givony, T Zahavy, DJ Mankowitz, S Mannor
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence …, 2016
3232016
Graying the black box: Understanding dqns
T Zahavy, N Ben-Zrihem, S Mannor
International Conference on Machine Learning (ICML) 2016, 1899-1908, 2016
2082016
Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
T Zahavy, M Haroush, N Merlis, DJ Mankowitz, S Mannor
Advances in Neural Information Processing Systems (NeurIPS) 2018, 2018
1542018
Deep learning reconstruction of ultrashort pulses
T Zahavy, A Dikopoltsev, D Moss, GI Haham, O Cohen, S Mannor, ...
Optica 5 (5), 666-673, 2018
942018
Is a picture worth a thousand words? A deep multi-modal architecture for product classification in e-commerce
T Zahavy, A Krishnan, A Magnani, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
86*2018
A self-tuning actor-critic algorithm
T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, HP van Hasselt, D Silver, ...
Advances in Neural Information Processing Systems 33, 2020
45*2020
Shallow updates for deep reinforcement learning
N Levine, T Zahavy, DJ Mankowitz, A Tamar, S Mannor
Advances in Neural Information Processing Systems (NeurIPS) 2017, 3135-3145, 2017
402017
Ensemble Robustness and Generalization of Stochastic Deep Learning Algorithms
T Zahavy, B Kang, A Sivak, J Feng, H Xu, S Mannor
International Conference on Learning Representations Workshop (ICLRW'18), 2016
25*2016
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
O Nabati, T Zahavy, S Mannor
International Conference on Machine Learning (ICML) 2021, 2021
21*2021
Action assembly: Sparse imitation learning for text based games with combinatorial action spaces
C Tessler, T Zahavy, D Cohen, DJ Mankowitz, S Mannor
RLDM 2019: The Multi-disciplinary Conference on Reinforcement Learning and …, 2019
16*2019
Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot
R Ziv, A Dikopoltsev, T Zahavy, I Rubinstein, P Sidorenko, O Cohen, ...
Optics express 28 (5), 7528-7538, 2020
152020
Visualizing Dynamics: from t-SNE to SEMI-MDPs
NB Zrihem, T Zahavy, S Mannor
ICML Workshop on Human Interpretability in Machine Learning (WHI 2016),, 2016
13*2016
Unknown mixing times in apprenticeship and reinforcement learning
T Zahavy, A Cohen, H Kaplan, Y Mansour
Conference on Uncertainty in Artificial Intelligence (UAI), 2020, 2020
11*2020
Sub-Nyquist sampling of OFDM signals for cognitive radios
T Zahavy, O Shayer, D Cohen, A Tolmachev, YC Eldar
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
102014
Balancing Constraints and Rewards with Meta-Gradient D4PG
DA Calian, DJ Mankowitz, T Zahavy, Z Xu, J Oh, N Levine, T Mann
International Conference on Learning Representations (ICLR) 2021, 2021
92021
Discovery of Options via Meta-Learned Subgoals
V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, H van Hasselt, ...
Advances in Neural Information Processing Systems (NeurIPS) 2021, 2021
82021
Deep neural networks in single-shot ptychography
O Wengrowicz, O Peleg, T Zahavy, B Loevsky, O Cohen
Optics Express 28 (12), 17511-17520, 2020
82020
Apprenticeship learning via frank-wolfe
T Zahavy, A Cohen, H Kaplan, Y Mansour
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 6720-6728, 2020
82020
Discovering a Set of Policies for the Worst Case Reward
T Zahavy, A Barreto, DJ Mankowitz, S Hou, B O’Donoghue, I Kemaev, ...
International Conference on Learning Representations (ICLR) 2021, 2021
72021
Train on validation: squeezing the data lemon
G Tennenholtz, T Zahavy, S Mannor
arXiv preprint arXiv:1802.05846, 2018
72018
The system can't perform the operation now. Try again later.
Articles 1–20