Thomas William Anthony
Thomas William Anthony
Google DeepMind
Verified email at google.com
Title
Cited by
Cited by
Year
Thinking fast and slow with deep learning and tree search
TW Anthony, Z Tian, D Barber
Advances in Neural Information Processing Systems, 5360-5370, 2017
2002017
Openspiel: A framework for reinforcement learning in games
M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...
arXiv preprint arXiv:1908.09453, 2019
592019
From Poincar\'e Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization
J Perolat, R Munos, JB Lespiau, S Omidshafiei, M Rowland, P Ortega, ...
arXiv preprint arXiv:2002.08456, 2020
212020
Policy Gradient Search: Online Planning and Expert Iteration without Search Trees
TW Anthony, R Nishihara, P Moritz, T Salimans, J Schulman
arXiv preprint arXiv:1904.03646, 2019
212019
Learning to Play No-Press Diplomacy with Best Response Policy Iteration
T Anthony, T Eccles, A Tacchetti, J Kramár, I Gemp, TC Hudson, N Porcel, ...
arXiv preprint arXiv:2006.04635, 2020
92020
On the role of planning in model-based deep reinforcement learning
JB Hamrick, AL Friesen, F Behbahani, A Guez, F Viola, S Witherspoon, ...
arXiv preprint arXiv:2011.04021, 2020
82020
Smooth markets: A basic mechanism for organizing gradient-based learners
D Balduzzi, WM Czarnecki, TW Anthony, IM Gemp, E Hughes, JZ Leibo, ...
arXiv preprint arXiv:2001.04678, 2020
82020
OpenSpiel: A Framework for Reinforcement Learning in Games. CoRR abs/1908.09453 (2019)
M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...
arXiv preprint cs.LG/1908.09453, 2019
82019
Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games
E Hughes, TW Anthony, T Eccles, JZ Leibo, D Balduzzi, Y Bachrach
arXiv preprint arXiv:2003.00799, 2020
32020
Learning to Play against Any Mixture of Opponents
MO Smith, T Anthony, Y Wang, MP Wellman
arXiv preprint arXiv:2009.14180, 2020
22020
Sample-based Approximation of Nash in Large Many-Player Games via Gradient Descent
I Gemp, R Savani, M Lanctot, Y Bachrach, T Anthony, R Everett, ...
arXiv preprint arXiv:2106.01285, 2021
2021
Expert iteration
TW Anthony
UCL (University College London), 2021
2021
Multiagent Reinforcement Learning in Games with an Iterated Dominance Solution
Y Bachrach, T Lattimore, M Garnelo, J Perolat, D Balduzzi, T Anthony, ...
2019
Neural Design of Contests and All-Pay Auctions using Multi-Agent Simulation
T Anthony, I Gemp, J Kramar, T Eccles, A Tacchetti, Y Bachrach
2019
ITERATIVE EMPIRICAL GAME SOLVING VIA SINGLE POLICY BEST RESPONSE
MO Smith, T Anthony, MP Wellman
The system can't perform the operation now. Try again later.
Articles 1–15