Tom Everitt

Cited by

	All	Since 2019
Citations	1793	1622
h-index	17	16
i10-index	34	29

Citations per year: 2015: 12, 2016: 9, 2017: 27, 2018: 100, 2019: 179, 2020: 252, 2021: 279, 2022: 316, 2023: 432, 2024: 158

Public access (based on funding mandates)

9 articles available
0 articles not available

Co-authors

Marcus Hutter - Researcher@DeepMind & Professor at ANU - Verified email at anu.edu.au
Victoria Krakovna - Senior Research Scientist at DeepMind - Verified email at google.com
Ramana Kumar - DeepMind - Verified email at cl.cam.ac.uk
Pedro A. Ortega - Artificial Intelligence & Machine Learning - Verified email at adaptiveagents.org
Ryan Carey - University of Oxford - Verified email at philosophy.ox.ac.uk
Jan Leike - OpenAI - Verified email at openai.com
Miljan Martic - DeepMind - Verified email at google.com
Zachary Kenton - Google DeepMind - Verified email at google.com
Laurent Orseau - Research Scientist at Google DeepMind - Verified email at google.com
Vishal Maini - DeepMind - Verified email at deepmind.com
Eric Langlois - Sanctuary AI - Verified email at cs.toronto.edu
Vladimir Mikulik - DeepMind - Verified email at google.com
Andrew Lefrancq - DeepMind - Verified email at google.com
Jarryd Martin - The Walter and Eliza Hall Institute - Verified email at wehi.edu.au
David Scott Krueger - University Assistant Professor, University of Cambridge - Verified email at cam.ac.uk
Alessandro Abate - Professor of Verification and Control, University of Oxford, UK - Verified email at cs.ox.ac.uk
Michael Wooldridge - University of Oxford - Verified email at cs.ox.ac.uk
James Fox - University of York - Verified email at york.ac.uk
Matt MacDermott - PhD Student, Imperial College London - Verified email at ic.ac.uk
Suraj Narayanan Sasikumar - Socialtrait - Verified email at socialtrait.com

Tom Everitt

Staff Research Scientist at Google DeepMind

Verified email at google.com - Homepage

AI Safety, Artificial General Intelligence, Causality, Incentives


Title	Cited by	Year
AI safety gridworlds. J Leike, M Martic, V Krakovna, PA Ortega, T Everitt, A Lefrancq, L Orseau, ... arXiv preprint arXiv:1711.09883, 2017	321	2017
Scalable agent alignment via reward modeling: a research direction. J Leike, D Krueger, T Everitt, M Martic, V Maini, S Legg. arXiv preprint arXiv:1811.07871, 2018	242	2018
AGI safety literature review. T Everitt, G Lea, M Hutter. International Joint Conference on AI (IJCAI), 2018	137	2018
Count-based exploration in feature space for reinforcement learning. J Martin, SN Sasikumar, T Everitt, M Hutter. International Joint Conference on AI (IJCAI), 2017	127	2017
Reinforcement Learning with Corrupted Reward Channel. T Everitt, V Krakovna, L Orseau, M Hutter, S Legg. 26th International Joint Conference on Artificial Intelligence (IJCAI), 2017	115	2017
Alignment of language agents. Z Kenton, T Everitt, L Weidinger, I Gabriel, V Mikulik, G Irving. arXiv preprint arXiv:2103.14659, 2021	112	2021
Specification gaming: the flip side of AI ingenuity. V Krakovna, J Uesato, V Mikulik, M Rahtz, T Everitt, R Kumar, Z Kenton, ... DeepMind Blog 3, 2020	87	2020
Reward tampering problems and solutions in reinforcement learning: A causal influence diagram perspective. T Everitt, M Hutter, R Kumar, V Krakovna. Synthese, 2021	80	2021
Avoiding wireheading with value reinforcement learning. T Everitt, M Hutter. International Conference on Artificial General Intelligence (AGI), 12-22, 2016	44	2016
Shaking the foundations: delusions in sequence models for interaction and control. PA Ortega, M Kunesch, G Delétang, T Genewein, J Grau-Moya, J Veness, ... arXiv preprint arXiv:2110.10819, 2021	43	2021
Agent incentives: A causal perspective. T Everitt, R Carey, ED Langlois, PA Ortega, S Legg. Proceedings of the AAAI Conference on Artificial Intelligence 35 (13), 11487 …, 2021	38	2021
Towards safe artificial general intelligence. T Everitt. PQDT-Global, 2019	32	2019
Self-modification of policy and utility function in rational agents. T Everitt, D Filan, M Daswani, M Hutter. International Conference on Artificial General Intelligence (AGI), 1-11, 2016	31	2016
Understanding agent incentives using causal influence diagrams. Part I: Single action settings. T Everitt, PA Ortega, E Barnes, S Legg. arXiv preprint arXiv:1902.09980, 2019	30	2019
Universal artificial intelligence: Practical agents and fundamental challenges. T Everitt, M Hutter. Foundations of trusted autonomy, 15-46, 2018	30	2018
Modeling AGI safety frameworks with causal influence diagrams. T Everitt, R Kumar, V Krakovna, S Legg. arXiv preprint arXiv:1906.08663, 2019	23	2019
Artificial general intelligence. T Everitt, B Goertzel, A Potapov. Lecture Notes in Artificial Intelligence. Heidelberg: Springer, 2017	17	2017
A game-theoretic analysis of the off-switch game. T Wängberg, M Böörs, E Catt, T Everitt, M Hutter. Artificial General Intelligence: 10th International Conference, AGI 2017 …, 2017	17	2017
Analytical results on the BFS vs. DFS algorithm selection problem. Part I: tree search. T Everitt, M Hutter. AI 2015: Advances in Artificial Intelligence: 28th Australasian Joint …, 2015	17	2015
Discovering agents. Z Kenton, R Kumar, S Farquhar, J Richens, M MacDermott, T Everitt. Artificial Intelligence 322, 103963, 2023	15	2023
