‪Eric Hambro‬ - ‪Google Scholar‬

Cited by

	All	Since 2019
Citations	11407	11396
h-index	11	11
i10-index	11	11

Citations per year: 2022: 72, 2023: 3549, 2024: 7659

Co-authors

Heinrich Küttler - xAI - Verified email at math.lmu.de
Tim Rocktäschel - Professor of AI at UCL, Open-Endedness Team Lead at Google DeepMind, Fellow ELLIS - Verified email at cs.ucl.ac.uk
Mikayel Samvelyan - Google DeepMind - Verified email at google.com
Roberta Raileanu - Research Scientist at Meta, Honorary Lecturer at UCL
Sharath Chandra Raparthy - Member of Technical Staff at Reka AI

Eric Hambro

Anthropic

Verified email at anthropic.com

Machine Learning, Reinforcement Learning, Natural Language Processing


Title	Authors	Venue	Cited by	Year
LLaMA: Open and efficient foundation language models	H Touvron, T Lavril, G Izacard, X Martinet, MA Lachaux, T Lacroix, ...	arXiv preprint arXiv:2302.13971, 2023	9885	2023
Toolformer: Language models can teach themselves to use tools	T Schick, J Dwivedi-Yu, R Dessì, R Raileanu, M Lomeli, E Hambro, ...	Advances in Neural Information Processing Systems 36, 2024	1171	2024
MiniHack the planet: A sandbox for open-ended reinforcement learning research	M Samvelyan, R Kirk, V Kurin, J Parker-Holder, M Jiang, E Hambro, ...	NeurIPS 2021 Datasets and Benchmarks, 2021	89	2021
LLaMA: open and efficient foundation language models, 2023 [J]	H Touvron, T Lavril, G Izacard, X Martinet, MA Lachaux, T Lacroix, ...	URL https://arxiv.org/abs/2302.13971, 2023	64	2023
Understanding the effects of RLHF on LLM generalisation and diversity	R Kirk, I Mediratta, C Nalmpantis, J Luketina, E Hambro, E Grefenstette, ...	arXiv preprint arXiv:2310.06452, 2023	59	2023
GPflux: A library for deep Gaussian processes	V Dutordoir, H Salimbeni, E Hambro, J McLeod, F Leibfried, A Artemev, ...	arXiv preprint arXiv:2104.05674, 2021	29	2021
Rainbow teaming: Open-ended generation of diverse adversarial prompts	M Samvelyan, SC Raparthy, A Lupu, E Hambro, AH Markosyan, M Bhatt, ...	arXiv preprint arXiv:2402.16822, 2024	27	2024
Teaching large language models to reason with reinforcement learning	A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ...	arXiv preprint arXiv:2403.04642, 2024	20	2024
Insights from the NeurIPS 2021 NetHack Challenge	E Hambro, S Mohanty, D Babaev, M Byeon, D Chakraborty, ...	NeurIPS 2021 Competitions and Demonstrations Track, 41-52, 2022	19	2022
GLoRe: When, where, and how to improve LLM reasoning via global and local refinements	A Havrilla, S Raparthy, C Nalmpantis, J Dwivedi-Yu, M Zhuravinskyi, ...	arXiv preprint arXiv:2402.10963, 2024	14	2024
Dungeons and Data: A Large-Scale NetHack Dataset	E Hambro, R Raileanu, D Rothermel, V Mella, T Rocktäschel, H Küttler, ...	Advances in Neural Information Processing Systems 35, 24864-24878, 2022	14	2022
Generalization to new sequential decision making tasks with in-context learning	SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu	arXiv preprint arXiv:2312.03801, 2023	9	2023
moolib: A Platform for Distributed RL. 2022	V Mella, E Hambro, D Rothermel, H Küttler	URL https://github.com/facebookresearch/moolib 8, 18, 2022	7*	2022
Know When To Stop: A Study of Semantic Drift in Text Generation	A Spataru, E Hambro, E Voita, N Cancedda	arXiv preprint arXiv:2404.05411, 2024		2024
Learning to Solve New sequential decision-making Tasks with In-Context Learning	SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu	NeurIPS 2023 Foundation Models for Decision Making Workshop, 0		
