Johan Ferret

Cited by

	All	Since 2019
Citations	1064	1063
h-index	10	10
i10-index	10	10

620

310

155

465

2020202120222023202424 85 134 209 606

Co-authors

Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)Verified email at univ-lorraine.fr
Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)Verified email at univ-lille.fr
Thomas MesnardResearch Scientist at Google DeepMindVerified email at google.com
Philippe PreuxProfessor of computer science, Université de Lille, LIFL, SequeL, INRIAVerified email at univ-lille.fr
Raphaël MarinierGoogle AIVerified email at google.com
Harrison LeeGoogle ResearchVerified email at google.com
Samrat PhataleGoogle ResearchVerified email at google.com
Nathan GrinsztajnInriaVerified email at inria.fr
Nino VieillardGoogle DeepMindVerified email at google.com
Léonard HussenotGoogle DeepMindVerified email at google.com
Olivier BachemResearch Scientist, Google BrainVerified email at google.com
Robert DadashiGoogle DeepMindVerified email at google.com
Geoffrey CideronGoogle DeepMindVerified email at google.com
Yannis Flet-BerliacPostdoc, Stanford UniversityVerified email at stanford.edu
Alexis D. JacqGoogleVerified email at google.com
Ramé AlexandreGoogle DeepMindVerified email at google.com
Roee AharoniGoogle ResearchVerified email at google.com
Sabela RamosSoftware Engineer. Google.Verified email at google.com
Mathieu BlondelGoogleVerified email at google.com
Eduardo PignatelliUniversity College LondonVerified email at ucl.ac.uk

Johan Ferret

Research Scientist, Google DeepMind

Verified email at google.com - Homepage

Reinforcement Learning Machine Learning Artificial Intelligence


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemini: a Family of Highly Capable Multimodal Models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	442	2023
Acme: A Research Framework for Distributed Reinforcement Learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	232	2020
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback H Lee, S Phatale, H Mansoor, T Mesnard, J Ferret, K Lu, C Bishop, E Hall, ... arXiv preprint arXiv:2309.00267, 2023	151	2023
Adversarially Guided Actor-Critic Y Flet-Berliac, J Ferret, O Pietquin, P Preux, M Geist International Conference on Learning Representations (ICLR 2021), 2021	67	2021
Self-Attentional Credit Assignment for Transfer in Reinforcement Learning J Ferret, R Marinier, M Geist, O Pietquin International Joint Conference on Artificial Intelligence (IJCAI 2020), 2019	31	2019
Gemma: Open Models Based on Gemini Research and Technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024	28	2024
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback P Roit, J Ferret, L Shani*, R Aharoni, G Cideron, R Dadashi, M Geist, ... ACL, 2023	26	2023
Self-Imitation Advantage Learning J Ferret, O Pietquin, M Geist International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020	23	2020
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning N Grinsztajn, J Ferret, O Pietquin, P Preux, M Geist Advances in Neural Information Processing Systems (NeurIPS 2021), 2021	21	2021
Lazy-MDPs: Towards Interpretable Reinforcement Learning By Learning When To Act A Jacq, J Ferret, O Pietquin, M Geist International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2022	19*	2022
WARM: On the Benefits of Weight Averaged Reward Models A Ramé, N Vieillard, L Hussenot, R Dadashi, G Cideron, O Bachem, ... arXiv preprint arXiv:2401.12187, 2024	8	2024
Credit assignment as a proxy for transfer in reinforcement learning J Ferret, R Marinier, M Geist, O Pietquin Learning Transferrable Skills Workshop, NeurIPS, 2019	6	2019
Direct Language Model Alignment from Online AI Feedback S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ... arXiv preprint arXiv:2402.04792, 2024	4	2024
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning E Pignatelli, J Ferret, M Geist, T Mesnard, H van Hasselt, L Toni arXiv preprint arXiv:2312.01072, 2023	2	2023
More efficient exploration with symbolic priors on action sequence equivalences T Johnstone, N Grinsztajn, J Ferret, P Preux Deep Reinforcement Learning Workshop, NeurIPS, 2022	2*	2022
On actions that matter: Credit assignment and interpretability in reinforcement learning J Ferret Université de Lille, 2022	2	2022
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models A Botev, S De, SL Smith, A Fernando, GC Muraru, R Haroun, L Berrada, ... arXiv preprint arXiv:2404.07839, 2024		2024
Offline Credit Assignment in Deep Reinforcement Learning with Hindsight Discriminator Networks J Ferret, O Pietquin, M Geist EWRL, 2022		2022

The system can't perform the operation now. Try again later.

Articles 1–18

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors