Follow
Anna Harutyunyan
Anna Harutyunyan
DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Safe and efficient off-policy reinforcement learning
R Munos, T Stepleton, A Harutyunyan, M Bellemare
Advances in neural information processing systems 29, 2016
6782016
Reinforcement learning from demonstration through shaping
T Brys, A Harutyunyan, HB Suay, S Chernova, ME Taylor, A Nowé
Twenty-fourth international joint conference on artificial intelligence, 2015
2712015
Expressing Arbitrary Reward Functions as Potential-Based Advice
A Harutyunyan, S Devlin, P Vrancx, A Nowé
Twenty-Ninth Conference on Artificial Intelligence (AAAI), 2015
1132015
Policy Transfer using Reward Shaping
T Brys, A Harutyunyan, ME Taylor, A Nowé
Fourteenth International Conference on Autonomous Agents and Multi-Agent …, 2015
1022015
Q () with off-policy corrections
A Harutyunyan, MG Bellemare, T Stepleton, R Munos
International Conference on Algorithmic Learning Theory, 305-320, 2016
932016
Multi-objectivization of reinforcement learning problems by reward shaping
T Brys, A Harutyunyan, P Vrancx, ME Taylor, D Kudenko, A Nowé
2014 international joint conference on neural networks (IJCNN), 2315-2322, 2014
882014
Hindsight credit assignment
A Harutyunyan, W Dabney, T Mesnard, M Gheshlaghi Azar, B Piot, ...
Advances in neural information processing systems 32, 2019
852019
On the expressivity of markov reward
D Abel, W Dabney, A Harutyunyan, MK Ho, M Littman, D Precup, S Singh
Advances in Neural Information Processing Systems 34, 7799-7812, 2021
782021
Counterfactual credit assignment in model-free reinforcement learning
T Mesnard, T Weber, F Viola, S Thakoor, A Saade, A Harutyunyan, ...
arXiv preprint arXiv:2011.09464, 2020
612020
The termination critic
A Harutyunyan, W Dabney, D Borsa, N Heess, R Munos, D Precup
arXiv preprint arXiv:1902.09996, 2019
532019
Multi-objectivization and ensembles of shapings in reinforcement learning
T Brys, A Harutyunyan, P Vrancx, A Nowé, ME Taylor
Neurocomputing 263, 48-59, 2017
462017
Real-time gait event detection based on kinematic data coupled to a biomechanical model
S Lambrecht, A Harutyunyan, K Tanghe, M Afschrift, J De Schutter, ...
Sensors 17 (4), 671, 2017
292017
Predicting seat-off and detecting start-of-assistance events for assisting sit-to-stand with an exoskeleton
K Tanghe, A Harutyunyan, E Aertbeliën, F De Groote, J De Schutter, ...
IEEE Robotics and Automation Letters 1 (2), 792-799, 2016
272016
Shaping Mario with Human Advice
A Harutyunyan, T Brys, P Vrancx, A Nowé
Fourteenth International Conference on Autonomous Agents and Multi-Agent …, 2015
242015
Learning with options that terminate off-policy
A Harutyunyan, P Vrancx, PL Bacon, D Precup, A Nowe
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
232018
Reinforcement learning in POMDPs with memoryless options and option-observation initiation sets
D Steckelmacher, D Roijers, A Harutyunyan, P Vrancx, H Plisnier, A Nowé
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
232018
An analysis of quantile temporal-difference learning
M Rowland, R Munos, MG Azar, Y Tang, G Ostrovski, A Harutyunyan, ...
arXiv preprint arXiv:2301.04462, 2023
172023
Planted-model evaluation of algorithms for identifying differences between spreadsheets
A Harutyunyan, G Borradaile, C Chambers, C Scaffidi
2012 IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC …, 2012
172012
Off-Policy Shaping Ensembles in Reinforcement Learning
A Harutyunyan, T Brys, P Vrancx, A Nowe
Frontiers in Artificial Intelligence and Applications 263 (ECAI 2014), 1021 …, 2014
142014
Conditional importance sampling for off-policy learning
M Rowland, A Harutyunyan, H Hasselt, D Borsa, T Schaul, R Munos, ...
International Conference on Artificial Intelligence and Statistics, 45-55, 2020
122020
The system can't perform the operation now. Try again later.
Articles 1–20