Pedro A. Ortega
Title
Cited by
Year
AI safety gridworlds
J Leike, M Martic, V Krakovna, PA Ortega, T Everitt, A Lefrancq, L Orseau, ...
arXiv preprint arXiv:1711.09883, 2017
Cited by: 231 | Year: 2017
Social influence as intrinsic motivation for multi-agent deep reinforcement learning
N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ...
International conference on machine learning, 3040-3049, 2019
Cited by: 216 | Year: 2019
Thermodynamics as a theory of decision-making with information-processing costs
PA Ortega, DA Braun
Proceedings of the Royal Society A: Mathematical, Physical and Engineering …, 2013
Cited by: 216 | Year: 2013
A Medical Claim Fraud/Abuse Detection System based on Data Mining: A Case Study in Chile.
PA Ortega, CJ Figueroa, GA Ruz
DMIN 6, 26-29, 2006
Cited by: 143 | Year: 2006
Nash equilibria in multi-agent motor interactions
DA Braun, PA Ortega, DM Wolpert
PLoS computational biology 5 (8), e1000468, 2009
Cited by: 114 | Year: 2009
Meta reinforcement learning as task inference
J Humplik, A Galashov, L Hasenclever, PA Ortega, YW Teh, N Heess
arXiv preprint arXiv:1905.06424, 2019
Cited by: 74 | Year: 2019
A minimum relative entropy principle for learning and acting
PA Ortega, DA Braun
Journal of Artificial Intelligence Research 38, 475-511, 2010
Cited by: 69 | Year: 2010
Information, utility and bounded rationality
PA Ortega, DA Braun
International Conference on Artificial General Intelligence, 269-274, 2011
Cited by: 66 | Year: 2011
Causal reasoning from meta-reinforcement learning
I Dasgupta, J Wang, S Chiappa, J Mitrovic, P Ortega, D Raposo, ...
arXiv preprint arXiv:1901.08162, 2019
Cited by: 65 | Year: 2019
Path integral control and bounded rationality
DA Braun, PA Ortega, E Theodorou, S Schaal
2011 IEEE symposium on adaptive dynamic programming and reinforcement …, 2011
Cited by: 57 | Year: 2011
Intrinsic social motivation via causal influence in multi-agent RL
N Jaques, A Lazaridou, E Hughes, C Gulcehre, PA Ortega, DJ Strouse, ...
Cited by: 47 | Year: 2018
Meta-learning of sequential strategies
PA Ortega, JX Wang, M Rowland, T Genewein, Z Kurth-Nelson, ...
arXiv preprint arXiv:1905.03030, 2019
Cited by: 45 | Year: 2019
Generalized Thompson sampling for sequential decision-making and causal inference
PA Ortega, DA Braun
Complex Adaptive Systems Modeling 2 (2), 2014
Cited by: 41 | Year: 2014
Laser processing of Al2O3/a‐SiCx:H stacks: a feasible solution for the rear surface of high‐efficiency p‐type c‐Si solar cells
I Martín, P Ortega, M Colina, A Orpella, G López, R Alcubilla
Progress in Photovoltaics: Research and Applications 21 (5), 1171-1175, 2013
Cited by: 40 | Year: 2013
Action and perception as divergence minimization
D Hafner, PA Ortega, J Ba, T Parr, K Friston, N Heess
arXiv preprint arXiv:2009.01791, 2020
Cited by: 33 | Year: 2020
From Poincaré recurrence to convergence in imperfect information games: Finding equilibrium via regularization
J Perolat, R Munos, JB Lespiau, S Omidshafiei, M Rowland, P Ortega, ...
International Conference on Machine Learning, 8525-8535, 2021
Cited by: 31 | Year: 2021
Human decision-making under limited time
PA Ortega, AA Stocker
Advances in Neural Information Processing Systems 29, 2016
Cited by: 31 | Year: 2016
Information-Theoretic Bounded Rationality
PA Ortega, DA Braun, JS Dyer, KE Kim, N Tishby
arXiv preprint arXiv:1512.06789, 2015
Cited by: 31 | Year: 2015
Motor coordination: when two have to act as one
DA Braun, PA Ortega, DM Wolpert
Experimental brain research 211 (3), 631-641, 2011
Cited by: 30 | Year: 2011
n-type emitter surface passivation in c-Si solar cells by means of antireflective amorphous silicon carbide layers
R Ferre, I Martín, P Ortega, M Vetter, I Torres, R Alcubilla
Journal of applied physics 100 (7), 073703, 2006
Cited by: 25 | Year: 2006