Paul Christiano
Paul Christiano
Alignment Research Center
Verified email at alignmentresearchcenter.org - Homepage
Title
Cited by
Cited by
Year
Concrete problems in AI safety
D Amodei, C Olah, J Steinhardt, P Christiano, J Schulman, D Mané
arXiv preprint arXiv:1606.06565, 2016
13102016
Theano: A Python framework for fast computation of mathematical expressions
R Al-Rfou, G Alain, A Almahairi, C Angermueller, D Bahdanau, N Ballas, ...
arXiv e-prints, arXiv: 1605.02688, 2016
851*2016
Deep reinforcement learning from human preferences
P Christiano, J Leike, TB Brown, M Martic, S Legg, D Amodei
arXiv preprint arXiv:1706.03741, 2017
5062017
Electrical flows, laplacian systems, and faster approximation of maximum flow in undirected graphs
P Christiano, JA Kelner, A Madry, DA Spielman, SH Teng
Proceedings of the forty-third annual ACM symposium on Theory of computing …, 2011
3282011
A connection between generative adversarial networks, inverse reinforcement learning, and energy-based models
C Finn, P Christiano, P Abbeel, S Levine
arXiv preprint arXiv:1611.03852, 2016
2482016
Transfer from simulation to real world through learning deep inverse dynamics model
P Christiano, Z Shah, I Mordatch, J Schneider, T Blackwell, J Tobin, ...
arXiv preprint arXiv:1610.03518, 2016
1552016
Quantum money from hidden subspaces
S Aaronson, P Christiano
Proceedings of the forty-fourth annual ACM symposium on Theory of computing …, 2012
1212012
Fine-tuning language models from human preferences
DM Ziegler, N Stiennon, J Wu, TB Brown, A Radford, D Amodei, ...
arXiv preprint arXiv:1909.08593, 2019
842019
A cryptographic test of quantumness and certifiable randomness from a single quantum device
Z Brakerski, P Christiano, U Mahadev, U Vazirani, T Vidick
2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS …, 2018
652018
Unrestricted adversarial examples
TB Brown, N Carlini, C Zhang, C Olsson, P Christiano, I Goodfellow
arXiv preprint arXiv:1809.08352, 2018
602018
Learning to summarize from human feedback
N Stiennon, L Ouyang, J Wu, DM Ziegler, R Lowe, C Voss, A Radford, ...
arXiv preprint arXiv:2009.01325, 2020
472020
AI safety via debate
G Irving, P Christiano, D Amodei
arXiv preprint arXiv:1805.00899, 2018
422018
Concrete problems in AI safety. arXiv
D Amodei, C Olah, J Steinhardt, P Christiano, J Schulman, D Mané
arXiv preprint arXiv:1606.06565, 2016
382016
Robust Cooperation in the Prisoner's Dilemma: Program Equilibrium via Provability Logic
M Barasz, P Christiano, B Fallenstein, M Herreshoff, P LaVictoire, ...
arXiv preprint arXiv:1401.5577, 2014
32*2014
Supervising strong learners by amplifying weak experts
P Christiano, B Shlegeris, D Amodei
arXiv preprint arXiv:1810.08575, 2018
272018
Online local learning via semidefinite programming
P Christiano
Proceedings of the forty-sixth annual ACM symposium on Theory of computing …, 2014
152014
Non-omniscience, probabilistic inference, and metamathematics
P Christiano
Machine Intelligence Research Institute, Berkeley, CA, June, 2014
14*2014
Reflective oracles: A foundation for game theory in artificial intelligence
B Fallenstein, J Taylor, PF Christiano
International Workshop on Logic, Rationality and Interaction, 411-415, 2015
13*2015
Lossless fault-tolerant data structures with additive overhead
P Christiano, ED Demaine, S Kishore
Workshop on Algorithms and Data Structures, 243-254, 2011
92011
Recursively summarizing books with human feedback
J Wu, L Ouyang, DM Ziegler, N Stiennon, R Lowe, J Leike, P Christiano
arXiv preprint arXiv:2109.10862, 2021
42021
The system can't perform the operation now. Try again later.
Articles 1–20