Pieter Abbeel
UC Berkeley | Covariant
Verified email at cs.berkeley.edu
Title
Cited by
Year
Model-agnostic meta-learning for fast adaptation of deep networks
C Finn, P Abbeel, S Levine
International conference on machine learning, 1126-1135, 2017
Cited by: 12323; Year: 2017
Denoising diffusion probabilistic models
J Ho, A Jain, P Abbeel
Advances in neural information processing systems 33, 6840-6851, 2020
Cited by: 9043; Year: 2020
Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor
T Haarnoja, A Zhou, P Abbeel, S Levine
International conference on machine learning, 1861-1870, 2018
Cited by: 8019; Year: 2018
Trust region policy optimization
J Schulman, S Levine, P Abbeel, M Jordan, P Moritz
International conference on machine learning, 1889-1897, 2015
Cited by: 8010; Year: 2015
Infogan: Interpretable representation learning by information maximizing generative adversarial nets
X Chen, Y Duan, R Houthooft, J Schulman, I Sutskever, P Abbeel
Advances in neural information processing systems 29, 2016
Cited by: 5242; Year: 2016
Multi-agent actor-critic for mixed cooperative-competitive environments
R Lowe, Y Wu, A Tamar, J Harb, P Abbeel, I Mordatch
Advances in neural information processing systems 30, 2017
Cited by: 4700; Year: 2017
Apprenticeship learning via inverse reinforcement learning
P Abbeel, AY Ng
Proceedings of the twenty-first international conference on Machine learning, 1, 2004
Cited by: 4147; Year: 2004
End-to-end training of deep visuomotor policies
S Levine, C Finn, T Darrell, P Abbeel
Journal of Machine Learning Research 17 (39), 1-40, 2016
Cited by: 3869; Year: 2016
High-dimensional continuous control using generalized advantage estimation
J Schulman, P Moritz, S Levine, M Jordan, P Abbeel
arXiv preprint arXiv:1506.02438, 2015
Cited by: 3469; Year: 2015
Domain randomization for transferring deep neural networks from simulation to the real world
J Tobin, R Fong, A Ray, J Schneider, W Zaremba, P Abbeel
2017 IEEE/RSJ international conference on intelligent robots and systems …, 2017
Cited by: 3120; Year: 2017
Hindsight experience replay
M Andrychowicz, F Wolski, A Ray, J Schneider, R Fong, P Welinder, ...
Advances in neural information processing systems 30, 2017
Cited by: 2696; Year: 2017
Soft actor-critic algorithms and applications
T Haarnoja, A Zhou, K Hartikainen, G Tucker, S Ha, J Tan, V Kumar, ...
arXiv preprint arXiv:1812.05905, 2018
Cited by: 2452; Year: 2018
Benchmarking deep reinforcement learning for continuous control
Y Duan, X Chen, R Houthooft, J Schulman, P Abbeel
International conference on machine learning, 1329-1338, 2016
Cited by: 1994; Year: 2016
A simple neural attentive meta-learner
N Mishra, M Rohaninejad, X Chen, P Abbeel
arXiv preprint arXiv:1707.03141, 2017
Cited by: 1482; Year: 2017
Sim-to-real transfer of robotic control with dynamics randomization
XB Peng, M Andrychowicz, W Zaremba, P Abbeel
2018 IEEE international conference on robotics and automation (ICRA), 3803-3810, 2018
Cited by: 1438; Year: 2018
Reinforcement learning with deep energy-based policies
T Haarnoja, H Tang, P Abbeel, S Levine
International conference on machine learning, 1352-1361, 2017
Cited by: 1376; Year: 2017
Constrained policy optimization
J Achiam, D Held, A Tamar, P Abbeel
International conference on machine learning, 22-31, 2017
Cited by: 1360; Year: 2017
Decision transformer: Reinforcement learning via sequence modeling
L Chen, K Lu, A Rajeswaran, K Lee, A Grover, M Laskin, P Abbeel, ...
Advances in neural information processing systems 34, 15084-15097, 2021
Cited by: 1195; Year: 2021
Guided cost learning: Deep inverse optimal control via policy optimization
C Finn, S Levine, P Abbeel
International conference on machine learning, 49-58, 2016
Cited by: 1090; Year: 2016
RL²: Fast Reinforcement Learning via Slow Reinforcement Learning
Y Duan, J Schulman, X Chen, PL Bartlett, I Sutskever, P Abbeel
arXiv preprint arXiv:1611.02779, 2016
Cited by: 1080; Year: 2016