Hosein Hasanbeig
Microsoft Research
Verified email at microsoft.com
Title
Cited by
Year
Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees
M Hasanbeig, Y Kantaros, A Abate, D Kroening, GJ Pappas, I Lee
IEEE Conference on Decision and Control (CDC), 2019
Cited by: 144; Year: 2019
Logically-Constrained Reinforcement Learning
M Hasanbeig, A Abate, D Kroening
arXiv preprint arXiv:1801.08099, 2018
Cited by: 122; Year: 2018
Cautious Reinforcement Learning with Logical Constraints
M Hasanbeig, A Abate, D Kroening
AAMAS, 483-491, 2020
Cited by: 103; Year: 2020
Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic
M Cai, M Hasanbeig, S Xiao, A Abate, Z Kan
IEEE Robotics and Automation Letters and IROS, 2021
Cited by: 89; Year: 2021
Deep Reinforcement Learning with Temporal Logics
M Hasanbeig, D Kroening, A Abate
International Conference on Formal Modeling and Analysis of Timed Systems, 1-22, 2020
Cited by: 66; Year: 2020
Certified reinforcement learning with logic guidance
H Hasanbeig, D Kroening, A Abate
Artificial Intelligence 322, 103949, 2023
Cited by: 65; Year: 2023
Deepsynth: Program Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning
M Hasanbeig, NY Jeppu, A Abate, T Melham, D Kroening
AAAI Conference on Artificial Intelligence (AAAI-21), 2021
Cited by: 58*; Year: 2021
Logically-Constrained Neural Fitted Q-iteration
M Hasanbeig, A Abate, D Kroening
AAMAS, 2012-2014, 2019
Cited by: 52; Year: 2019
Modular Deep Reinforcement Learning with Temporal Logic Specifications
LZ Yuan, M Hasanbeig, A Abate, D Kroening
arXiv preprint arXiv:1909.11591, 2019
Cited by: 49; Year: 2019
Evaluating cognitive maps in large language models with cogeval: No emergent planning
I Momennejad, H Hasanbeig, FV Frujeri, H Sharma, RO Ness, N Jojic, ...
Advances in Neural Information Processing Systems 36, 2023
Cited by: 45*; Year: 2023
Towards Verifiable and Safe Model-free Reinforcement Learning
M Hasanbeig, D Kroening, A Abate
Workshop on Artificial Intelligence and Formal Verification, Logics …, 2020
Cited by: 29*; Year: 2020
Shielding Atari Games with Bounded Prescience
M Giacobbe, M Hasanbeig, D Kroening, H Wijk
International Conference on Autonomous Agents and Multiagent Systems, 2021
Cited by: 26; Year: 2021
Deepsynth: Program synthesis for automatic task segmentation in deep reinforcement learning
M Hasanbeig, NY Jeppu, A Abate, T Melham, D Kroening
arXiv preprint arXiv:1911.10244, 2019
Cited by: 19; Year: 2019
LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning
M Hasanbeig, D Kroening, A Abate
International Conference on Quantitative Evaluation of Systems, 217-231, 2022
Cited by: 15; Year: 2022
On Synchronous Binary Log-Linear Learning and Second Order Q-learning
M Hasanbeig, L Pavel
IFAC World Congress 50 (1), 8987-8992, 2017
Cited by: 12; Year: 2017
Allure: A systematic protocol for auditing and improving llm-based evaluation of text using iterative in-context-learning
H Hasanbeig, H Sharma, L Betthauser, FV Frujeri, I Momennejad
arXiv preprint arXiv:2309.13701, 2023
Cited by: 8; Year: 2023
Distributed Coverage Control by Robot Networks in Unknown Environments using a Modified EM Algorithm
M Hasanbeig, L Pavel
International Journal of Computer and Information Engineering 11 (7), 815-823, 2017
Cited by: 8; Year: 2017
From Game-theoretic Multi-agent Log Linear Learning to Reinforcement Learning
M Hasanbeig, L Pavel
arXiv preprint arXiv:1802.02277, 2018
Cited by: 7; Year: 2018
ALLURE: auditing and improving llm-based evaluation of text using iterative in-context-learning
H Hasanbeig, H Sharma, L Betthauser, F Vieira Frujeri, I Momennejad
arXiv e-prints, arXiv: 2309.13701, 2023
Cited by: 5; Year: 2023
Jump operator planning: Goal-conditioned policy ensembles and zero-shot transfer
TJ Ringstrom, M Hasanbeig, A Abate
arXiv preprint arXiv:2007.02527, 2020
Cited by: 5; Year: 2020