Adithya M Devraj
Title
Cited by
Cited by
Year
Zap Q-learning
AM Devraj, S Meyn
Advances in Neural Information Processing Systems, 2235-2244, 2017
442017
Fastest convergence for Q-learning
AM Devraj, SP Meyn
arXiv preprint arXiv:1707.03770, 2017
282017
Learning techniques for feedback particle filter design
A Radhakrishnan, A Devraj, S Meyn
2016 IEEE 55th Conference on Decision and Control (CDC), 5453-5459, 2016
152016
Differential TD learning for value function approximation
AM Devraj, SP Meyn
Decision and Control (CDC), 2016 IEEE 55th Conference on, 6347-6354, 2016
112016
Power allocation in energy harvesting sensors with ARQ: A convex optimization approach
AM Devraj, MK Sharma, CR Murthy
2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP …, 2014
82014
Zap Q-Learning-A User's Guide
AM Devraj, A Bušić, S Meyn
2019 Fifth Indian Control Conference (ICC), 10-15, 2019
72019
Explicit Mean-Square Error Bounds for Monte-Carlo and Linear Stochastic Approximation
S Chen, AM Devraj, A Bušić, S Meyn
arXiv preprint arXiv:2002.02584, 2020
62020
Optimal matrix momentum stochastic approximation and applications to q-learning
AM Devraj, A Bušić, S Meyn
arXiv preprint arXiv:1809.06277, 2018
52018
Zap Q-Learning With Nonlinear Function Approximation
S Chen, AM Devraj, A Bušić, S Meyn
arXiv preprint arXiv:1910.05405, 2019
42019
Differential temporal difference learning
AM Devraj, I Kontoyiannis, SP Meyn
arXiv preprint arXiv:1812.11137, 2018
42018
Reinforcement learning for control of building HVAC systems
NS Raman, AM Devraj, P Barooah, SP Meyn
2020 American Control Conference (ACC), 2326-2332, 2020
32020
Q-learning with uniformly bounded variance: Large discounting is not a barrier to fast learning
AM Devraj, SP Meyn
arXiv preprint arXiv:2002.10301, 2020
32020
Model-Free Primal-Dual Methods for Network Optimization with Application to Real-Time Optimal Power Flow
Y Chen, A Bernstein, A Devraj, S Meyn
arXiv preprint arXiv:1909.13132, 2019
32019
On Matrix Momentum Stochastic Approximation and Applications to Q-learning
AM Devraj, A Bušic, S Meyn
57th Annual Allerton Conference on Communication, Control, and Computing …, 2019
32019
Stochastic variance reduced primal dual algorithms for empirical composition optimization
AM Devraj, J Chen
Advances in Neural Information Processing Systems, 9882-9892, 2019
22019
Geometric ergodicity in a weighted Sobolev space
A Devraj, I Kontoyiannis, S Meyn
The Annals of Probability 48 (1), 380-403, 2020
12020
Zap Q-Learning for Optimal Stopping Time Problems
S Chen, AM Devraj, A Bušić, SP Meyn
arXiv preprint arXiv:1904.11538, 2019
12019
Zap Stochastic Approximation and Reinforcement Learning
F Durand, AM Devraj, S Meyn
2020
Accelerating Optimization and Reinforcement Learning with Quasi-Stochastic Approximation
S Chen, A Devraj, A Bernstein, S Meyn
arXiv preprint arXiv:2009.14431, 2020
2020
Zap Q-Learning for´ optimal stopping
S Chen, AM Devraj, A Bušić, S Meyn
2020 American Control Conference (ACC), 3920-3925, 2020
2020
The system can't perform the operation now. Try again later.
Articles 1–20