Follow
Yao Ma
Yao Ma
Amazon AI
Verified email at amazon.com
Title
Cited by
Cited by
Year
Theoretical comparisons of positive-unlabeled learning against positive-negative learning
G Niu, MC Du Plessis, T Sakai, Y Ma, M Sugiyama
Advances in neural information processing systems 29, 2016
1292016
A policy search method for temporal logic specified reinforcement learning tasks
X Li, Y Ma, C Belta
2018 Annual American Control Conference (ACC), 240-245, 2018
772018
Hybrid constraint SVR for facial age estimation
J Liu, Y Ma, L Duan, F Wang, Y Liu
Signal Processing 94, 576-582, 2014
452014
Gradient Descent for Sparse Rank-One Matrix Completion for Crowd-Sourced Aggregation of Sparsely Interacting Workers
Y Ma, A Olshevsky, C Szepesvári, V Saligrama
Journal of Machine Learning Research 21 (133), 1-36, 2020
262020
Bandit-based task assignment for heterogeneous crowdsourcing
H Zhang, Y Ma, M Sugiyama
Neural computation 27 (11), 2447-2475, 2015
202015
Automata guided reinforcement learning with demonstrations
X Li, Y Ma, C Belta
arXiv preprint arXiv:1809.06305, 2018
152018
Double layer multiple task learning for age estimation with insufficient training samples
Y Ma, J Liu, X Yang, Y Liu, N Zheng
Neurocomputing 147, 380-386, 2015
122015
Facial age estimation from web photos using multiple-instance learning
X Yang, J Liu, Y Ma, J Xue
2014 IEEE international conference on multimedia and expo (ICME), 1-6, 2014
112014
Crowdsourcing with sparsely interacting workers
Y Ma, A Olshevsky, V Saligrama, C Szepesvari
arXiv preprint arXiv:1706.06660, 2017
62017
An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions
Y Ma, T Zhao, K Hatano, M Sugiyama
ECML PKDD 2014, 2014
62014
Automata guided hierarchical reinforcement learning for zero-shot skill composition
X Li, Y Ma, C Belta
52018
Automata-guided hierarchical reinforcement learning for skill composition
X Li, Y Ma, C Belta
arXiv preprint arXiv:1711.00129, 2017
42017
Online Markov decision processes with policy iteration
Y Ma, H Zhang, M Sugiyama
arXiv preprint arXiv:1510.04454, 2015
32015
Automata Guided Skill Composition
X Li, Y Ma, C Belta
2018
Gradient Descent for Sparse Rank-One Matrix Completion for Crowd-Sourced Aggregation of Sparsely Interacting Workers
Y Ma, A Olshevsky, C Szepesvári, V Saligrama
ICML 2018, 2018
2018
AUTOMATA GUIDED HIERARCHICAL REINFORCE-MENT LEARNING FOR ZERO-SHOT SKILL COMPOSI
X Li, Y Ma, C Belta
arXiv preprint arXiv:1711.00129, 2017
2017
Online decision making in non-stationary Markovian environments
Y Ma
(No Title), 2015
2015
An Online Policy Gradient Algorithm for Continuous State and Action Markov Decision Processes with Bandit Feedback
Y Ma, M Sugiyama
電子情報通信学会技術研究報告 114 (306 (IBISML2014 35-84)), 141-148, 2014
2014
The system can't perform the operation now. Try again later.
Articles 1–18