Tetsuro Morimura
Tetsuro Morimura
CyberAgent, Inc.
Verified email at cyberagent.co.jp
Title
Cited by
Cited by
Year
Nonparametric return distribution approximation for reinforcement learning
T Morimura, M Sugiyama, H Kashima, H Hachiya, T Tanaka
ICML, 2010
792010
Parametric return density estimation for reinforcement learning
T Morimura, M Sugiyama, H Kashima, H Hachiya, T Tanaka
arXiv preprint arXiv:1203.3497, 2012
722012
Map matching with hidden Markov model on sampled road network
R Raymond, T Morimura, T Osogami, N Hirosue
Proceedings of the 21st International Conference on Pattern Recognition …, 2012
642012
Statistical reinforcement learning: modern machine learning approaches
M Sugiyama
CRC Press, 2015
492015
Ibm mega traffic simulator
T Osogami, T Imamichi, H Mizuta, T Morimura, R Raymond, T Suzumura, ...
IBM Res., Tokyo, Japan, IBM Res. Rep. RT0896, 2012
392012
Utilizing the natural gradient in temporal difference reinforcement learning with eligibility traces
T Morimura, E Uchibe, K Doya
International Symposium on Information Geometry and Its Applications, 256-263, 2005
342005
Methods and apparatus for pooling and depooling the transmission of stream data
S Raman
US Patent 7,096,272, 2006
272006
Recharge delay for an implantable medical device
R Leinders, N Torgerson, M Stein, T Goblish, T Heathershaw, J Rodriguez
US Patent App. 10/133,703, 2003
252003
これからの強化学習
牧野貴樹, 澁谷長史, 白川真一, 浅田稔, 麻生英樹, 荒井幸代, 飯間等, ...
森北出版, 2016
242016
Solving inverse problem of Markov chain with partial observations.
T Morimura, T Osogami, T Idé
NIPS, 1655-1663, 2013
232013
City-wide traffic flow estimation from a limited number of low-quality cameras
T Idé, T Katsuki, T Morimura, R Morris
IEEE Transactions on Intelligent Transportation Systems 18 (4), 950-959, 2016
202016
Derivatives of logarithmic stationary distributions for policy gradient reinforcement learning
T Morimura, E Uchibe, J Yoshimoto, J Peters, K Doya
Neural computation 22 (2), 342-376, 2010
162010
A generalized natural actor-critic algorithm
T Morimura, E Uchibe, J Yoshimoto, K Doya
Advances in neural information processing systems 22, 1312-1320, 2009
142009
Statistical origin-destination generation with multiple sources
T Morimura, S Kato
Proceedings of the 21st International Conference on Pattern Recognition …, 2012
132012
Identification of antibiotic clarithromycin binding peptide displayed by T7 phage particles
T Morimura, N Noda, Y Kato, T Watanabe, T Saitoh, T Yamazaki, ...
The Journal of antibiotics 59 (10), 625-632, 2006
112006
A consistent method for graph based anomaly localization
S Hara, T Morimura, T Takahashi, H Yanagisawa, T Suzuki
Artificial intelligence and statistics, 333-341, 2015
102015
Adaptive step-size policy gradients with average reward metric
T Matsubara, T Morimura, J Morimoto
Proceedings of 2nd Asian Conference on Machine Learning, 285-298, 2010
102010
Bayesian unsupervised vehicle counting
T Katasuki, T Morimura, T Ide
Technical Report, IBM Research RT0951, 2013
92013
A new natural policy gradient by stationary distribution metric
T Morimura, E Uchibe, J Yoshimoto, K Doya
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2008
92008
Updating policy parameters under Markov decision process system environment
T Morimura, T Osogami, T Shirai
US Patent 8,818,925, 2014
82014
The system can't perform the operation now. Try again later.
Articles 1–20