Follow
Chengzhuo Ni
Title
Cited by
Cited by
Year
On the convergence and sample efficiency of variance-reduced policy gradient method
J Zhang, C Ni, C Szepesvari, M Wang
Advances in Neural Information Processing Systems 34, 2228-2240, 2021
532021
Learning to control in metric space with optimal regret
C Ni, LF Yang, M Wang
2019 57th Annual Allerton Conference on Communication, Control, and …, 2019
282019
Representation learning for low-rank general-sum markov games
C Ni, Y Song, X Zhang, Z Ding, C Jin, M Wang
The Eleventh International Conference on Learning Representations, 2022
18*2022
Off-policy fitted q-evaluation with differentiable function approximators: Z-estimation and inference theory
R Zhang, X Zhang, C Ni, M Wang
International Conference on Machine Learning, 26713-26749, 2022
182022
Reward-directed conditional diffusion: Provable distribution estimation and reward improvement
H Yuan, K Huang, C Ni, M Chen, M Wang
Advances in Neural Information Processing Systems 36, 2024
132024
Learning Good State and Action Representations for Markov Decision Process via Tensor Decomposition
C Ni, Y Duan, M Dahleh, M Wang, AR Zhang
Journal of Machine Learning Research 24 (115), 1-53, 2023
10*2023
Optimal estimation of policy gradient via double fitted iteration
C Ni, R Zhang, X Ji, X Zhang, M Wang
International Conference on Machine Learning, 16724-16783, 2022
4*2022
Maximum likelihood tensor decomposition of Markov decision process
C Ni, M Wang
2019 IEEE International Symposium on Information Theory (ISIT), 3062-3066, 2019
32019
Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization
H Yuan, C Ni, H Wang, X Zhang, L Cong, C Szepesvári, M Wang
Advances in Neural Information Processing Systems, 2022
22022
Diffusion Model for Data-Driven Black-Box Optimization
Z Li, H Yuan, K Huang, C Ni, Y Ye, M Chen, M Wang
arXiv preprint arXiv:2403.13219, 2024
12024
Cell2State: Learning Cell State Representations From Barcoded Single-Cell Gene-Expression Transitions
Y Wu, JC Kim, C Ni, L Cong, M Wang
2021
The system can't perform the operation now. Try again later.
Articles 1–11