עקוב אחר
Qinqing Zheng
Qinqing Zheng
Meta
כתובת אימייל מאומתת בדומיין meta.com - דף הבית
כותרת
צוטט על ידי
צוטט על ידי
שנה
Online decision transformer
Q Zheng, A Zhang, A Grover
International Conference on Machine Learning 162, 27042--27059, 2022
2422022
A convergent gradient descent algorithm for rank minimization and semidefinite programming from random linear measurements
Q Zheng, J Lafferty
Advances in Neural Information Processing Systems, 109--117, 2015
2212015
Convergence analysis for rectangular matrix completion using Burer-Monteiro factorization and gradient descent
Q Zheng, J Lafferty
arXiv preprint arXiv:1605.07051, 2016
1832016
Federated f-differential privacy
Q Zheng, S Chen, Q Long, W Su
(AISTATS 2021) International conference on artificial intelligence and …, 2021
662021
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
H Sikchi, Q Zheng, A Zhang, S Niekum
ICLR 2024, 2023
30*2023
Minimax Estimation for Personalized Federated Learning: An Alternative between FedAvg and Local Training?
S Chen, Q Zheng, Q Long, WJ Su
Journal of Machine Learning Research 24 (262), 1-59, 2023
23*2023
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
L Lehnert, S Sukhbaatar, DJ Su, Q Zheng, P Mcvay, M Rabbat, Y Tian
COLM 2024, 2024
222024
Semi-supervised offline reinforcement learning with action-free trajectories
Q Zheng, M Henaff, B Amos, A Grover
(ICML 2023) International Conference on Machine Learning, 42339-42362, 2023
202023
Sharp Composition Bounds for Gaussian Differential Privacy via Edgeworth Expansion
Q Zheng, J Dong, Q Long, WJ Su
(ICML 2020) International Conference on Machine Learning, 11420-11435, 2020
202020
Guided flows for generative modeling and decision making
Q Zheng, M Le, N Shaul, Y Lipman, A Grover, RTQ Chen
arXiv preprint arXiv:2311.13443, 2023
182023
Interpolating convex and non-convex tensor decompositions via the subspace norm
Q Zheng, R Tomioka
Advances in Neural Information Processing Systems, 3106-3113, 2015
162015
Diffusion world model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
Z Ding, A Zhang, Y Tian, Q Zheng
arXiv preprint arXiv:2402.03570, 2024
142024
Latent state marginalization as a low-cost approach for improving exploration
D Zhang, A Courville, Y Bengio, Q Zheng, A Zhang, RTQ Chen
ICLR 2023, 2022
122022
Near-Optimal Confidence Sequences for Bounded Random Variables
AK Kuchibhotla, Q Zheng
ICML 2021, 2021
102021
Reliable conditioning of behavioral cloning for offline reinforcement learning
T Nguyen, Q Zheng, A Grover
arXiv preprint arXiv:2210.05158, 2022
9*2022
Shadowsync: Performing synchronization in the background for highly scalable distributed training
Q Zheng, BY Su, J Yang, A Azzolini, Q Wu, O Jin, S Karandikar, ...
arXiv preprint arXiv:2003.03477, 2020
72020
Performing Synchronization in the Background for Highly Scalable Distributed Training
Q Zheng, SU Bor-Yiing, J Yang, AG Azzolini, Q Wu, O Jin
US Patent App. 16/989,131, 2022
22022
Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback
Q Zheng, M Henaff, A Zhang, A Grover, B Amos
arXiv preprint arXiv:2410.23022, 2024
2024
Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces
DJ Su, S Sukhbaatar, M Rabbat, Y Tian, Q Zheng
arXiv preprint arXiv:2410.09918, 2024
2024
Symmetric Factorization for Nonconvex Optimization
Q Zheng
2017
המערכת אינה יכולה לבצע את הפעולה כעת. נסה שוב מאוחר יותר.
מאמרים 1–20