Sherry Yang
Google DeepMind, UC Berkeley
Verified email at google.com
Title
Cited by
Year
Graphit: A high-performance graph dsl
Y Zhang, M Yang, R Baghdadi, S Kamil, J Shun, S Amarasinghe
Proceedings of the ACM on Programming Languages 2 (OOPSLA), 1-30, 2018
Cited by: 220*; Year: 2018
Multi-game decision transformers
KH Lee, O Nachum, MS Yang, L Lee, D Freeman, S Guadarrama, ...
Advances in Neural Information Processing Systems 35, 27921-27936, 2022
Cited by: 162; Year: 2022
Benchmarking attribution methods with relative feature importance
M Yang, B Kim
Workshop on Human Centered AI at Advances in Neural Information Processing …, 2019
Cited by: 148*; Year: 2019
Representation Matters: Offline Pretraining for Sequential Decision Making
M Yang, O Nachum
International Conference on Machine Learning 139, 11784-11794, 2021
Cited by: 121; Year: 2021
Off-policy evaluation via the regularized lagrangian
M Yang, O Nachum, B Dai, L Li, D Schuurmans
Advances in Neural Information Processing Systems 33, 2020
Cited by: 102; Year: 2020
Foundation models for decision making: Problems, methods, and opportunities
M Yang, O Nachum, Y Du, J Wei, P Abbeel, D Schuurmans
arXiv preprint arXiv:2303.04129, 2023
Cited by: 80; Year: 2023
Learning Universal Policies via Text-Guided Video Generation
Y Du*, M Yang*, B Dai, H Dai, O Nachum, J Tenenbaum, D Schuurmans, ...
Advances in Neural Information Processing Systems 36, 2023
Cited by: 79; Year: 2023
Benchmarks for Deep Off-Policy Evaluation
J Fu, M Norouzi, O Nachum, G Tucker, Z Wang, A Novikov, M Yang, ...
International Conference on Learning Representations, 2021
Cited by: 73; Year: 2021
Offline RL for Natural Language Generation with Implicit Language Q Learning
C Snell, I Kostrikov, Y Su, M Yang, S Levine
International Conference on Learning Representations 2023, 2022
Cited by: 63; Year: 2022
Combiner: Full Attention Transformer with Sparse Computation Cost
H Ren, H Dai, Z Dai, M Yang, J Leskovec, D Schuurmans, B Dai
Advances in Neural Information Processing Systems 34, 2021
Cited by: 62; Year: 2021
Trail: Near-optimal imitation learning with suboptimal data
M Yang, S Levine, O Nachum
International Conference on Learning Representations, 2021
Cited by: 43; Year: 2021
Offline policy selection under uncertainty
M Yang, B Dai, O Nachum, G Tucker, D Schuurmans
International Conference on Artificial Intelligence and Statistics, 4376-4396, 2022
Cited by: 37; Year: 2022
Provable Representation Learning for Imitation with Contrastive Fourier Features
O Nachum, M Yang
Advances in Neural Information Processing Systems 34, 2021
Cited by: 37; Year: 2021
Dichotomy of control: Separating what you can control from what you cannot
M Yang, D Schuurmans, P Abbeel, O Nachum
International Conference on Learning Representations 2023, 2022
Cited by: 33; Year: 2022
Making linear mdps practical via contrastive representation learning
T Zhang, T Ren, M Yang, J Gonzalez, D Schuurmans, B Dai
International Conference on Machine Learning, 26447-26466, 2022
Cited by: 33; Year: 2022
CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning
S Verma, J Fu, M Yang, S Levine
Findings of the Association for Computational Linguistics: NAACL 2022, 2022
Cited by: 33; Year: 2022
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
Cited by: 30; Year: 2024
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
C Snell, M Yang, J Fu, Y Su, S Levine
Findings of the Association for Computational Linguistics: NAACL 2022, 2022
Cited by: 19; Year: 2022
Towards Automatic Evaluation of Dialog Systems: A Model-Free Off-Policy Evaluation Approach
H Jiang, B Dai, M Yang, T Zhao, W Wei
Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021
Cited by: 18; Year: 2021
Chain of thought imitation with procedure cloning
M Yang, D Schuurmans, P Abbeel, O Nachum
Advances in Neural Information Processing Systems 35, 36366-36381, 2022
Cited by: 17; Year: 2022