Understanding deep learning (still) requires rethinking generalization C Zhang, S Bengio, M Hardt, B Recht, O Vinyals Communications of the ACM 64 (3), 107-115, 2021 | 7807 | 2021 |
Mxnet: A flexible and efficient machine learning library for heterogeneous distributed systems T Chen, M Li, Y Li, M Lin, N Wang, M Wang, T Xiao, B Xu, C Zhang, ... arXiv preprint arXiv:1512.01274, 2015 | 2904 | 2015 |
Transfusion: Understanding transfer learning for medical imaging M Raghu, C Zhang, J Kleinberg, S Bengio Advances in Neural Information Processing Systems, 2019 | 1402 | 2019 |
Unsupervised feature selection for multi-cluster data D Cai, C Zhang, X He Proceedings of the 16th ACM SIGKDD international conference on Knowledge …, 2010 | 1375 | 2010 |
Do vision transformers see like convolutional neural networks? M Raghu, T Unterthiner, S Kornblith, C Zhang, A Dosovitskiy Advances in neural information processing systems 34, 12116-12128, 2021 | 1162 | 2021 |
Training deep nets with sublinear memory cost T Chen, B Xu, C Zhang, C Guestrin arXiv preprint arXiv:1604.06174, 2016 | 1095 | 2016 |
Learning with a Wasserstein loss C Frogner, C Zhang, H Mobahi, M Araya, TA Poggio Advances in neural information processing systems 28, 2015 | 728 | 2015 |
Quantifying memorization across neural language models N Carlini, D Ippolito, M Jagielski, K Lee, F Tramer, C Zhang arXiv preprint arXiv:2202.07646, 2022 | 679 | 2022 |
Machine theory of mind N Rabinowitz, F Perbet, F Song, C Zhang, SMA Eslami, M Botvinick International conference on machine learning, 4218-4227, 2018 | 671 | 2018 |
What is being transferred in transfer learning? B Neyshabur, H Sedghi, C Zhang Advances in Neural Information Processing Systems, 2020 | 578 | 2020 |
Deduplicating training data makes language models better K Lee, D Ippolito, A Nystrom, C Zhang, D Eck, C Callison-Burch, N Carlini arXiv preprint arXiv:2107.06499, 2021 | 571 | 2021 |
A study on overfitting in deep reinforcement learning C Zhang, O Vinyals, R Munos, S Bengio arXiv preprint arXiv:1804.06893, 2018 | 505 | 2018 |
What neural networks memorize and why: Discovering the long tail via influence estimation V Feldman, C Zhang Advances in Neural Information Processing Systems, Spotlight, 2020 | 470 | 2020 |
Automated fault detection without seismic processing M Araya-Polo, T Dahlke, C Frogner, C Zhang, T Poggio, D Hohl The Leading Edge 36 (3), 208-214, 2017 | 305 | 2017 |
Are all layers created equal? C Zhang, S Bengio, Y Singer Journal of Machine Learning Research 23 (67), 1-28, 2022 | 188 | 2022 |
Deep learning with label differential privacy B Ghazi, N Golowich, R Kumar, P Manurangsi, C Zhang Advances in neural information processing systems 34, 27131-27145, 2021 | 167 | 2021 |
Counterfactual memorization in neural language models C Zhang, D Ippolito, K Lee, M Jagielski, F Tramèr, N Carlini Advances in Neural Information Processing Systems 36, 39321-39362, 2023 | 143 | 2023 |
Preventing verbatim memorization in language models gives a false sense of privacy D Ippolito, F Tramèr, M Nasr, C Zhang, M Jagielski, K Lee, ... arXiv preprint arXiv:2210.17546, 2022 | 136 | 2022 |
Theory of deep learning IIb: Optimization properties of SGD C Zhang, Q Liao, A Rakhlin, B Miranda, N Golowich, T Poggio arXiv preprint arXiv:1801.02254, 2018 | 122* | 2018 |
Characterizing Structural Regularities of Labeled Data in Overparameterized Models Z Jiang, C Zhang, K Talwar, MC Mozer International Conference on Machine Learning, Long Presentation, 2021 | 118* | 2021 |