Action recognition using context and appearance distribution features X Wu, D Xu, L Duan, J Luo CVPR 2011, 489-496, 2011 | 285 | 2011 |
Discriminative human action recognition in the learned hierarchical manifold space L Han, X Wu, W Liang, G Hou, Y Jia Image and Vision Computing 28 (5), 836-849, 2010 | 131 | 2010 |
Joint syntax representation learning and visual cue translation for video captioning J Hou, X Wu, W Zhao, J Luo, Y Jia Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 106 | 2019 |
Learning normal patterns via adversarial attention-based autoencoder for abnormal event detection in videos H Song, C Sun, X Wu, M Chen, Y Jia IEEE Transactions on Multimedia 22 (8), 2138-2148, 2019 | 95 | 2019 |
Memcap: Memorizing style knowledge for image captioning W Zhao, X Wu, X Zhang Proceedings of the AAAI Conference on Artificial Intelligence 34 (07), 12984 …, 2020 | 89 | 2020 |
View-invariant action recognition using latent kernelized structural SVM X Wu, Y Jia Computer Vision–ECCV 2012: 12th European Conference on Computer Vision …, 2012 | 81 | 2012 |
Joint commonsense and relation reasoning for image and video captioning J Hou, X Wu, X Zhang, Y Qi, Y Jia, J Luo Proceedings of the AAAI conference on artificial intelligence 34 (07), 10973 …, 2020 | 73* | 2020 |
Action recognition using multilevel features and latent structural SVM X Wu, D Xu, L Duan, J Luo, Y Jia IEEE transactions on Circuits and Systems for Video Technology 23 (8), 1422-1431, 2013 | 67 | 2013 |
Cross-view action recognition over heterogeneous feature spaces X Wu, H Wang, C Liu, Y Jia Proceedings of the IEEE International Conference on Computer Vision, 609-616, 2013 | 67 | 2013 |
Content-attention representation by factorized action-scene network for action recognition J Hou, X Wu, Y Sun, Y Jia IEEE Transactions on Multimedia 20 (6), 1537-1547, 2017 | 57 | 2017 |
Cross-domain image captioning via cross-modal retrieval and model adaptation W Zhao, X Wu, J Luo IEEE Transactions on Image Processing 30, 1180-1192, 2020 | 55 | 2020 |
Boosting entity-aware image captioning with multi-modal knowledge graph W Zhao, X Wu IEEE Transactions on Multimedia, 2023 | 52 | 2023 |
Domain adversarial reinforcement learning for partial domain adaptation J Chen, X Wu, L Duan, S Gao IEEE Transactions on Neural Networks and Learning Systems 33 (2), 539-553, 2020 | 51 | 2020 |
Meta-causal learning for single domain generalization J Chen, Z Gao, X Wu, J Luo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 41 | 2023 |
Incremental discriminative-analysis of canonical correlations for action recognition X Wu, W Liang, Y Jia 2009 IEEE 12th international conference on computer vision, 2035-2041, 2009 | 38* | 2009 |
Spatial–temporal relation reasoning for action prediction in videos X Wu, R Wang, J Hou, H Lin, J Luo International Journal of Computer Vision 129 (5), 1484-1505, 2021 | 36 | 2021 |
Exploiting images for video recognition: Heterogeneous feature augmentation via symmetric adversarial learning F Yu, X Wu, J Chen, L Duan IEEE Transactions on Image Processing 28 (11), 5308-5321, 2019 | 35 | 2019 |
Multi-modal dependency tree for video captioning W Zhao, X Wu, J Luo Advances in Neural Information Processing Systems 34, 6634-6645, 2021 | 32 | 2021 |
Temporal action localization in untrimmed videos using action pattern trees H Song, X Wu, B Zhu, Y Wu, M Chen, Y Jia IEEE transactions on multimedia 21 (3), 717-730, 2018 | 31 | 2018 |
Exploiting informative video segments for temporal action localization C Sun, H Song, X Wu, Y Jia, J Luo IEEE Transactions on Multimedia 24, 274-287, 2021 | 28 | 2021 |