Sequential prediction of social media popularity with deep temporal context networks B Wu, WH Cheng, Y Zhang, Q Huang, J Li, T Mei arXiv preprint arXiv:1712.04443, 2017 | 136 | 2017 |
Audio captioning transformer X Mei, X Liu, Q Huang, MD Plumbley, W Wang arXiv preprint arXiv:2107.09817, 2021 | 80 | 2021 |
Conditional sound generation using neural discrete time-frequency representation learning X Liu, T Iqbal, J Zhao, Q Huang, MD Plumbley, W Wang 2021 IEEE 31st International Workshop on Machine Learning for Signal …, 2021 | 53 | 2021 |
An encoder-decoder based audio captioning system with transfer and reinforcement learning X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ... arXiv preprint arXiv:2108.02752, 2021 | 52 | 2021 |
Separate what you describe: Language-queried audio source separation X Liu, H Liu, Q Kong, X Mei, J Zhao, Q Huang, MD Plumbley, W Wang arXiv preprint arXiv:2203.15147, 2022 | 47 | 2022 |
CL4AC: A contrastive loss for audio captioning X Liu, Q Huang, X Mei, T Ko, HL Tang, MD Plumbley, W Wang arXiv preprint arXiv:2107.09990, 2021 | 33 | 2021 |
Leveraging pre-trained bert for audio captioning X Liu, X Mei, Q Huang, J Sun, J Zhao, H Liu, MD Plumbley, V Kilic, ... 2022 30th European Signal Processing Conference (EUSIPCO), 1145-1149, 2022 | 32 | 2022 |
A feature generalization framework for social media popularity prediction K Wang, P Wang, X Chen, Q Huang, Z Mao, Y Zhang Proceedings of the 28th ACM international conference on multimedia, 4570-4574, 2020 | 27 | 2020 |
Token-level supervised contrastive learning for punctuation restoration Q Huang, T Ko, HL Tang, X Liu, B Wu arXiv preprint arXiv:2107.09099, 2021 | 26 | 2021 |
An encoder-decoder based audio captioning system with transfer and reinforcement learning for DCASE challenge 2021 task 6 X Mei, Q Huang, X Liu, G Chen, J Wu, Y Wu, J Zhao, S Li, T Ko, HL Tang, ... DCASE2021 Challenge, Tech. Rep, Tech. Rep, 2021 | 17 | 2021 |
Visually-aware audio captioning with adaptive audio-visual attention X Liu, Q Huang, X Mei, H Liu, Q Kong, J Sun, S Li, T Ko, Y Zhang, ... arXiv preprint arXiv:2210.16428, 2022 | 15 | 2022 |
Personalized dialogue generation with persona-adaptive attention Q Huang, Y Zhang, T Ko, X Liu, B Wu, W Wang, H Tang Proceedings of the AAAI Conference on Artificial Intelligence 37 (11), 12916 …, 2023 | 13 | 2023 |
Retrieval-augmented text-to-audio generation Y Yuan, H Liu, X Liu, Q Huang, MD Plumbley, W Wang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 12 | 2024 |
Wavjourney: Compositional audio creation with large language models X Liu, Z Zhu, H Liu, Y Yuan, M Cui, Q Huang, J Liang, Y Cao, Q Kong, ... arXiv preprint arXiv:2307.14335, 2023 | 11 | 2023 |
Learning retrieval augmentation for personalized dialogue generation Q Huang, S Fu, X Liu, W Wang, T Ko, Y Zhang, L Tang arXiv preprint arXiv:2406.18847, 2024 | 4 | 2024 |
SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge B Wu, P Liu, WH Cheng, B Liu, Z Zeng, J Wang, Q Huang, J Luo Proceedings of the 31st ACM International Conference on Multimedia, 9651-9655, 2023 | 1 | 2023 |
Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language Models S Fu, X Wang, Q Huang, Y Zhang arXiv preprint arXiv:2408.13979, 2024 | | 2024 |
Selective Prompting Tuning for Personalized Conversations with LLMs Q Huang, X Liu, T Ko, B Wu, W Wang, Y Zhang, L Tang arXiv preprint arXiv:2406.18187, 2024 | | 2024 |
Reproducibility Companion Paper: Recommendation of Mix-and-Match Clothing by Modeling Indirect Personal Compatibility S Liao, Y Ding, PY Mok, Q Huang, J Cao Proceedings of the 2024 International Conference on Multimedia Retrieval …, 2024 | | 2024 |
基于迁移学习与强化学习的自动音频标注系统 G Chen, S Li, X Shao, X Mei, X Liu, Q Huang, W Wang Journal of Fudan University (Natural Science), 520-526, 2022 | | 2022 |