Jinfa Huang
University of Rochester, Peking University
Verified email at ur.rochester.edu
Title
Cited by
Year
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
B Lin, Z Tang, Y Ye, J Cui, B Zhu, P Jin, J Huang, J Zhang, M Ning, ...
arXiv preprint arXiv:2401.15947, 2024
Cited by: 126, Year: 2024
A Survey of Large Language Models in Medicine: Principles, Applications, and Challenges
H Zhou, F Liu, B Gu, X Zou, J Huang, J Wu, Y Li, SS Chen, P Zhou, J Liu, ...
arXiv preprint arXiv:2311.05112, 2023
Cited by: 78*, Year: 2023
Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
P Jin, J Huang, F Liu, X Wu, S Ge, G Song, D Clifton, J Chen
NeurIPS 2022, Spotlight, 30291-30306, 2022
Cited by: 62, Year: 2022
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning
P Jin, J Huang, P Xiong, S Tian, C Liu, X Ji, L Yuan, J Chen
CVPR 2023, Highlight, 2023
Cited by: 60, Year: 2023
Weakly-Supervised 3D Spatial Reasoning for Text-based Visual Question Answering
H Li, J Huang, P Jin, G Song, Q Wu, J Chen
IEEE Transactions on Image Processing 32, 3367-3382, 2023
Cited by: 34*, Year: 2023
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment
P Jin, H Li, Z Cheng, J Huang, Z Wang, L Yuan, C Liu, J Chen
IJCAI 2023, 2023
Cited by: 31, Year: 2023
GPT-4V(ision) as a Social Media Analysis Engine
H Lyu, J Huang, D Zhang, Y Yu, X Mou, J Pan, Z Yang, Z Wei, J Luo
ACM Transactions on Intelligent Systems and Technology (TIST), 2023
Cited by: 27, Year: 2023
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
S Yuan, J Huang, Y Shi, Y Xu, R Zhu, B Lin, X Cheng, L Yuan, J Luo
arXiv preprint arXiv:2404.05014, 2024
Cited by: 23, Year: 2024
Guoym at SemEval-2020 task 8: Ensemble-based Classification of Visuo-lingual Metaphor in Memes
Y Guo, J Huang, Y Dong, M Xu
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 1120-1125, 2020
Cited by: 20, Year: 2020
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
S Yuan, J Huang, Y Xu, Y Liu, S Zhang, Y Shi, R Zhu, X Cheng, J Luo, ...
NeurIPS 2024 D&B Spotlight, 2024
Cited by: 13, Year: 2024
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference
Z Wan, Z Wu, C Liu, J Huang, Z Zhu, P Jin, L Wang, L Yuan
EMNLP 2024 Findings, 2024
Cited by: 11, Year: 2024
Continuous-Multiple Image Outpainting in One-Step via Positional Query and A Diffusion-based Approach
S Zhang, J Huang, Q Zhou, Z Wang, F Wang, J Luo, J Yan
ICLR 2024, 2024
Cited by: 8, Year: 2024
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter
M Cao, H Tang, J Huang, P Jin, C Zhang, R Liu, L Chen, X Liang, L Yuan, ...
ACL 2024 Findings, 2024
Cited by: 6, Year: 2024
LLMBind: A unified modality-task integration framework
B Zhu, M Ning, P Jin, B Lin, J Huang, Q Song, J Zhang, Z Tang, M Pan, ...
arXiv preprint arXiv:2402.14891, 2024
Cited by: 6, Year: 2024
Improving Scene Graph Generation with Superpixel-Based Interaction Learning
J Wang, C Zhang, J Huang, B Ren, Z Deng
ACMMM 2023, 2023
Cited by: 6, Year: 2023
Cross-Modality Time-Variant Relation Learning for Generating Dynamic Scene Graphs
J Wang, J Huang, C Zhang, Z Deng
ICRA 2023, 2023
Cited by: 5, Year: 2023
LDNN: Linguistic Knowledge Injectable Deep Neural Network for Group Cohesiveness Understanding
Y Wang, J Wu, J Huang, G Hattori, Y Takishima, S Wada, R Kimura, ...
Proceedings of the 2020 International Conference on Multimodal Interaction …, 2020
Cited by: 4, Year: 2020
MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval
H Tang, M Cao, J Huang, R Liu, P Jin, G Li, X Liang
arXiv preprint arXiv:2408.10575, 2024
Cited by: 2, Year: 2024
Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection
J Huang, J Pan, Z Wan, H Lyu, J Luo
COLING 2025, 2024
Cited by: 2, Year: 2024
Identity-Preserving Text-to-Video Generation by Frequency Decomposition
S Yuan, J Huang, X He, Y Ge, Y Shi, L Chen, J Luo, L Yuan
arXiv preprint arXiv:2411.17440, 2024
Year: 2024