Follow
Yale Song
Yale Song
FAIR, Meta
Verified email at csail.mit.edu - Homepage
Title
Cited by
Cited by
Year
TVSum: Summarizing web videos using titles
Y Song, J Vallmitjana, A Stent, A Jaimes
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2015
8052015
Learning from noisy labels with distillation
Y Li, J Yang, Y Song, L Cao, J Luo, LJ Li
Proceedings of the IEEE International Conference on Computer Vision, 1910-1918, 2017
6862017
TGIF-QA: Toward spatio-temporal reasoning in visual question answering
Y Jang, Y Song, Y Yu, Y Kim, G Kim
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
5952017
Polysemous visual-semantic embedding for cross-modal retrieval
Y Song, M Soleymani
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019
3072019
TGIF: A new dataset and benchmark on animated gif description
Y Li, Y Song, L Cao, J Tetreault, L Goldberg, A Jaimes, J Luo
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016
3062016
Video co-summarization: Video summarization by visual co-occurrence
WS Chu, Y Song, A Jaimes
Proceedings of the IEEE conference on computer vision and pattern …, 2015
2862015
Improving pairwise ranking for multi-label image classification
Y Li, Y Song, J Luo
Proceedings of the IEEE conference on computer vision and pattern …, 2017
2782017
# FluxFlow: Visual analysis of anomalous information spreading on social media
J Zhao, N Cao, Z Wen, Y Song, YR Lin, C Collins
IEEE transactions on visualization and computer graphics 20 (12), 1773-1782, 2014
2372014
Continuous body and hand gesture recognition for natural human-computer interaction
Y Song, D Demirdjian, R Davis
ACM Transactions on Interactive Intelligent Systems (TiiS) 2 (1), 5, 2012
2162012
Video2GIF: Automatic generation of animated gifs from video
M Gygli, Y Song, L Cao
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016
194*2016
Tracking body and hands for gesture recognition: Natops aircraft handling signals database
Y Song, D Demirdjian, R Davis
2011 IEEE International Conference on Automatic Face & Gesture Recognition …, 2011
1622011
Fast, cheap, and good: Why animated GIFs engage us
S Bakhshi, DA Shamma, L Kennedy, Y Song, P de Juan, JJ Kaye
Proceedings of the 2016 chi conference on human factors in computing systems …, 2016
1392016
Action recognition by hierarchical sequence summarization
Y Song, LP Morency, R Davis
Proceedings of the IEEE conference on computer vision and pattern …, 2013
1382013
To click or not to click: Automatic selection of beautiful thumbnails from videos
Y Song, M Redi, J Vallmitjana, A Jaimes
Proceedings of the 25th ACM International on Conference on Information and …, 2016
1292016
Active Contrastive Learning of Audio-Visual Video Representations
S Ma, Z Zeng, D McDuff, Y Song
International Conference on Learning Representations, 2021
1242021
Computerized system and method for automatically detecting and rendering highlights from streaming videos
Y Song, J Vallmitjana
US Patent 10,390,082, 2019
1052019
Multi-view latent variable discriminative models for action recognition
Y Song, LP Morency, R Davis
2012 IEEE Conference on Computer Vision and Pattern Recognition, 2120-2127, 2012
1052012
Ego-Exo4D: Understanding skilled human activity from first-and third-person perspectives
K Grauman, A Westbury, L Torresani, K Kitani, J Malik, T Afouras, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
1002024
Parameter Efficient Multimodal Transformers for Video Representation Learning
S Lee, Y Yu, G Kim, T Breuel, J Kautz, Y Song
International Conference on Learning Representations, 2021
952021
Multimodal Human Behavior Analysis: Learning Correlation and Interaction Across Modalities
Y Song, LP Morency, R Davis
Proceedings of the 14th ACM international conference on Multimodal …, 2012
922012
The system can't perform the operation now. Try again later.
Articles 1–20