Follow
AJ Piergiovanni
AJ Piergiovanni
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Pali: A jointly-scaled multilingual language-image model
X Chen, X Wang, S Changpinyo, AJ Piergiovanni, P Padlewski, D Salz, ...
arXiv preprint arXiv:2209.06794, 2022
3902022
Representation flow for action recognition
AJ Piergiovanni, MS Ryoo
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019
1872019
Evolving losses for unsupervised video representation learning
AJ Piergiovanni, A Angelova, MS Ryoo
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
1532020
Tokenlearner: Adaptive space-time tokenization for videos
M Ryoo, AJ Piergiovanni, A Arnab, M Dehghani, A Angelova
Advances in neural information processing systems 34, 12786-12797, 2021
1272021
Assemblenet: Searching for multi-stream neural connectivity in video architectures
MS Ryoo, AJ Piergiovanni, M Tan, A Angelova
arXiv preprint arXiv:1905.13209, 2019
1102019
F-vlm: Open-vocabulary object detection upon frozen vision and language models
W Kuo, Y Cui, X Gu, AJ Piergiovanni, A Angelova
arXiv preprint arXiv:2209.15639, 2022
1012022
Tokenlearner: What can 8 learned tokens do for images and videos?
MS Ryoo, AJ Piergiovanni, A Arnab, M Dehghani, A Angelova
arXiv preprint arXiv:2106.11297, 2021
1012021
Learning latent super-events to detect multiple activities in videos
AJ Piergiovanni, MS Ryoo
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
1012018
Temporal gaussian mixture layer for videos
AJ Piergiovanni, M Ryoo
International Conference on Machine learning, 5152-5161, 2019
972019
Fine-grained activity recognition in baseball videos
AJ Piergiovanni, MS Ryoo
Proceedings of the ieee conference on computer vision and pattern …, 2018
862018
Pali-x: On scaling up a multilingual vision and language model
X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ...
arXiv preprint arXiv:2305.18565, 2023
802023
Evolving space-time neural architectures for videos
AJ Piergiovanni, A Angelova, A Toshev, MS Ryoo
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
762019
Learning latent subevents in activity videos using temporal attention filters
A Piergiovanni, C Fan, M Ryoo
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
622017
4d-net for learned multi-modal alignment
AJ Piergiovanni, V Casser, MS Ryoo, A Angelova
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
582021
Attentionnas: Spatiotemporal attention cell search for video classification
X Wang, X Xiong, M Neumann, AJ Piergiovanni, MS Ryoo, A Angelova, ...
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
522020
Tiny video networks
AJ Piergiovanni, A Angelova, MS Ryoo
Applied AI Letters 3 (1), e38, 2022
502022
Assemblenet++: Assembling modality representations via attention connections
MS Ryoo, AJ Piergiovanni, J Kangaspunta, A Angelova
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
452020
Learning real-world robot policies by dreaming
AJ Piergiovanni, A Wu, MS Ryoo
2019 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2019
382019
Avid dataset: Anonymized videos from diverse countries
AJ Piergiovanni, M Ryoo
Advances in Neural Information Processing Systems 33, 16711-16721, 2020
372020
Rethinking video vits: Sparse video tubes for joint image and video learning
AJ Piergiovanni, W Kuo, A Angelova
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
352023
The system can't perform the operation now. Try again later.
Articles 1–20