SCATTER: Selective context attentional scene text recognizer R Litman, O Anschel, S Tsiper, R Litman, S Mazor, R Manmatha Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 157 | 2020 |
Sequence-to-sequence contrastive learning for text recognition A Aberdam, R Litman, S Tsiper, O Anschel, R Slossberg, S Mazor, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 114 | 2021 |
LaTr: Layout-aware transformer for scene-text VQA AF Biten, R Litman, Y Xie, S Appalaraju, R Manmatha Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 86 | 2022 |
Multimodal semi-supervised learning for text recognition A Aberdam, R Ganz, S Mazor, R Litman arXiv preprint arXiv:2205.03873, 2022 | 20 | 2022 |
Out-of-vocabulary challenge report S Garcia-Bordils, A Mafla, AF Biten, O Nuriel, A Aberdam, S Mazor, ... European Conference on Computer Vision, 359-375, 2022 | 16 | 2022 |
TextAdaIN: Paying attention to shortcut learning in text recognizers O Nuriel, S Fogel, R Litman European Conference on Computer Vision, 427-445, 2022 | 15* | 2022 |
On calibration of scene-text recognition models R Slossberg, O Anschel, A Markovitz, R Litman, A Aberdam, S Tsiper, ... European Conference on Computer Vision, 263-279, 2022 | 10 | 2022 |
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition A Aberdam, D Bensaïd, A Golts, R Ganz, O Nuriel, R Tichauer, S Mazor, ... Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023 | 9 | 2023 |
Towards Models that Can See and Read R Ganz, O Nuriel, A Aberdam, Y Kittenplon, S Mazor, R Litman Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023 | 8 | 2023 |
Question Aware Vision Transformer for Multimodal Reasoning R Ganz, Y Kittenplon, A Aberdam, EB Avraham, O Nuriel, S Mazor, ... arXiv preprint arXiv:2402.05472, 2024 | | 2024 |
GRAM: Global Reasoning for Multi-Page VQA T Blau, S Fogel, R Ronen, A Golts, R Ganz, EB Avraham, A Aberdam, ... arXiv preprint arXiv:2401.03411, 2024 | | 2024 |
M3T: A new benchmark dataset for multi-modal document-level machine translation B Hsu, X Liu, H Li, Y Fujinuma, M Nădejde, X Niu, Y Kittenplon, R Litman, ... | | 2024 |
Residual context refinement network architecture for optical character recognition R Litman, O Anschel, S Tsiper, R Litman, S Mazor, J Wu, R Manmatha US Patent 11,308,354, 2022 | | 2022 |
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition Supplementary Material A Aberdam, D Bensaïd, A Golts, R Ganz, O Nuriel, R Tichauer, S Mazor, ... | | |
Towards Models that Can See and Read Supplementary Material R Ganz, O Nuriel, A Aberdam, Y Kittenplon, S Mazor, R Litman | | |
LaTr: Layout-Aware Transformer for Scene-Text VQA Supplementary Material AF Biten, R Litman, Y Xie, S Appalaraju, R Manmatha | | |
Supplementary Material: Sequence-to-Sequence Contrastive Learning for Text Recognition A Aberdam, R Litman, S Tsiper, O Anschel, R Slossberg, S Mazor, ... | | |
SCATTER: Selective Context Attentional Scene Text Recognizer Supplementary Materials R Litman, O Anschel, S Tsiper, R Litman, S Mazor, R Manmatha | | |