SCATTER: Selective context attentional scene text recognizer R Litman, O Anschel, S Tsiper, R Litman, S Mazor, R Manmatha Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 173 | 2020 |
Sequence-to-sequence contrastive learning for text recognition A Aberdam, R Litman, S Tsiper, O Anschel, R Slossberg, S Mazor, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 141 | 2021 |
LaTr: Layout-aware transformer for scene-text VQA AF Biten, R Litman, Y Xie, S Appalaraju, R Manmatha Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 104 | 2022 |
Out-of-vocabulary challenge report S Garcia-Bordils, A Mafla, AF Biten, O Nuriel, A Aberdam, S Mazor, ... European Conference on Computer Vision, 359-375, 2022 | 23 | 2022 |
Multimodal semi-supervised learning for text recognition A Aberdam, R Ganz, S Mazor, R Litman arXiv preprint arXiv:2205.03873, 2022 | 22 | 2022 |
TextAdaIN: Paying attention to shortcut learning in text recognizers O Nuriel, S Fogel, R Litman European Conference on Computer Vision, 427-445, 2022 | 17 | 2022 |
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition A Aberdam, D Bensaïd, A Golts, R Ganz, O Nuriel, R Tichauer, S Mazor, ... Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023 | 16 | 2023 |
Towards Models that Can See and Read R Ganz, O Nuriel, A Aberdam, Y Kittenplon, S Mazor, R Litman Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023 | 13 | 2023 |
On calibration of scene-text recognition models R Slossberg, O Anschel, A Markovitz, R Litman, A Aberdam, S Tsiper, ... European Conference on Computer Vision, 263-279, 2022 | 12 | 2022 |
Question aware vision transformer for multimodal reasoning R Ganz, Y Kittenplon, A Aberdam, E Ben Avraham, O Nuriel, S Mazor, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 8 | 2024 |
GRAM: Global reasoning for multi-page VQA T Blau, S Fogel, R Ronen, A Golts, R Ganz, E Ben Avraham, A Aberdam, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 5 | 2024 |
VisFocus: Prompt-guided vision encoders for OCR-free dense document understanding O Abramovich, N Nayman, S Fogel, I Lavi, R Litman, S Tsiper, R Tichauer, ... arXiv preprint arXiv:2407.12594, 2024 | | 2024 |
M3T: A new benchmark dataset for multi-modal document-level machine translation B Hsu, X Liu, H Li, Y Fujinuma, M Nadejde, X Niu, Y Kittenplon, R Litman, ... arXiv preprint arXiv:2406.08255, 2024 | | 2024 |
Residual context refinement network architecture for optical character recognition R Litman, O Anschel, S Tsiper, R Litman, S Mazor, J Wu, R Manmatha US Patent 11,308,354, 2022 | | 2022 |
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition Supplementary Material A Aberdam, D Bensaïd, A Golts, R Ganz, O Nuriel, R Tichauer, S Mazor, ... | | |
Towards Models that Can See and Read Supplementary Material R Ganz, O Nuriel, A Aberdam, Y Kittenplon, S Mazor, R Litman | | |
LaTr: Layout-Aware Transformer for Scene-Text VQA Supplementary Material AF Biten, R Litman, Y Xie, S Appalaraju, R Manmatha | | |
Supplementary Material: Sequence-to-Sequence Contrastive Learning for Text Recognition A Aberdam, R Litman, S Tsiper, O Anschel, R Slossberg, S Mazor, ... | | |
SCATTER: Selective Context Attentional Scene Text Recognizer Supplementary Materials R Litman, O Anschel, S Tsiper, R Litman, S Mazor, R Manmatha | | |