Scatter: selective context attentional scene text recognizer R Litman, O Anschel, S Tsiper, R Litman, S Mazor, R Manmatha proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020 | 186 | 2020 |
Sequence-to-sequence contrastive learning for text recognition A Aberdam, R Litman, S Tsiper, O Anschel, R Slossberg, S Mazor, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 155 | 2021 |
Latr: Layout-aware transformer for scene-text vqa AF Biten, R Litman, Y Xie, S Appalaraju, R Manmatha Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 114 | 2022 |
Out-of-vocabulary challenge report S Garcia-Bordils, A Mafla, AF Biten, O Nuriel, A Aberdam, S Mazor, ... European Conference on Computer Vision, 359-375, 2022 | 26 | 2022 |
Multimodal semi-supervised learning for text recognition A Aberdam, R Ganz, S Mazor, R Litman arXiv preprint arXiv:2205.03873, 2022 | 26 | 2022 |
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition A Aberdam, D Bensaīd, A Golts, R Ganz, O Nuriel, R Tichauer, S Mazor, ... Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023 | 20 | 2023 |
Textadain: Paying attention to shortcut learning in text recognizers O Nuriel, S Fogel, R Litman European Conference on Computer Vision, 427-445, 2022 | 19* | 2022 |
Question aware vision transformer for multimodal reasoning R Ganz, Y Kittenplon, A Aberdam, E Ben Avraham, O Nuriel, S Mazor, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 17 | 2024 |
Towards Models that Can See and Read R Ganz, O Nuriel, A Aberdam, Y Kittenplon, S Mazor, R Litman Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023 | 15 | 2023 |
On calibration of scene-text recognition models R Slossberg, O Anschel, A Markovitz, R Litman, A Aberdam, S Tsiper, ... European Conference on Computer Vision, 263-279, 2022 | 14 | 2022 |
GRAM: Global reasoning for multi-page VQA T Blau, S Fogel, R Ronen, A Golts, R Ganz, E Ben Avraham, A Aberdam, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 8 | 2024 |
M3T: A new benchmark dataset for multi-modal document-level machine translation B Hsu, X Liu, H Li, Y Fujinuma, M Nadejde, X Niu, Y Kittenplon, R Litman, ... arXiv preprint arXiv:2406.08255, 2024 | 3 | 2024 |
VisFocus: Prompt-guided vision encoders for ocr-free dense document understanding O Abramovich, N Nayman, S Fogel, I Lavi, R Litman, S Tsiper, R Tichauer, ... European Conference on Computer Vision, 241-259, 2024 | 1 | 2024 |
DocVLM: Make Your VLM an Efficient Reader MS Nacson, A Aberdam, R Ganz, EB Avraham, A Golts, Y Kittenplon, ... arXiv preprint arXiv:2412.08746, 2024 | | 2024 |
DocVLM: Make Your VLM an Efficient Reader M Shpigel Nacson, A Aberdam, R Ganz, E Ben Avraham, A Golts, ... arXiv e-prints, arXiv: 2412.08746, 2024 | | 2024 |
TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models J Fhima, EB Avraham, O Nuriel, Y Kittenplon, R Ganz, A Aberdam, ... arXiv preprint arXiv:2411.04642, 2024 | | 2024 |
Residual context refinement network architecture for optical character recognition R Litman, O Anschel, S Tsiper, R Litman, S Mazor, J Wu, R Manmatha US Patent 11,308,354, 2022 | | 2022 |
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition Supplementary Material A Aberdam, D Bensaıd, A Golts, R Ganz, O Nuriel, R Tichauer, S Mazor, ... | | |
Towards Models that Can See and Read Supplementary Material R Ganz, O Nuriel, A Aberdam, Y Kittenplon, S Mazor, R Litman | | |
LaTr: Layout-Aware Transformer for Scene-Text VQA Supplementary Material AF Biten, R Litman, Y Xie, S Appalaraju, R Manmatha | | |