18th ICDAR 2024: Athens, Greece - Part IV
- Elisa H. Barney Smith, Marcus Liwicki, Liangrui Peng:
Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30 - September 4, 2024, Proceedings, Part IV. Lecture Notes in Computer Science 14807, Springer 2024, ISBN 978-3-031-70545-8
Layout Analysis and Document Classification
- Yamato Okamoto, Youngmin Baek, Geewook Kim, Ryota Nakao, DongHyun Kim, Moonbin Yim, Seunghyun Park, Bado Lee:
CREPE: Coordinate-Aware End-to-End Document Parser. 3-20
- Tahira Shehzadi, Didier Stricker, Muhammad Zeshan Afzal:
A Hybrid Approach for Document Layout Analysis in Document Images. 21-39
- Jiawei Wang, Kai Hu, Qiang Huo:
DLAFormer: An End-to-End Transformer For Document Layout Analysis. 40-57
- Francisco J. Castellanos, Juan P. Martinez-Esteso, Alejandro Galán-Cuenca, Antonio Javier Gallego:
A Region-Based Approach for Layout Analysis of Music Score Images in Scarce Data Scenarios. 58-75
- Qilin Deng, Mayire Ibrayim, Askar Hamdulla, Hailong Luo, Chunhu Zhang:
Doc-DINO: A Transformer Model for Complex Logical Document Layout Analysis. 76-89
- Lei Kang, Mohamed Ali Souibgui, Fei Yang, Lluís Gómez, Ernest Valveny, Dimosthenis Karatzas:
Machine Unlearning for Document Classification. 90-102
- Saifullah Saifullah, Stefan Agne, Andreas Dengel, Sheraz Ahmed:
DocXplain: A Novel Model-Agnostic Explainability Method for Document Image Classification. 103-123
- Sankalp Sinha, Muhammad Saif Ullah Khan, Talha Uddin Sheikh, Didier Stricker, Muhammad Zeshan Afzal:
CICA: Content-Injected Contrastive Alignment for Zero-Shot Document Image Classification. 124-141
- Marcel Lamott, Yves-Noel Weweler, Adrian Ulges, Faisal Shafait, Dirk Krechel, Darko Obradovic:
LAPDoc: Layout-Aware Prompting for Documents. 142-159
- Wiam Adnan, Joël Tang, Yassine Bel Khayat Zouggari, Seif Edinne Laatiri, Laurent Lam, Fabien Caspani:
A LayoutLMv3-Based Model for Enhanced Relation Extraction in Visually-Rich Documents. 160-174
- Anna Scius-Bertrand, Atefeh Fakhari, Lars Vögtlin, Daniel Ribeiro Cabral, Andreas Fischer:
Are Layout Analysis and OCR Still Useful for Document Information Extraction Using Foundation Models? 175-191
Machine Learning Methods
- Jordy Van Landeghem, Subhajit Maity, Ayan Banerjee, Matthew B. Blaschko, Marie-Francine Moens, Josep Lladós, Sanket Biswas:
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications. 195-217
- Martin Kiss, Michal Hradis:
Self-supervised Pre-training of Text Recognizers. 218-235
- Qiangang Pan, Yahong Hu, Youbai Xie, Xianghui Meng, Yilun Zhang:
Deep Learning-Driven Innovative Model for Generating Functional Knowledge Units. 236-252
- Wenjun Sun, Tran Thi Hong Hanh, Carlos-Emiliano González-Gallardo, Mickaël Coustaty, Antoine Doucet:
Global-SEG: Text Semantic Segmentation Based on Global Semantic Pair Relations. 253-269
- Omar Hamed, Souhail Bakkali, Matthew B. Blaschko, Sien Moens, Jordy Van Landeghem:
Multimodal Adaptive Inference for Document Image Classification with Anytime Early Exiting. 270-286
- Yiran Zhao, Di Wu, Shuqi Dai, Tong Li:
Integrating Dependency Type and Directionality into Adapted Graph Attention Networks to Enhance Relation Extraction. 287-305
- Manh-Tu Vu, Marie Beurton-Aimar:
ViT-ED: Transformer Network for Image Similarity Measurement. 306-323
- Jerod Weinman, Amelia Gómez Grabowska, Dimosthenis Karatzas:
Counting the Corner Cases: Revisiting Robust Reading Challenge Data Sets, Evaluation Protocols, and Metrics. 324-342
- Weiguang Zhang, Qiufeng Wang, Kaizhu Huang, Xiaomeng Gu, Fengjun Guo:
Coarse-to-Fine Document Image Registration for Dewarping. 343-358
- Daria M. Ershova, Alexander V. Gayer, Alexander Sheshkus, Vladimir V. Arlazarov:
An Ultra-lightweight Approach for Machine Readable Zone Detection via Semantic Segmentation and Fast Hough Transform. 359-374
- Tong Zhang, Jianing Zhang, Rong Yan:
Synergistic Diverse Perspective for Topic Evolution Analysis on Weibo. 375-388
- Sho Shimotsumagari, Shumpei Takezaki, Daichi Haraguchi, Seiichi Uchida:
Cross-Domain Image Conversion by CycleDM. 389-406
- Ahana Kundu, Ujjwal Bhattacharya:
YOLO Assisted A* Algorithm for Robust Line Segmentation of Degraded Document Images. 407-424
- George Retsinas, Konstantina Nikolaidou, Giorgos Sfikas:
Enhancing CRNN HTR Architectures with Transformer Blocks. 425-440
- Yujie Lu, Dean Wu, Yuhong Zhang:
Dynamic Reasoning with Language Model and Knowledge Graph for Question Answering. 441-455