Zobrazeno 1 - 10
of 8 982
pro vyhledávání: '"layout analysis"'
Handwritten document recognition (HDR) is one of the most challenging tasks in the field of computer vision, due to the various writing styles and complex layouts inherent in handwritten texts. Traditionally, this problem has been approached as two s
Externí odkaz:
http://arxiv.org/abs/2412.18981
Autor:
Clérice, Thibault, Janes, Juliette, Scheithauer, Hugo, Bénière, Sarah, Cafiero, Florian, Romary, Laurent, Gabay, Simon, Sagot, Benoît
We present a novel, open-access dataset designed for semantic layout analysis, built to support document recreation workflows through mapping with the Text Encoding Initiative (TEI) standard. This dataset includes 7,254 annotated pages spanning a lar
Externí odkaz:
http://arxiv.org/abs/2411.10068
The advent of multimodal learning has brought a significant improvement in document AI. Documents are now treated as multimodal entities, incorporating both textual and visual information for downstream analysis. However, works in this space are ofte
Externí odkaz:
http://arxiv.org/abs/2412.12902
Document Layout Analysis is crucial for real-world document understanding systems, but it encounters a challenging trade-off between speed and accuracy: multimodal methods leveraging both text and visual features achieve higher accuracy but suffer fr
Externí odkaz:
http://arxiv.org/abs/2410.12628
Scientific posters are used to present the contributions of scientific papers effectively in a graphical format. However, creating a well-designed poster that efficiently summarizes the core of a paper is both labor-intensive and time-consuming. A sy
Externí odkaz:
http://arxiv.org/abs/2407.19787
Autor:
Sheikh, Talha Uddin, Shehzadi, Tahira, Hashmi, Khurram Azeem, Stricker, Didier, Afzal, Muhammad Zeshan
Document layout analysis is a key area in document research, involving techniques like text mining and visual analysis. Despite various methods developed to tackle layout analysis, a critical but frequently overlooked problem is the scarcity of label
Externí odkaz:
http://arxiv.org/abs/2406.06236
Document layout analysis (DLA) is crucial for understanding the physical layout and logical structure of documents, serving information retrieval, document summarization, knowledge extraction, etc. However, previous studies have typically used separa
Externí odkaz:
http://arxiv.org/abs/2405.11757
Autor:
Bi, Tianci, Zhang, Xiaoyi, Zhang, Zhizheng, Xie, Wenxuan, Lan, Cuiling, Lu, Yan, Zheng, Nanning
Significant progress has been made in scene text detection models since the rise of deep learning, but scene text layout analysis, which aims to group detected text instances as paragraphs, has not kept pace. Previous works either treated text detect
Externí odkaz:
http://arxiv.org/abs/2405.07481
Document layout analysis involves understanding the arrangement of elements within a document. This paper navigates the complexities of understanding various elements within document images, such as text, images, tables, and headings. The approach em
Externí odkaz:
http://arxiv.org/abs/2404.17888