Document image characterization using a multiresolution analysis of the texture: application to old documents

Autor: Journet, Nicholas, Ramel, Jean-Yves, Mullot, Rémy, Eglin, Véronique
Zdroj: International Journal on Document Analysis and Recognition; 20240101, Issue: Preprints p1-10, 10p
Abstrakt: Abstract: In this article, we propose a method of characterization of images of old documents based on a texture approach. This characterization is carried out with the help of a multi-resolution study of the textures contained in the images of the document. Thus, by extracting five features linked to the frequencies and to the orientations in the different areas of a page, it is possible to extract and compare elements of high semantic level without expressing any hypothesis about the physical or logical structure of the analyzed documents. Experimentation based on segmentation, data analysis and document image retrieval tools demonstrate the performance of our propositions and the advances that they represent in terms of characterization of content of a deeply heterogeneous corpus.
Databáze: Supplemental Index