Author: |
Journet, Nicholas; Ramel, Jean-Yves; Mullot, Rémy; Eglin, Véronique |
Source: |
International Journal on Document Analysis and Recognition; 20240101, Issue: Preprints p1-10, 10p |
Abstract: |
In this article, we propose a texture-based method for characterizing images of old documents. The characterization relies on a multi-resolution analysis of the textures contained in the document images. By extracting five features related to the frequencies and orientations in the different areas of a page, it is possible to extract and compare elements of high semantic level without making any hypothesis about the physical or logical structure of the analyzed documents. Experiments based on segmentation, data analysis, and document image retrieval tools demonstrate the performance of our proposals and the advances they represent in characterizing the content of a deeply heterogeneous corpus. |
Database: |
Supplemental Index |