XML Data Representation in Document Image Analysis
Autor: | Y. Rangoni, A. Belaid, Ingrid Falk |
---|---|
Přispěvatelé: | READ (READ), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS)-Institut National de Recherche en Informatique et en Automatique (Inria)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS), Natural Language Processing: representation, inference and semantics (TALARIS), Inria Nancy - Grand Est, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS), IAPR, Flavio Bortolozzi and Robert Sabourin |
Rok vydání: | 2007 |
Předmět: |
Computer science
computer.internet_protocol Process (engineering) 02 engineering and technology XSLT External Data Representation Document Class Model Reverse Engineering METS 0202 electrical engineering electronic engineering information engineering Document Image Analysis and Recognition ALTO computer.programming_language 060201 languages & linguistics Focus (computing) Information retrieval Representation (systemics) [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV] 06 humanities and the arts XML TEI Digital library [INFO.INFO-TT]Computer Science [cs]/Document and Text Processing 0602 languages and literature 020201 artificial intelligence & image processing computer |
Zdroj: | ICDAR Proc. ICDAR'07 9th International Conference on Document Analysis and Recognition-ICDAR'07 9th International Conference on Document Analysis and Recognition-ICDAR'07, IAPR, Sep 2007, Curitiba, Brazil. pp.78-82 |
ISSN: | 1520-5363 |
Popis: | International audience; This paper presents the XML-based formats ALTO, TEI, METS used for Digital Libraries and their interest for data representation in a Document Image Analysis and Recognition (DIAR) process. In the first part we briefly present these formats with focus on their adequacy for structural representation and modeling of DIAR data. The second part shows how these formats can be used in a reverse engineering process. Their implementation as a data representation framework will be shown. |
Databáze: | OpenAIRE |
Externí odkaz: |