Document retro-conversion for personalized electronic reedition

Authors: Rangoni, Yves, Belaïd, Abdel, Alusse, André, Cecotti, Hubert, Farah, Fady, Gagean, Nicolas, Fiala, Dalibor, Rousselot, François, Vigne, Henri
Contributors: READ (READ), Laboratoire Lorrain de Recherche en Informatique et ses Applications (LORIA), Institut National de Recherche en Informatique et en Automatique (Inria)-Université Henri Poincaré - Nancy 1 (UHP)-Université Nancy 2-Institut National Polytechnique de Lorraine (INPL)-Centre National de la Recherche Scientifique (CNRS), Laboratoire d'Informatique et d'Intelligence Artificielle (LIIA), Institut National des Sciences Appliquées - Strasbourg (INSA Strasbourg), Institut National des Sciences Appliquées (INSA), EVER TEAM, Umapada Pal, Swapan K. Parui, Bidyut B. Chaudhuri, Rangoni, Yves
Language: English
Year of publication: 2005
Subject:
Source: International Workshop on Document Analysis, Umapada Pal, Mar 2005, Kolkata, India
Description: In this paper, we propose a generic framework to store, retrieve, transform and present mixed sets of native and virtual documents. We intend to use or develop specific tools organized in a global architecture that spans document analysis and capture, document retrieval and classification-categorization, and the full generation of personal sets of documents matching the user's specific needs and profile. The first step concerns document preparation and formal analysis. The second step adds semantic metadata, content indexing, and structure-semantic analysis. The third step helps the user compose personalized documents. The research is based on large domain-specific sets of documents, for example European Union law documents (many millions of documents, in many file formats and twenty official languages).
Database: OpenAIRE