Populating a Database from Parallel Texts Using Ontology-Based Information Extraction
Autor: | Hamish Cunningham, Susannah J. Lydon, Valentin Tablan, Mary McGee Wood, Diana Maynard |
---|---|
Rok vydání: | 2004 |
Předmět: |
Database
Computer science business.industry RDF Schema computer.file_format Ontology (information science) computer.software_genre Data structure Domain (software engineering) Information extraction Formal ontology Ontology Information system Artificial intelligence RDF business computer Natural language processing |
Zdroj: | Natural Language Processing and Information Systems ISBN: 9783540225645 NLDB |
DOI: | 10.1007/978-3-540-27779-8_22 |
Popis: | Legacy data in many mature descriptive sciences is distributed across multiple text descriptions. The challenge is both to extract this data, and to correlate it once extracted. The MultiFlora system does this using an established Information Extraction system tuned to the domain of botany and integrated with a formal ontology to structure and store the data. A range of output formats are supported through the W3C RDFS standard, making it simple to populate a database as desired. |
Databáze: | OpenAIRE |
Externí odkaz: |