A high accuracy method for semi-supervised information extraction

Autor: Antonio Sanfilippo, Stephen Tratz
Rok vydání: 2007
Předmět:
Zdroj: HLT-NAACL (Short Papers)
Popis: Customization to specific domains of discourse and/or user requirements is one of the greatest challenges for today's Information Extraction (IE) systems. While demonstrably effective, both rule-based and supervised machine learning approaches to IE customization pose too high a burden on the user. Semi-supervised learning approaches may in principle offer a more resource effective solution but are still insufficiently accurate to grant realistic application. We demonstrate that this limitation can be overcome by integrating fully-supervised learning techniques within a semi-supervised IE approach, without increasing resource requirements.
Databáze: OpenAIRE