Named entity recognition in a South African context
Autor: | Alta de Waal, Cobus Venter, Anita Louis |
---|---|
Rok vydání: | 2006 |
Předmět: |
Computer science
business.industry Probabilistic logic Bayesian network Context (language use) computer.software_genre Domain (software engineering) Information extraction Named-entity recognition Text types Artificial intelligence business computer Natural language processing Dynamic Bayesian network |
Zdroj: | Proceedings of the 2006 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing couuntries - SAICSIT '06. |
DOI: | 10.1145/1216262.1216281 |
Popis: | The feasibility of a probabilistic Named Entity Recognition system in a South African context was tested. The intended use of the system is in a cyber forensic domain. At the core of the system is a dynamic Bayesian Network, which takes into account the probabilistic relationship between variables as well as contextual information. We illustrate the performance of such a system using different probability thresholds for classification purposes and compare the performance with and without a name gazetteer. Our system compares competently with similar existing systems in the information extraction domain. Future work will involve the application of the system in the cyber forensic environment, which poses new challenges such as diverse text types. |
Databáze: | OpenAIRE |
Externí odkaz: |