Named entity recognition in a South African context

Autor: Alta de Waal, Cobus Venter, Anita Louis
Rok vydání: 2006
Předmět:
Zdroj: Proceedings of the 2006 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing couuntries - SAICSIT '06.
DOI: 10.1145/1216262.1216281
Popis: The feasibility of a probabilistic Named Entity Recognition system in a South African context was tested. The intended use of the system is in a cyber forensic domain. At the core of the system is a dynamic Bayesian Network, which takes into account the probabilistic relationship between variables as well as contextual information. We illustrate the performance of such a system using different probability thresholds for classification purposes and compare the performance with and without a name gazetteer. Our system compares competently with similar existing systems in the information extraction domain. Future work will involve the application of the system in the cyber forensic environment, which poses new challenges such as diverse text types.
Databáze: OpenAIRE