Improving Arabic morphological analyzers benchmark
Autor: | Abdellah Yousfi, Rachida Tajmout, Karim Bouzoubaa, Younes Jaafar, Hakima Khamar |
---|---|
Rok vydání: | 2016 |
Předmět: |
Linguistics and Language
Machine translation Computer science 02 engineering and technology computer.software_genre Language and Linguistics Task (project management) Set (abstract data type) 0202 electrical engineering electronic engineering information engineering Decision-making Parsing business.industry 05 social sciences Benchmarking Human-Computer Interaction Benchmark (computing) 020201 artificial intelligence & image processing Computer Vision and Pattern Recognition Artificial intelligence Metric (unit) 0509 other social sciences 050904 information & library sciences business computer Software Natural language processing |
Zdroj: | International Journal of Speech Technology. 19:259-267 |
ISSN: | 1572-8110 1381-2416 |
Popis: | The various tools dedicated to Arabic natural language processing have undergone significant development during recent years. Among these tools, Arabic morphological analyzers are of great importance because they are often used within other projects that are more advanced such as syntactic parsers, search engines, machine translation systems, etc. Thus, researchers are forced to make a decision concerning which morphological analyzer to use in their research projects, and this task is very difficult since there are many criteria to take into account. In order to facilitate this choice, we considered the problem of benchmarking morphological analyzers in a previous work by proposing a solution that allows returning a set of metrics of each analyzer that are: accuracy, precision, recall, F-measure and the execution time. In this article, we present two new major improvements to our solution: the establishment of the first version of our corpus that is dedicated to the evaluation of morphological analyzers, as well as the introduction of a new metric, which combines all metrics related to results as well as the execution time of the analyzers. |
Databáze: | OpenAIRE |
Externí odkaz: |