Semantic Role Labeling System for Persian Language

Authors: Fatemeh Sedghi, Azadeh Mirzaei, Pegah Safari
Year of publication: 2020
Subject:
Source: ACM Transactions on Asian and Low-Resource Language Information Processing. 19:1-12
ISSN: 2375-4702, 2375-4699
DOI: 10.1145/3372246
Description: In this article, we present an automatic semantic role labeling system for Persian consisting of two modules: argument identification, which specifies argument spans, and argument classification, which categorizes their semantic roles. Both modules were trained on the Persian Proposition Bank, in which predicate-argument information is manually added as a layer on top of the Persian Dependency Treebank of about 30,000 sentences. The training data thus comprise 216,871 verbal predicates and 42,386 nonverbal ones (40,813 nouns and 1,573 adjectives) with 33 semantic classes. As supervised methods, we used maximum entropy to build an argument identifier, which reaches a human-level accuracy of 99%, and a support vector machine to build an argument classifier, which reaches an F1 of 84. Covering both verbal and nonverbal predicates with an expanded role set, we achieved reasonable results.
Database: OpenAIRE
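
The two-module design named in the description (argument identification followed by argument classification) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Token` fields, the `label_roles` function, the toy decision rules, and the `Arg0`/`Arg1`/`ArgM` labels are all hypothetical stand-ins, and the English example sentence is used only for readability. In the described system the first stage would be a trained maximum entropy model and the second a trained support vector machine; the sketch also assumes that an argument is represented by its head token in a dependency tree.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Token:
    index: int   # position in the sentence
    form: str    # surface word
    head: int    # index of the dependency head (-1 for the root)
    deprel: str  # dependency relation to the head


# Stage 1: binary decision, "is this token an argument of the predicate?"
Identifier = Callable[[List[Token], int, Token], bool]
# Stage 2: multi-class decision, "which semantic role does the argument fill?"
Classifier = Callable[[List[Token], int, Token], str]


def label_roles(
    sentence: List[Token],
    predicate: int,
    identify: Identifier,
    classify: Classifier,
) -> List[Tuple[int, str]]:
    """Run identification, then classify only the identified arguments."""
    roles = []
    for tok in sentence:
        if tok.index == predicate:
            continue  # the predicate is not its own argument
        if identify(sentence, predicate, tok):
            roles.append((tok.index, classify(sentence, predicate, tok)))
    return roles


# Hypothetical rule-based stand-ins for the two trained models.
def toy_identify(sentence: List[Token], predicate: int, tok: Token) -> bool:
    # Assumption: treat direct dependents of the predicate as arguments.
    return tok.head == predicate


def toy_classify(sentence: List[Token], predicate: int, tok: Token) -> str:
    # Assumption: map a dependency relation to an illustrative role label.
    return {"nsubj": "Arg0", "obj": "Arg1"}.get(tok.deprel, "ArgM")


sentence = [
    Token(0, "Sara", 1, "nsubj"),
    Token(1, "read", -1, "root"),
    Token(2, "the", 3, "det"),
    Token(3, "book", 1, "obj"),
]

result = label_roles(sentence, predicate=1, identify=toy_identify, classify=toy_classify)
print(result)  # [(0, 'Arg0'), (3, 'Arg1')]
```

Keeping the two stages behind separate callables mirrors the split in the description: the classifier only ever sees tokens that the identifier accepted, so each stage can be trained and evaluated on its own.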