On Building the Largest and Cross-Linguistic Turkish Dependency Corpus

Authors: Merve Özçelik, Busra Marsan, Asli Kuzgun, Bilge Nas Arican, Neslihan Kara, Olcay Taner Yildiz, Neslihan Cesur, Deniz Baran Aslan
Contributors: Işık University, Faculty of Engineering, Department of Computer Engineering; Yıldız, Olcay Taner
Publication year: 2020
Source: 2020 Innovations in Intelligent Systems and Applications Conference (ASYU).
Description: In this paper, we describe the dependency annotation process for the largest, and the only cross-linguistic, Turkish dependency treebank, which was translated from the original Penn Treebank corpus. Within the scope of this project, 16,400 sentences were morphologically and semantically annotated, and the dependency relations were annotated manually by a team of linguists. We hope that this project will serve as a basis for a successful dependency parser and for a system that can automatically perform bidirectional conversion between constituency and dependency trees. Publisher's Version
Database: OpenAIRE