On Building the Largest and Cross-Linguistic Turkish Dependency Corpus
Autor: | Merve Özçelik, Busra Marsan, Asli Kuzgun, Bilge Nas Arican, Neslihan Kara, Olcay Taner Yildiz, Neslihan Cesur, Deniz Baran Aslan |
---|---|
Přispěvatelé: | Işık Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü, Işık University, Faculty of Engineering, Department of Computer Engineering, Yıldız, Olcay Taner |
Rok vydání: | 2020 |
Předmět: |
Dependency relation
TreeBank Process (engineering) Turkish Computer science Treebank Dependency computer.software_genre Annotation Dependency grammar Intelligent systems Dependency parsing Scope (project management) business.industry Dependency parser Forestry Dependency trees language.human_language Turkishs language Artificial intelligence Syntactics business computer Bi-directional Natural language processing Treebanks Dependency (project management) |
Zdroj: | 2020 Innovations in Intelligent Systems and Applications Conference (ASYU). |
Popis: | In this paper, we aim to introduce the dependency annotation process of the largest and the only cross-linguistic Turkish dependency treebank which was translated from the original Penn Treebank corpus. Within the scope of this project, 16.400 sentences have been morphologically and semantically annotated, and the dependency relations were manually carried out by a team of linguists. It is hoped that this project will serve as a base for a successful dependency parser and a system which can automatically perform the bi-directional conversion between constituency and dependency trees. Publisher's Version |
Databáze: | OpenAIRE |
Externí odkaz: |