Chunking in Turkish with conditional random fields
Author: | Olcay Taner Yildiz, Razieh Ehsani, Onur Görgün, Ercan Solak |
---|---|
Contributors: | Işık Üniversitesi, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü (Işık University, Faculty of Engineering, Department of Computer Engineering); Yıldız, Olcay Taner; Solak, Ercan; Ehsani, Razieh; Görgün, Onur |
Language: | English |
Year of publication: | 2015 |
Subject: | Conditional random field; Computer science; Turkish; Theory & Methods; Treebank; Text processing; Computational linguistics; Artificial intelligence; Chunking (psychology); Part-of-speech tagging; Speech; Random processes; Forestry; Linguistics; Robotics; Resolution (logic); Noun phrase; Natural language processing systems; Translation (languages); Natural language processing; Information Systems |
Source: | Computational Linguistics and Intelligent Text Processing; ISBN: 9783319181103; CICLing (1) |
Description: | In this paper, we report our work on chunking in Turkish. We used data that we generated by manually translating a subset of the Penn Treebank. We exploited the tags already available in the trees to automatically identify and label chunks in their Turkish translations. We used conditional random fields (CRF) to train a model over the annotated data. We report our results at different levels of chunk resolution. Publisher's Version. |
Database: | OpenAIRE |