Grammatical Tagging of a Persian Corpus

Authors: S. Mostafa Assi, M. Haji Abdolhosseini
Publication year: 2000
Source: International Journal of Corpus Linguistics. 5:69-81
ISSN: 1569-9811, 1384-6655
DOI: 10.1075/ijcl.5.1.05ass
Description: This article briefly introduces an interactive part-of-speech (POS) tagging system developed at the Institute for Humanities and Cultural Studies in Tehran, Iran. The system forms part of the annotation procedure for a Persian corpus called the Farsi Linguistic Database (FLDB), which comprises a selection of contemporary Modern Persian literature, formal and informal spoken varieties of the language, and a series of dictionary entries and word lists (Assi 1997: 5). It is the first attempt ever to tag a Persian corpus. Section 1 introduces the project, Section 2 presents an evaluation of it, and Section 3 offers some suggestions for future work.
Database: OpenAIRE