Development of Saraiki WordNet by Mapping of Word Senses: A Corpus Based Approach.

Autor: Gul, Sarah, Azher, Musarrat, Nawaz, Sana
Předmět:
Zdroj: Linguistics & Literature Review (LLR); 2021, Vol. 7 Issue 2, p47-66, 20p
Abstrakt: The main focus of this paper is to develop the Saraiki WordNet. Saraiki is one of the regional languages spoken in Pakistan and has a unique history of its own. Saraiki language has remarkable similarity with two languages i.e. Punjabi and Sindhi. Saraiki has different dialects and they differ according to the region where they are spoken. This paper uses the Urdu WordNet (Zafar, Mahmood, Shams & Hussain, 2014) as the basis for the formation of Saraiki WordNet. Urdu WordNet (Zafar et al., 2014) is created by UET Lahore and is based on Princeton WordNet (Miller, 1990). Development of Saraiki WordNet is very significant with regard to Natural Language Processing (NLP). Dictionaries or Lugats and literary sources such as Poetry and Fiction and non-literary sources like Newspaper of Saraiki language are used for the data purposes and the Urdu word senses are mapped to Saraiki word senses. The method used in this study is mapping and expand approach is used in the mapping process. This study will prove significant in creating bilingual dictionaries in future and this work can be used for further advancement in procedure of the development of the bilingual dictionaries. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index