Morphological Analysis for the Maltese Language: The Challenges of a Hybrid System
Autor: | Albert Gatt, Claudia Borg |
---|---|
Rok vydání: | 2017 |
Předmět: |
FOS: Computer and information sciences
Computer science ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION Verb computer.software_genre ComputingMethodologies_ARTIFICIALINTELLIGENCE Natural language processing (Computer science) Cluster analysis Computer Science - Computation and Language business.industry I.2.7 Corpora (Linguistics) Word (Linguistics) language.human_language Linguistic analysis (Linguistics) Maltese ComputingMethodologies_PATTERNRECOGNITION Hybrid system Morphological analysis language Artificial intelligence business Reference (Linguistics) computer Computation and Language (cs.CL) Natural language processing Word (computer architecture) |
Zdroj: | WANLP@EACL |
DOI: | 10.48550/arxiv.1703.08701 |
Popis: | Maltese is a morphologically rich language with a hybrid morphological system which features both concatenative and non-concatenative processes. This paper analyses the impact of this hybridity on the performance of machine learning techniques for morphological labelling and clustering. In particular, we analyse a dataset of morphologically related word clusters to evaluate the difference in results for concatenative and non-concatenative clusters. We also describe research carried out in morphological labelling, with a particular focus on the verb category. Two evaluations were carried out, one using an unseen dataset, and another one using a gold standard dataset which was manually labelled. The gold standard dataset was split into concatenative and non-concatenative to analyse the difference in results between the two morphological systems. non peer-reviewed |
Databáze: | OpenAIRE |
Externí odkaz: |