Persian Phoneme and Syllable Recognition using Recurrent Neural Networks for Phonological Awareness Assessment
Author: | M. Khanzadi, H. Veisi, R. Alinaghizade, Z. Soleymani |
---|---|
Language: | English |
Publication year: | 2022 |
Subject: | |
Source: | Journal of Artificial Intelligence and Data Mining, Vol 10, Iss 1, Pp 117-126 (2022) |
Document type: | article |
ISSN: | 2322-5211 2322-4444 |
DOI: | 10.22044/jadm.2022.11185.2268 |
Description: | Weak phonological awareness (PA) skills are one of the main problems in children with learning difficulties, and PA tests are used to evaluate these skills. For the Persian language, this assessment is currently paper-based. To speed up the assessment and make it engaging for children, we propose a computer-based solution: a comprehensive Persian phonological awareness assessment system that implements expressive and pointing tasks. For the expressive tasks, the solution is powered by speech recognition systems based on recurrent neural networks. Several recognition modules are implemented: a phoneme recognition system for the phoneme segmentation task, a syllable recognition system for the syllable segmentation task, and a sub-word recognition system for three types of phoneme deletion task (initial, middle, and final). The recognition systems use bidirectional long short-term memory (BiLSTM) neural networks to construct acoustic models. To implement the recognition systems, we designed and collected the Persian Kid’s Speech Corpus, which is the largest Persian corpus of children’s speech. The accuracy was 85.5% for phoneme recognition and 89.4% for syllable recognition. For the initial, middle, and final phoneme deletion tasks, the accuracy rates were 96.76%, 98.21%, and 95.9%, respectively. |
Database: | Directory of Open Access Journals |
External link: |
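
The three phoneme deletion tasks named in the description can be illustrated with a minimal sketch of how a recognized response might be checked against the expected answer. This is not the authors' implementation: the function names `delete_phoneme` and `score_response`, the rule that the "middle" phoneme is the one at index `len // 2`, and the romanized phoneme symbols in the example are all assumptions made for illustration. The recognizer itself is left out; the sketch starts from a phoneme sequence that a recognition system is assumed to have already produced.

```python
from typing import List, Sequence


def delete_phoneme(phonemes: Sequence[str], position: str) -> List[str]:
    """Return the phoneme sequence with one phoneme removed.

    position is "initial", "middle", or "final". Treating index len // 2
    as the "middle" phoneme is an assumption of this sketch.
    """
    if len(phonemes) < 2:
        raise ValueError("need at least two phonemes to delete one")
    index = {
        "initial": 0,
        "middle": len(phonemes) // 2,
        "final": len(phonemes) - 1,
    }
    if position not in index:
        raise ValueError(f"unknown position: {position}")
    i = index[position]
    return list(phonemes[:i]) + list(phonemes[i + 1:])


def score_response(target: Sequence[str], position: str,
                   recognized: Sequence[str]) -> bool:
    """True if the recognized phonemes equal the target minus one phoneme."""
    return list(recognized) == delete_phoneme(target, position)


# Hypothetical romanized phoneme sequence, used only as example input.
word = ["k", "e", "t", "a", "b"]
initial_answer = delete_phoneme(word, "initial")
middle_answer = delete_phoneme(word, "middle")
final_answer = delete_phoneme(word, "final")
is_correct = score_response(word, "final", ["k", "e", "t", "a"])
```

An exact match is the strictest possible scoring rule. A real assessment system would likely need some tolerance for recognition errors, which this sketch does not attempt.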