ANALYSIS OF WHISPER AUTOMATIC SPEECH RECOGNITION PERFORMANCE ON LOW RESOURCE LANGUAGE

Autor: Riefkyanov Surya Adia Pratama, Agit Amrullah
Jazyk: English<br />Indonesian
Rok vydání: 2024
Předmět:
Zdroj: Pilar Nusa Mandiri, Vol 20, Iss 1, Pp 1-8 (2024)
Druh dokumentu: article
ISSN: 1978-1946
2527-6514
DOI: 10.33480/pilar.v20i1.4633
Popis: Implementing Automatic Speech Recognition Technology in daily life could give convenience to its users. However, speeches that can be recognized accurately by the ASR model right now are in languages considered high resources, like English. In previous research, a few regional languages like Javanese, Sundanese, Balinese and Btaknese are used in automatic speech recognition. This research aim is to improve speech recognition using the ASR model on low-resource language. The dataset used in this research is the Javanese dataset specifically because there is a high-quality Javanese speech dataset provided by previous research. The method used is fine-tuning the Whisper model which has been trained on 680,000 hours of multilingual voice data using a Javanese speech dataset. To reduce computation requirements, parameter efficient fine-tuning (PEFT) implemented in the fine-tuning process. The trainable parameter is reduced to
Databáze: Directory of Open Access Journals