Building Corpora of Spoken Filipino Words Using Speech Segmentation with Automatic Labeling

Autor:	Felizardo C. Reyes, Arnel C. Fajardo
Rok vydání:	2019
Předmět:	Application programming interface business.industry Computer science Deep learning Cloud computing Python (programming language) computer.software_genre Speech segmentation Data set Filipino Language Segmentation Artificial intelligence business computer Natural language processing computer.programming_language
Zdroj:	2019 International Conference on ICT for Smart Society (ICISS).
DOI:	10.1109/iciss48059.2019.8969813
Popis:	There are hardly any open access speech corpora in Filipino that are structured and can be used to train speech recognition systems that utilize deep learning models. The existing data sets of audios in Filipino language are in forms that require pre-processing and cleansing. This paper proposed a method that would allow building up of Filipino words corpora which can be used as data set for deep learning speech recognition systems. The method utilized speech segmentation technique to extract words from a sound file composed of sentences, words, and syllables. Three trials with variances on the length of silence were conducted to increase accuracy of segmentation and create a more robust Filipino words corpus. Using several Python tools combined with Google Cloud Speech Recognition Application Program Interface, automatic speech segmentation with automatic labeling for Filipino language was achieved with 95% accuracy.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::e47d7ea58b26a39ca17b91dc6e61cece https://doi.org/10.1109/iciss48059.2019.8969813 Zobrazit plný text záznamu