Developing ASR for Indonesian-English Bilingual Language Teaching

Autor:	Ben Foley, Zara Maxwelll-Smith
Rok vydání:	2021
Předmět:	Data collection business.industry Computer science Context (language use) Language acquisition computer.software_genre Bottleneck language.human_language Indonesian language Language education Compiler Artificial intelligence Transcription (software) business computer Natural language processing
Zdroj:	Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching.
Popis:	Usage-based analyses of teacher corpora and code-switching (Boztepe, 2003) are an important next stage in understanding language acquisition. Multilingual corpora are difficult to compile and a classroom setting adds pedagogy to the mix of factors which make this data so rich and problematic to classify. Using quantitative methods to understand language learning and teaching is difficult work as the ‘transcription bottleneck’ constrains the size of datasets. We found that using an automatic speech recognition (ASR) toolkit with a small set of training data is likely to speed data collection in this context (Maxwelll-Smith et al., 2020).
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::e69d1cac9b2ff18152b0d4a441896408 https://doi.org/10.18653/v1/2021.calcs-1.17 Zobrazit plný text záznamu