Cross-Correlation Based Automated Segmentation of Audio Samples

Autor:	Emilian-Erman, Mahmut, Stelian, Nicola, Vasile, Stoicu-Tivadar
Rok vydání:	2020
Předmět:	Hearing Humans Language
Zdroj:	Studies in health technology and informatics. 272
ISSN:	1879-8365
Popis:	This paper presents an audio file segmentation method in an attempt to mitigate the issue of variable durations of the same utterance by different individuals, e.g.: Speech-Language Pathologist (SLP) and dyslalic subjects. The Method section describes the manner of determination of the maximum cross-correlation value between the 2 audio files and the subsequent automated segmentation thereof in order to extract 2 valid pronunciation samples of the target consonant. The method is aimed at pre-processing audio files and supplying homogeneously-trimmed audio samples to a computerized SSD Screening system. The results obtained on a batch of 30 pronunciations are presented and briefly discussed in the third section while the last section is reserved for conclusions and perspectives.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=pmid________::5fae2028a510713e6bd786b997a3e1ab https://pubmed.ncbi.nlm.nih.gov/32604646 Zobrazit plný text záznamu