AAYUDHA: A Tool for Automatic Segmentation and Labelling of Continuous Tamil Speech

Authors: M Suguna, B R Laxmi Sree
Year of publication: 2016
Subject:
Source: International Journal of Computer Applications. 143:31-35
ISSN: 0975-8887
DOI: 10.5120/ijca2016910002
Description: Speech, an effective means of communication between humans, is becoming an alternative means of communication between humans and machines. It is now used in many real-time systems for faster, easier, and more comfortable interaction. Speech segmentation and labelling are key processes that determine the accuracy of much speech-related research. This paper proposes a tool, "AAYUDHA", that performs automatic segmentation and labelling of continuous Tamil speech. Two segmentation algorithms are implemented: one based on a Fast Fourier Transform (FFT) feature set and 2D filtering, and the other based on a Discrete Wavelet Transform (DWT) feature set and its energy variation across sub-bands. The segmentation accuracy of both algorithms is analyzed. The segmented speech is then labelled using a baseline Hidden Markov Model (HMM) based acoustic model. A speech corpus named "KAZHANGIYAM" was created, containing recorded Tamil speech from various speakers together with manual segmentations of those recordings. The corpus is used to evaluate the accuracy of the algorithms in the proposed tool. The tool focuses on phonetic-level segmentation of Tamil speech and shows acceptable segmentation and labelling accuracy.
Database: OpenAIRE
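
Illustrative note: the description mentions a segmentation algorithm driven by DWT features and their energy variation across sub-bands, but the record gives no details. The following is only a minimal sketch of that general idea, not the authors' method. The Haar wavelet, the frame length, the number of decomposition levels, the threshold, and all function names are assumptions chosen for illustration.

```python
import math


def haar_dwt(signal):
    """One level of the Haar DWT: returns (approximation, detail)."""
    s = math.sqrt(2.0)
    pairs = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) / s for i in pairs]
    detail = [(signal[i] - signal[i + 1]) / s for i in pairs]
    return approx, detail


def subband_energies(frame, levels=3):
    """Energy of each detail band, followed by the final approximation band."""
    energies = []
    current = list(frame)
    for _ in range(levels):
        current, detail = haar_dwt(current)
        energies.append(sum(d * d for d in detail))
    energies.append(sum(a * a for a in current))
    return energies


def candidate_boundaries(samples, frame_len=64, threshold=0.5, levels=3):
    """Sample indices where the normalised sub-band energy profile
    changes sharply between adjacent, non-overlapping frames."""
    profiles = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        e = subband_energies(samples[start:start + frame_len], levels)
        total = sum(e) or 1.0  # avoid division by zero on silent frames
        profiles.append([x / total for x in e])

    boundaries = []
    for k in range(1, len(profiles)):
        # L1 distance between consecutive energy profiles
        change = sum(abs(a - b) for a, b in zip(profiles[k], profiles[k - 1]))
        if change > threshold:
            boundaries.append(k * frame_len)
    return boundaries


# Synthetic input (hypothetical): a low-frequency tone followed by a
# high-frequency alternation, standing in for two adjacent speech segments.
low = [math.sin(2 * math.pi * i / 64) for i in range(256)]
high = [1.0 if i % 2 == 0 else -1.0 for i in range(256)]
print(candidate_boundaries(low + high))
```

On real speech, overlapping frames, a smoother wavelet, and a data-driven threshold would likely be needed. The sketch only shows how a shift of energy between sub-bands can flag a candidate boundary.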