Audio classification and categorization based on wavelets and support vector Machine

Autor:	Shi-Huang Chen, Yukon Chang, Chien-Chang Lin, Trieu-Kien Truong
Rok vydání:	2005
Předmět:	Audio mining Audio signal Acoustics and Ultrasonics Computer science business.industry Speech recognition Feature extraction Pattern recognition computer.software_genre Support vector machine ComputingMethodologies_PATTERNRECOGNITION Wavelet Categorization Computer Vision and Pattern Recognition Mel-frequency cepstrum Artificial intelligence Electrical and Electronic Engineering business Audio signal processing computer Software
Zdroj:	IEEE Transactions on Speech and Audio Processing. 13:644-651
ISSN:	1063-6676
DOI:	10.1109/tsa.2005.851880
Popis:	In this paper, an improved audio classification and categorization technique is presented. This technique makes use of wavelets and support vector machines (SVMs) to accurately classify and categorize audio data. When a query audio is given, wavelets are first applied to extract acoustical features such as subband power and pitch information. Then, the proposed method uses a bottom-up SVM over these acoustical features and additional parameters, such as frequency cepstral coefficients, to accomplish audio classification and categorization. A public audio database (Muscle Fish), which consists of 410 sounds in 16 classes, is used to evaluate the performances of the proposed method against other similar schemes. Experimental results show that the classification errors are reduced from 16 (8.1%) to six (3.0%), and the categorization accuracy of a given audio sound can achieve 100% in the Top 2 matches.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::282794ecbad8c654671c18057309285c https://doi.org/10.1109/tsa.2005.851880 Zobrazit plný text záznamu