Audio classification and categorization based on wavelets and support vector Machine
Autor: | Shi-Huang Chen, Yukon Chang, Chien-Chang Lin, Trieu-Kien Truong |
---|---|
Rok vydání: | 2005 |
Předmět: |
Audio mining
Audio signal Acoustics and Ultrasonics Computer science business.industry Speech recognition Feature extraction Pattern recognition computer.software_genre Support vector machine ComputingMethodologies_PATTERNRECOGNITION Wavelet Categorization Computer Vision and Pattern Recognition Mel-frequency cepstrum Artificial intelligence Electrical and Electronic Engineering business Audio signal processing computer Software |
Zdroj: | IEEE Transactions on Speech and Audio Processing. 13:644-651 |
ISSN: | 1063-6676 |
DOI: | 10.1109/tsa.2005.851880 |
Popis: | In this paper, an improved audio classification and categorization technique is presented. This technique makes use of wavelets and support vector machines (SVMs) to accurately classify and categorize audio data. When a query audio is given, wavelets are first applied to extract acoustical features such as subband power and pitch information. Then, the proposed method uses a bottom-up SVM over these acoustical features and additional parameters, such as frequency cepstral coefficients, to accomplish audio classification and categorization. A public audio database (Muscle Fish), which consists of 410 sounds in 16 classes, is used to evaluate the performances of the proposed method against other similar schemes. Experimental results show that the classification errors are reduced from 16 (8.1%) to six (3.0%), and the categorization accuracy of a given audio sound can achieve 100% in the Top 2 matches. |
Databáze: | OpenAIRE |
Externí odkaz: |