Acoustic Scene Classification using Deep Learning Architectures

Autor: Shashidhar G. Koolagudi, Spoorthy., Manjunath Mulimani
Rok vydání: 2021
Předmět:
Zdroj: 2021 6th International Conference for Convergence in Technology (I2CT).
DOI: 10.1109/i2ct51068.2021.9418177
Popis: Enabling devices to make sense of sound is known as Acoustic Scene Classification (ASC). The analysis of various scenes by applying computational algorithms is known as computational auditory scene analysis. The main aim of this paper is to classify audio recordings based on the scenes/environment in which they are recorded. Deep learning is amongst the recent trends in most of the applications. In this paper, two deep learning algorithms are used to perform the classification of acoustic scenes, namely Convolution Neural Network (CNN) and Convolution-Recurrent Neural Network (CRNN). The model is evaluated on three activation functions, namely, ReLU, LeakyReLU and ELU. The highest recognition accuracy achieved for ASC task is 90.96% from CRNN model. The model performed well on basic convolution architecture with 10.9% improvement from the baseline system of this task.
Databáze: OpenAIRE