Data-Efficient Classification of Birdcall Through Convolutional Neural Networks Transfer Learning

Autor: Dina B. Efremova, Mangalam Sankupellay, Dmitry A. Konovalov
Jazyk: angličtina
Rok vydání: 2019
Předmět:
FOS: Computer and information sciences
0106 biological sciences
Sound (cs.SD)
Computer science
Computer Vision and Pattern Recognition (cs.CV)
Computer Science - Computer Vision and Pattern Recognition
02 engineering and technology
010603 evolutionary biology
01 natural sciences
Convolutional neural network
Computer Science - Sound
Domain (software engineering)
Set (abstract data type)
Audio and Speech Processing (eess.AS)
FOS: Electrical engineering
electronic engineering
information engineering

0202 electrical engineering
electronic engineering
information engineering

Training set
business.industry
Deep learning
Image and Video Processing (eess.IV)
Pattern recognition
Electrical Engineering and Systems Science - Image and Video Processing
Base (topology)
Multimedia (cs.MM)
Spectrogram
020201 artificial intelligence & image processing
Artificial intelligence
Transfer of learning
business
Computer Science - Multimedia
Electrical Engineering and Systems Science - Audio and Speech Processing
Zdroj: DICTA
Popis: Deep learning Convolutional Neural Network (CNN) models are powerful classification models but require a large amount of training data. In niche domains such as bird acoustics, it is expensive and difficult to obtain a large number of training samples. One method of classifying data with a limited number of training samples is to employ transfer learning. In this research, we evaluated the effectiveness of birdcall classification using transfer learning from a larger base dataset (2814 samples in 46 classes) to a smaller target dataset (351 samples in 10 classes) using the ResNet-50 CNN. We obtained 79% average validation accuracy on the target dataset in 5-fold cross-validation. The methodology of transfer learning from an ImageNet-trained CNN to a project-specific and a much smaller set of classes and images was extended to the domain of spectrogram images, where the base dataset effectively played the role of the ImageNet.
Accepted for IEEE Digital Image Computing: Techniques and Applications, 2019 (DICTA 2019), 2-4 December 2019 in Perth, Australia, http://dicta2019.dictaconference.org/index.html
Databáze: OpenAIRE