General-purpose Tagging of Freesound Audio with AudioSet Labels: Task Description, Dataset, and Baseline
Autor: | Fonseca, Eduardo, Plakal, Manoj, Font, Frederic, Ellis, Daniel P. W., Favory, Xavier, Pons Puig, Jordi, Serra, Xavier |
---|---|
Rok vydání: | 2018 |
Předmět: |
FOS: Computer and information sciences
Computer Science - Machine Learning Sound (cs.SD) Audio dataset Machine Learning (stat.ML) Audio tagging Computer Science - Sound Machine Learning (cs.LG) Statistics - Machine Learning Audio and Speech Processing (eess.AS) Data collection FOS: Electrical engineering electronic engineering information engineering Electrical Engineering and Systems Science - Audio and Speech Processing |
Zdroj: | Recercat. Dipósit de la Recerca de Catalunya instname |
DOI: | 10.48550/arxiv.1807.09902 |
Popis: | This paper describes Task 2 of the DCASE 2018 Challenge, titled "General-purpose audio tagging of Freesound content with AudioSet labels". This task was hosted on the Kaggle platform as "Freesound General-Purpose Audio Tagging Challenge". The goal of the task is to build an audio tagging system that can recognize the category of an audio clip from a subset of 41 diverse categories drawn from the AudioSet Ontology. We present the task, the dataset prepared for the competition, and a baseline system. Comment: Camera ready for DCASE Workshop 2018 |
Databáze: | OpenAIRE |
Externí odkaz: |