General-purpose Tagging of Freesound Audio with AudioSet Labels: Task Description, Dataset, and Baseline

Autor:	Fonseca, Eduardo, Plakal, Manoj, Font, Frederic, Ellis, Daniel P. W., Favory, Xavier, Pons Puig, Jordi, Serra, Xavier
Rok vydání:	2018
Předmět:	FOS: Computer and information sciences Computer Science - Machine Learning Sound (cs.SD) Audio dataset Machine Learning (stat.ML) Audio tagging Computer Science - Sound Machine Learning (cs.LG) Statistics - Machine Learning Audio and Speech Processing (eess.AS) Data collection FOS: Electrical engineering electronic engineering information engineering Electrical Engineering and Systems Science - Audio and Speech Processing
Zdroj:	Recercat. Dipósit de la Recerca de Catalunya instname
DOI:	10.48550/arxiv.1807.09902
Popis:	This paper describes Task 2 of the DCASE 2018 Challenge, titled "General-purpose audio tagging of Freesound content with AudioSet labels". This task was hosted on the Kaggle platform as "Freesound General-Purpose Audio Tagging Challenge". The goal of the task is to build an audio tagging system that can recognize the category of an audio clip from a subset of 41 diverse categories drawn from the AudioSet Ontology. We present the task, the dataset prepared for the competition, and a baseline system. Comment: Camera ready for DCASE Workshop 2018
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::9c83c8eae0b21878973e94630dc0b5b9 Zobrazit plný text záznamu