Medleyvox: An Evaluation Dataset for Multiple Singing Voices Separation

Autor:	Jeon, Chang-Bin, Moon, Hyeongi, Choi, Keunwoo, Chon, Ben Sangbae, Lee, Kyogu
Rok vydání:	2023
Předmět:	FOS: Computer and information sciences Sound (cs.SD) Computer Science - Machine Learning Audio and Speech Processing (eess.AS) FOS: Electrical engineering electronic engineering information engineering Computer Science - Sound Machine Learning (cs.LG) Electrical Engineering and Systems Science - Audio and Speech Processing
Zdroj:	ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
DOI:	10.1109/icassp49357.2023.10095425
Popis:	Separation of multiple singing voices into each voice is a rarely studied area in music source separation research. The absence of a benchmark dataset has hindered its progress. In this paper, we present an evaluation dataset and provide baseline studies for multiple singing voices separation. First, we introduce MedleyVox, an evaluation dataset for multiple singing voices separation. We specify the problem definition in this dataset by categorizing it into i) unison, ii) duet, iii) main vs. rest, and iv) N-singing separation. Second, to overcome the absence of existing multi-singing datasets for a training purpose, we present a strategy for construction of multiple singing mixtures using various single-singing datasets. Third, we propose the improved super-resolution network (iSRNet), which greatly enhances initial estimates of separation networks. Jointly trained with the Conv-TasNet and the multi-singing mixture construction strategy, the proposed iSRNet achieved comparable performance to ideal time-frequency masks on duet and unison subsets of MedleyVox. Audio samples, the dataset, and codes are available on our website (https://github.com/jeonchangbin49/MedleyVox). 5 pages, 3 figures, 6 tables, To appear in ICASSP 2023 (camera-ready version)
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6bb62086604dfdfb56f095e49828d187 https://doi.org/10.1109/icassp49357.2023.10095425 Zobrazit plný text záznamu