Improving Automatic Jazz Melody Generation by Transfer Learning Techniques

Autor:	Yi-Hsuan Yang, Hsiao-Tzu Hung, Chung-Yang Wang, Hsin-Min Wang
Rok vydání:	2019
Předmět:	FOS: Computer and information sciences Computer Science - Machine Learning Sound (cs.SD) MIDI Computer science business.industry computer.file_format Machine learning computer.software_genre Autoencoder Computer Science - Sound Machine Learning (cs.LG) Data modeling Generative model Audio and Speech Processing (eess.AS) FOS: Electrical engineering electronic engineering information engineering Task analysis Artificial intelligence Jazz business Transfer of learning computer Classifier (UML) Electrical Engineering and Systems Science - Audio and Speech Processing
Zdroj:	APSIPA
DOI:	10.1109/apsipaasc47483.2019.9023224
Popis:	In this paper, we tackle the problem of transfer learning for Jazz automatic generation. Jazz is one of representative types of music, but the lack of Jazz data in the MIDI format hinders the construction of a generative model for Jazz. Transfer learning is an approach aiming to solve the problem of data insufficiency, so as to transfer the common feature from one domain to another. In view of its success in other machine learning problems, we investigate whether, and how much, it can help improve automatic music generation for under-resourced musical genres. Specifically, we use a recurrent variational autoencoder as the generative model, and use a genre-unspecified dataset as the source dataset and a Jazz-only dataset as the target dataset. Two transfer learning methods are evaluated using six levels of source-to-target data ratios. The first method is to train the model on the source dataset, and then fine-tune the resulting model parameters on the target dataset. The second method is to train the model on both the source and target datasets at the same time, but add genre labels to the latent vectors and use a genre classifier to improve Jazz generation. The evaluation results show that the second method seems to perform better overall, but it cannot take full advantage of the genre-unspecified dataset. Comment: 8 pages, Accepted to APSIPA ASC(Asia-Pacific Signal and Information Processing Association Annual Summit and Conference ) 2019
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4bcfe9fe3dc031aa88a0b24927832229 https://doi.org/10.1109/apsipaasc47483.2019.9023224 Zobrazit plný text záznamu