A simple method for user-driven music thumbnailing

Authors: van Nieuwenhuijsen, A.N., Burgoyne, J.A., Wiering, F., Sneekes, M., Cuming, J., Lee, J.H., McFee, B., Schedl, M., Devaney, J., McKay, C., Zangerle, E., de Reuse, T.
Contributors: ILLC (FGw), Language and Computation (ILLC, FNWI/FGw), ILLC (FNWI), Sub Multimedia, Multimedia
Publication year: 2020
Source: Proceedings of the 21st International Society for Music Information Retrieval Conference: ISMIR MTL2020, Montréal, Québec, Canada, Virtual Conference, 11 to 16 October 2020, 223-230
DOI: 10.5281/zenodo.4245410
Description: More and more music is becoming available digitally, increasing the need to navigate large numbers of audio tracks easily. One approach to improving the browsing experience is music thumbnailing: the procedure of finding a continuous fragment that can represent the whole musical piece. This paper proposes a human-centred approach to creating thumbnails based on listeners' perception, directly asking listeners to identify the most characteristic fragment. We carried out a user study to assign representativeness scores to multiple fragments from a selection of popular music tracks. To strengthen the results, we replicated the same user study with new participants and a different set of music. Thereafter, we used audio features, the segmentation algorithm, and participants' overall familiarity with the songs to predict representativeness scores. The results suggest that neither segmentation nor familiarity has a significant impact on users' thumbnail preferences: even segments with starting points that pay no regard to song structure can be suitable thumbnails. Three high-level audio characteristics, however, do affect the perceived representativeness of a fragment: Raw Intensity, Melodic Conventionality, and Conventionality of Intensity. Based on these findings, we propose a new, easy-to-apply method for music thumbnailing.
Database: OpenAIRE