Research on Multi-Modal Music Score Alignment Model for Online Music Education.

Autor: Ren, Dexin
Předmět:
Zdroj: Journal of Advanced Computational Intelligence & Intelligent Informatics; Sep2024, Vol. 28 Issue 5, p1075-1084, 10p
Abstrakt: As music data storage becomes increasingly diverse in the era of big data, ensuring alignment of music works with the same semantics for online music education is crucial. To achieve this, a multi-modal music score alignment algorithm model based on deep learning was developed and optimized. Experimental results demonstrated that Note + DCO feature combination yielded the best MIDI input characteristics (mean value: 13.27 ms), whereas CQT feature comparison produced the best results for audio input (average: 12.85 ms). The ResNet-34 network was noted to have the most effective music score alignment effect with alignment errors averaging less than one frame. Compared with other algorithms, the proposed algorithm had the lowest average value of 9.28 ms, median value of 5.85 ms, and standard deviation of 20.17 ms. Actual music retrieval showed a Top-1 retrieval accuracy of 10.93% that was close to 11%. Overall, the proposed algorithm is significant for score alignment and music retrieval recognition in online music education. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index