Improved automated classification of basketball crowd noise

Autor:	Spencer Wadsworth, Eric Todd, Katrina Pedersen, Mark K. Transtrum, Mylan R. Cook, Sean Warnick, Brooks A. Butler, Kent L. Gee
Rok vydání:	2019
Předmět:	Training set Basketball Acoustics and Ultrasonics Computer science Event (computing) business.industry Machine learning computer.software_genre ComputingMethodologies_PATTERNRECOGNITION Arts and Humanities (miscellaneous) Spectrogram Unsupervised learning Noise (video) Artificial intelligence Cluster analysis business computer
Zdroj:	The Journal of the Acoustical Society of America. 145:1816-1816
ISSN:	0001-4966
Popis:	This paper describes using both supervised and unsupervised machine learning (ML) methods to improve automatic classification of crowd responses to events at collegiate basketball games. This work builds on recent investigations by the research team where the two ML approaches were treated separately. In one case, crowd response events (cheers, applause, etc.) were manually labeled, and then, a subset of the labeled events were used as a training set for supervised-ML event classification. In the other, (unsupervised) k-means clustering was used to divide a game’s one-twelfth octave spectrogram into six distinct clusters. A comparison of the two approaches shows that the manually labeled crowd responses are grouped into only one or two of the six unsupervised clusters. This paper describes how the supervised ML labels guide improvements to the k-means clustering analysis, such as determining which additional audio features are required as inputs and how both approaches can be used in tandem to improve automated classification of crowd noise at basketball games. This paper describes using both supervised and unsupervised machine learning (ML) methods to improve automatic classification of crowd responses to events at collegiate basketball games. This work builds on recent investigations by the research team where the two ML approaches were treated separately. In one case, crowd response events (cheers, applause, etc.) were manually labeled, and then, a subset of the labeled events were used as a training set for supervised-ML event classification. In the other, (unsupervised) k-means clustering was used to divide a game’s one-twelfth octave spectrogram into six distinct clusters. A comparison of the two approaches shows that the manually labeled crowd responses are grouped into only one or two of the six unsupervised clusters. This paper describes how the supervised ML labels guide improvements to the k-means clustering analysis, such as determining which additional audio features are required as inputs and how both approaches can be used in tandem to improve au...
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::8e07fe63707d519db662bf1dd2f088b1 https://doi.org/10.1121/1.5101637 Zobrazit plný text záznamu