A Generative Model for Natural Sounds Based on Latent Force Modelling
Autor: | Dan Stowell, William J. Wilkinson, Joshua D. Reiss |
---|---|
Rok vydání: | 2018 |
Předmět: |
Computer science
business.industry Probabilistic logic Statistical model Pattern recognition 02 engineering and technology Latent variable 030507 speech-language pathology & audiology 03 medical and health sciences symbols.namesake Generative model Kriging Component (UML) 0202 electrical engineering electronic engineering information engineering symbols 020201 artificial intelligence & image processing Artificial intelligence 0305 other medical science Natural sounds business Gaussian process |
Zdroj: | Latent Variable Analysis and Signal Separation ISBN: 9783319937632 LVA/ICA |
DOI: | 10.1007/978-3-319-93764-9_25 |
Popis: | Generative models based on subband amplitude envelopes of natural sounds have resulted in convincing synthesis, showing subband amplitude modulation to be a crucial component of auditory perception. Probabilistic latent variable analysis can be particularly insightful, but existing approaches don’t incorporate prior knowledge about the physical behaviour of amplitude envelopes, such as exponential decay or feedback. We use latent force modelling, a probabilistic learning paradigm that encodes physical knowledge into Gaussian process regression, to model correlation across spectral subband envelopes. We augment the standard latent force model approach by explicitly modelling dependencies across multiple time steps. Incorporating this prior knowledge strengthens the interpretation of the latent functions as the source that generated the signal. We examine this interpretation via an experiment showing that sounds generated by sampling from our probabilistic model are perceived to be more realistic than those generated by comparative models based on nonnegative matrix factorisation, even in cases where our model is outperformed from a reconstruction error perspective. |
Databáze: | OpenAIRE |
Externí odkaz: |