Unifying Probabilistic Models for Time-Frequency Analysis

Autor: Michael Riis Andersen, Dan Stowell, Arno Solin, William J. Wilkinson, Joshua D. Reiss
Přispěvatelé: Queen Mary University of London, Probabilistic Machine Learning, Professorship Solin A., Department of Computer Science, Aalto-yliopisto, Aalto University
Jazyk: angličtina
Rok vydání: 2018
Předmět:
Signal Processing (eess.SP)
FOS: Computer and information sciences
Sound (cs.SD)
Computer Science - Machine Learning
Computer science
Gaussian processes
Inference
Machine Learning (stat.ML)
02 engineering and technology
computer.software_genre
Computer Science - Sound
Machine Learning (cs.LG)
Data modeling
symbols.namesake
Audio and Speech Processing (eess.AS)
Statistics - Machine Learning
FOS: Electrical engineering
electronic engineering
information engineering

0202 electrical engineering
electronic engineering
information engineering

Time domain
probabilistic time-frequency analysis
Electrical Engineering and Systems Science - Signal Processing
Audio signal processing
Gaussian process
state space models
State-space representation
Probabilistic logic
020206 networking & telecommunications
Amplitude
Kernel (statistics)
Frequency domain
symbols
computer
Algorithm
Electrical Engineering and Systems Science - Audio and Speech Processing
Zdroj: ICASSP
Popis: In audio signal processing, probabilistic time-frequency models have many benefits over their non-probabilistic counterparts. They adapt to the incoming signal, quantify uncertainty, and measure correlation between the signal's amplitude and phase information, making time domain resynthesis straightforward. However, these models are still not widely used since they come at a high computational cost, and because they are formulated in such a way that it can be difficult to interpret all the modelling assumptions. By showing their equivalence to Spectral Mixture Gaussian processes, we illuminate the underlying model assumptions and provide a general framework for constructing more complex models that better approximate real-world signals. Our interpretation makes it intuitive to inspect, compare, and alter the models since all prior knowledge is encoded in the Gaussian process kernel functions. We utilise a state space representation to perform efficient inference via Kalman smoothing, and we demonstrate how our interpretation allows for efficient parameter learning in the frequency domain.
Accepted to International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019
Databáze: OpenAIRE