Popis: |
Many scientific fields track variables through time to monitor trends, dynamics and abrupt changes. In this dissertation, we focus on the latter and aim to detect sudden distributional changes in time series data. Most of the existing change point detection methods proposed to automatically signal these abrupt shifts are univariate, targeting mean and other univariate statistics. This is an important limitation since in many applications, multiple variables (comprising a system) are monitored, and the events under study induce changes in multivariate rather than univariate parameters. Typical examples include excessive correlations of EEG signals during an epileptic seizure, increased dependence of financial assets during a financial crisis, or synchronized behavioral, experiential and physiological reactions to emotion-inducing stimuli. Next to changes in multivariate parameters, univariate parameters of multiple variables may also change simultaneously in reaction to a common event or stimulus. Therefore, a multivariate technique that can take all this information into account might have more power to pick up a meaningful change compared to univariate approaches. General-purpose multivariate non-parametric methods are attractive tools for detecting multivariate changes since they do not impose stringent distributional assumptions on the data. However, they have two major drawbacks: First, most of these methods perform well in detecting mean changes but are less sensitive to changes in other statistics. Second, when a change point is flagged by these methods, the user usually remains in the dark on what statistic changed exactly, as they are sensitive to any type of distributional change (e.g., mean, variance, autocorrelations and correlations). Yet, in substantive fields, knowing the nature of the change is central in understanding the studied phenomenon. In emotion psychology, for instance, distinct dynamic emotional features are reflected by specific statistics: emotional variability by the variance, emotional inertia by the autocorrelation and emotional differentiation by the correlations. In this dissertation, we present a flexible non-parametric change point detection approach to address these two mentioned drawbacks. The first three chapters focus on correlation change as methods to detect such change are largely lacking while it is an essential feature of multivariate data. In the last two chapters, we also study simultaneous change in univariate statistics (e.g., mean, variance, autocorrelation). In Chapter 1, four recent non-parametric change point detection methods are compared in terms of signaling both mean and correlation changes. Results from this comparison revealed that indeed correlation change is hard to capture and that KCP (kernel change point detection), outperforms the others. In chapters 2 and 3 we build on these results to further improve the detection of correlation change, by implementing KCP on the running correlations rather than on the raw data. In Chapter 2, a KCP based permutation test on the running correlations is proposed to signal the presence of correlation change points and is shown to outperform other significance tests. In Chapter 3, the substantial gain in sensitivity resulting from applying KCP on the running correlations rather than on the raw data is demonstrated. In Chapter 4, we apply KCP on the running autocorrelations and compare it with regime switching AR models. Finally, in Chapter 5 we take on a more comprehensive perspective and propose a KCP workflow to guide applied researchers in detecting different (and possibly intertwined) changes in real data. status: published |