Benchmarking monthly homogenization algorithms

Autor: Sorin Cheval, Victor Venema, Pavel Zahradníček, José Antonio Guijarro, Fiorella Acquaotta, M. J. Menne, J. Viarre, G. Müller-Westermeier, Peter Domonkos, C. Gruber, Ingeborg Auer, K. Kolokythas, Simona Fratianni, Elke Rustemeier, Petr Stepanek, T. Brandsma, Michele Brunetti, G. Vertacnik, M. Prohom Duran, Ralf Lindau, Tania Marinova, Pere Esteban, M. Klancar, Mónika Lakatos, O. Mestre, Enric Aguilar, Tamás Szentimrey, L. Andresen, D. Rasol, T. Likso, Claude N. Williams
Jazyk: angličtina
Rok vydání: 2011
Předmět:
Popis: The COST (European Cooperation in Science and Technology) Action ES0601: Advances in homogenization methods of climate series: an integrated approach (HOME) has executed a blind intercomparison and validation study for monthly homogenization algorithms. Time series of monthly temperature and precipitation were evaluated because of their importance for climate studies and because they represent two important types of statistics (additive and multiplicative). The algorithms were validated against a realistic benchmark dataset. The benchmark contains real inhomogeneous data as well as simulated data with inserted inhomogeneities. Random break-type inhomogeneities were added to the simulated datasets modeled as a Poisson process with normally distributed breakpoint sizes. To approximate real world conditions, breaks were introduced that occur simultaneously in multiple station series within a simulated network of station data. The simulated time series also contained outliers, missing data periods and local station trends. Further, a stochastic nonlinear global (network-wide) trend was added. Participants provided 25 separate homogenized contributions as part of the blind study as well as 22 additional solutions submitted after the details of the imposed inhomogeneities were revealed. These homogenized datasets were assessed by a number of performance metrics including (i) the centered root mean square error relative to the true homogeneous value at various averaging scales, (ii) the error in linear trend estimates and (iii) traditional contingency skill scores. The metrics were computed both using the individual station series as well as the network average regional series. The performance of the contributions depends significantly on the error metric considered. Contingency scores by themselves are not very informative. Although relative homogenization algorithms typically improve the homogeneity of temperature data, only the best ones improve precipitation data. Training was found to be very important. Moreover, state-of-the-art relative homogenization algorithms developed to work with an inhomogeneous reference are shown to perform best. The study showed that currently automatic algorithms can perform as well as manual ones.
Databáze: OpenAIRE