Scalable Bayesian inference for self-excitatory stochastic processes applied to big American gunfire data.

Autor: Holbrook AJ; Department of Biostatistics, University of California, Los Angeles, Los Angeles, USA., Loeffler CE; Department of Criminology, University of Pennsylvania, Philadelphia, USA., Flaxman SR; Department of Mathematics, Imperial College London, London, UK., Suchard MA; Department of Biostatistics, University of California, Los Angeles, Los Angeles, USA.; Department of Biomathematics, University of California, Los Angeles, Los Angeles, USA.; Department of Human Genetics, University of California, Los Angeles, Los Angeles, USA.
Jazyk: angličtina
Zdroj: Statistics and computing [Stat Comput] 2021 Jan; Vol. 31 (1). Date of Electronic Publication: 2021 Jan 12.
DOI: 10.1007/s11222-020-09980-4
Abstrakt: The Hawkes process and its extensions effectively model self-excitatory phenomena including earthquakes, viral pandemics, financial transactions, neural spike trains and the spread of memes through social networks. The usefulness of these stochastic process models within a host of economic sectors and scientific disciplines is undercut by the processes' computational burden: complexity of likelihood evaluations grows quadratically in the number of observations for both the temporal and spatiotemporal Hawkes processes. We show that, with care, one may parallelize these calculations using both central and graphics processing unit implementations to achieve over 100-fold speedups over single-core processing. Using a simple adaptive Metropolis-Hastings scheme, we apply our high-performance computing framework to a Bayesian analysis of big gunshot data generated in Washington D.C. between the years of 2006 and 2019, thereby extending a past analysis of the same data from under 10,000 to over 85,000 observations. To encourage widespread use, we provide hpHawkes, an open-source R package, and discuss high-level implementation and program design for leveraging aspects of computational hardware that become necessary in a big data setting.
Databáze: MEDLINE