Communication-efficient distributed statistical inference on zero-inflated Poisson models

Autor: Ran Wan, Yang Bai
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: Statistical Theory and Related Fields, Vol 8, Iss 2, Pp 81-106 (2024)
Druh dokumentu: article
ISSN: 24754269
2475-4277
2475-4269
DOI: 10.1080/24754269.2023.2263721
Popis: Zero-inflated count outcomes are common in many studies, such as counting claim frequency in the insurance industry in which identifying and understanding excessive zeros are of interest. Moreover, with the progress of data collecting and storage techniques, the amount of data is too massive to be stored or processed by a single node or branch. Hence, to develop distributed data analysis is blossoming. In this paper, several communication-efficient distributed zero-inflated Poisson regression algorithms are developed to analyse such kind of large-scale zero-inflated data. Both asymptotic properties of the proposed estimators and algorithm complexities are well studied and conducted. Various simulation studies demonstrate that our proposed method and algorithm work well and efficiently. Finally, in the case study, we apply our proposed algorithms to car insurance data from Kaggle.
Databáze: Directory of Open Access Journals