Zobrazeno 1 - 10
of 65
pro vyhledávání: '"Zhu, Yingqiu"'
Autor:
Li, Xuetong, Gao, Yuan, Chang, Hong, Huang, Danyang, Ma, Yingying, Pan, Rui, Qi, Haobo, Wang, Feifei, Wu, Shuyuan, Xu, Ke, Zhou, Jing, Zhu, Xuening, Zhu, Yingqiu, Wang, Hansheng
This paper presents a selective review of statistical computation methods for massive data analysis. A huge amount of statistical methods for massive data computation have been rapidly developed in the past decades. In this work, we focus on three ca
Externí odkaz:
http://arxiv.org/abs/2403.11163
This article introduces CluBear, a Python-based open-source package for interactive massive data analysis. The key feature of CluBear is that it enables users to conduct convenient and interactive statistical analysis of massive data with only a trad
Externí odkaz:
http://arxiv.org/abs/2312.17065
In this paper, we studied a buffered mini-batch gradient descent (BMGD) algorithm for training complex model on massive datasets. The algorithm studied here is designed for fast training on a GPU-CPU system, which contains two steps: the buffering st
Externí odkaz:
http://arxiv.org/abs/2312.08728
Autor:
Zeng, Qianhan, Zhu, Yingqiu, Zhu, Xuening, Wang, Feifei, Zhao, Weichen, Sun, Shuning, Su, Meng, Wang, Hansheng
Labeling mistakes are frequently encountered in real-world applications. If not treated well, the labeling mistakes can deteriorate the classification performances of a model seriously. To address this issue, we propose an improved Naive Bayes method
Externí odkaz:
http://arxiv.org/abs/2304.06292
With the rapid development of online payment platforms, it is now possible to record massive transaction data. Clustering on transaction data significantly contributes to analyzing merchants' behavior patterns. This enables payment platforms to provi
Externí odkaz:
http://arxiv.org/abs/2203.02709
Publikováno v:
In Expert Systems With Applications 15 January 2025 260
Online social network platforms such as Twitter and Sina Weibo have been extremely popular over the past 20 years. Identifying the network community of a social platform is essential to exploring and understanding the users' interests. However, the r
Externí odkaz:
http://arxiv.org/abs/2110.13613
The emergence of massive data in recent years brings challenges to automatic statistical inference. This is particularly true if the data are too numerous to be read into memory as a whole. Accordingly, new sampling techniques are needed to sample da
Externí odkaz:
http://arxiv.org/abs/2110.00936
Publikováno v:
In Journal of Statistical Planning and Inference July 2024 231
Publikováno v:
In Computational Statistics and Data Analysis January 2024 189