A Histogram Method for Summarizing Multi-dimensional Probabilistic Data
Autor: | Hai Wang, Qigang Gao, Ashraf Iqbal |
---|---|
Rok vydání: | 2013 |
Předmět: |
Histogram
Kullback–Leibler divergence business.industry Computer science Probabilistic logic Histogram matching Probabilistic database Pattern recognition computer.software_genre Automatic summarization Probabilistic Database Data Summarization General Earth and Planetary Sciences Probability distribution Uncertain Database Artificial intelligence Data mining business Wavelet computer General Environmental Science |
Zdroj: | ANT/SEIT |
ISSN: | 1877-0509 |
DOI: | 10.1016/j.procs.2013.06.135 |
Popis: | Currently, many database applications deal with large imprecise and uncertain datasets. Probabilistic data summarization has recently emerged and has already become an active research area in the database community. In this paper, we propose a data summarization method to summarize multidimensional probabilistic data using histograms. The proposed method iteratively constructs a histogram to represent the probabilistic data while maintaining a trade-off between minimizing the relative entropy among probability distributions and minimizing the space used by the histogram. The experimental results show that the proposed method achieves small errors for various compression ratios. |
Databáze: | OpenAIRE |
Externí odkaz: |