DATA MINING AND PROBABILISTIC NETWORK-BASED MECHANISM FOR DATA WAREHOUSING

Autor: Shyu, Mei-Ling, Kashyap, R. L.
Rok vydání: 1999
Předmět:
Zdroj: Department of Electrical and Computer Engineering Technical Reports
Popis: A good multirmedia database managerment systerm (MDBMS) should be able to store, retrieve, and manage rich semantic data in multimedia database syst,ems. Due to the complexity of real-world applications, the number of databases and the volumes of data in databases have increased tremendously. With the explosive growth in the amount and complexity of data, how to effectively manage the network of databases and utilize the large amount of data becomes important. For this purpose, a probabilistic network-based mechanism for constructing a federation of data warehouses and speeding up information retrieval to facilitate the functionality of an MDBMS is proposed. Our solution procedure consists of three steps. First, we build the probabilistic network by reasoning the probability distributions and mining the generalized affinity-based associations from a set of historical data collected from the network of operational databases. By doing so, the summarized and useful knowledge can be discovered. Second, we derive a siirnilarity measure method to construct a federation of data warehouses so as to reduce the number of inter-warehouse accesses required for queries. Those databases with high similarity values are placed in the same data warehouse. The similarity value is measured via a stochastic process from the mined probability distributions. Third, a second stochastic process generates a list of possible paths with respect to a given query and specifies the particular media objects over the constructed data warehouses so as to speed up multimedia query processing and information retrieval.
Databáze: OpenAIRE