Enhancing data quality through attribute‐based metadata and cost evaluation in data warehouse environments
Authors: | Shan‐Shan Yang, Chen‐Chau Yang, Yu‐Chi Chu |
---|---|
Year of publication: | 2001 |
Subject: | |
Source: | Journal of the Chinese Institute of Engineers. 24:497-507 |
ISSN: | 2158-7299 0253-3839 |
DOI: | 10.1080/02533839.2001.9670646 |
Description: | Data quality will be a significant issue as data warehousing becomes more and more popular. This paper aims at investigating and analyzing the data quality issues in data warehouse environments. We present an attribute‐based metadata model for identifying data quality. A four‐phase process is introduced for data quality management during the life cycle of data warehouses. Overall data quality conditions can be identified and related information can be provided for determining whether the data meet “fit to use” criteria and whether they need to be improved. Furthermore, we use a cost/benefit evaluation model to ferret out the poor‐quality data and set priorities for improvement given limited resources. Our approach allows system developers to document relevant quality data as metadata, which may be associated with the whole life cycle of data warehouses. Quality metadata not only can enrich the interpretation of attribute data, but can also provide diagnostic information for finding the reasons fo... |
Database: | OpenAIRE |
External link: |