An analysis of semantic data quality defiencies in a national data warehouse: a data mining approach
Autor: | Barth, Kirstin |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2018 |
Předmět: | |
Druh dokumentu: | Dissertation |
Popis: | This research determines whether data quality mining can be used to describe, monitor and evaluate the scope and impact of semantic data quality problems in the learner enrolment data on the National Learners’ Records Database. Previous data quality mining work has focused on anomaly detection and has assumed that the data quality aspect being measured exists as a data value in the data set being mined. The method for this research is quantitative in that the data mining techniques and model that are best suited for semantic data quality deficiencies are identified and then applied to the data. The research determines that unsupervised data mining techniques that allow for weighted analysis of the data would be most suitable for the data mining of semantic data deficiencies. Further, the academic Knowledge Discovery in Databases model needs to be amended when applied to data mining semantic data quality deficiencies. School of Computing M. Tech. (Information Technology) |
Databáze: | Networked Digital Library of Theses & Dissertations |
Externí odkaz: |