Dimensional Modeling of HIV Data Using Open Source
Autor: | Otine, Charles D., Kucel, Samuel B., Trojer, Lena |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2010 |
Předmět: | |
DOI: | 10.5281/zenodo.1071932 |
Popis: | Selecting the data modeling technique for an information system is determined by the objective of the resultant data model. Dimensional modeling is the preferred modeling technique for data destined for data warehouses and data mining, presenting data models that ease analysis and queries which are in contrast with entity relationship modeling. The establishment of data warehouses as components of information system landscapes in many organizations has subsequently led to the development of dimensional modeling. This has been significantly more developed and reported for the commercial database management systems as compared to the open sources thereby making it less affordable for those in resource constrained settings. This paper presents dimensional modeling of HIV patient information using open source modeling tools. It aims to take advantage of the fact that the most affected regions by the HIV virus are also heavily resource constrained (sub-Saharan Africa) whereas having large quantities of HIV data. Two HIV data source systems were studied to identify appropriate dimensions and facts these were then modeled using two open source dimensional modeling tools. Use of open source would reduce the software costs for dimensional modeling and in turn make data warehousing and data mining more feasible even for those in resource constrained settings but with data available. {"references":["Chen, P. (1976). The Entity Relationship model-Towards a unified view\nof data, ACM Transactions on Database Systems, 1, 1, 9-36.","Chilton, M.A. (2006). Data Modeling Education: The changing\ntechnology, Journal of Information Systems Educaion, 17,1, 17-20.","Coar, K. (2006). The Open source Definition , Retrieved on 18th Nov\n2008 from opensource.org: http://www.opensource.org/docs/osd","Dash, A.K and Agarwal, R. (2001). Dimensional modeling for Data\nwarehouse, ACM SIGSOFT software engineering notes, 26, 1, 83-84.","Golfarelli, M., Maio, D. and Rizzi, S. (1998). Conceptual Design of Data\nwarehouses from E-R schemes, Proceedings of the Hawaii International\nConference On System Sciences, January 6-9, Hawaii","Gui, Y., Tang, S., Tong, Y. and Yang,D. (2006). Tripple Driven Data\nModeling Methodology in Data warehousing: A case study, ACM\nworkshop on Data warehousing and OLAP, 59-66","Ilczuk, G. and Wakulicz-Deja, A. (2007). Selection of Important\nattributes for Medical Diagnosis Systems. Transactions on Rough Sets ,\n7,1, 70-84.","Jones, M. E. and Song, I.Y. (2008). Dimensional modeling:\nIdentification, classification and evaluation of patterns. Decision\nSupport Systems , 59-76.","Kleijen, J. P. (1995). Verification and validation of simulation models.\nEuropean Journal of Operations Research , 82,1, 145-162.\n[10] Kortinik, M. A. and Moody, D. L. (2003). From ER Models to\nDimensional Models: Bridging the Gap between OLTP and OLAP\nDesign. Business Intelligence Journal , 8,3, 1-17.\n[11] Laender H. F., Freitas, G.M., and Campos, M.L. (2002). MD2- Getting\nUsers Involved in the Development of Data Warehouse Applications.\n4th International Conference Workshop Design and Management of\nData warehouses. May 27, Toronto, University of British Columbia, 3-\n12.\n[12] Lambert, B. (1995). Break Old Habits To Define Data Warehousing\nRequirements. Data Management Review .\n[13] Malinowski, E. and Zimanyi, E. (2007). A conceptual model for\ntemporal data warehouses and its transformation to the the ER and\nobject-relational model. Data and Knowledge Engineering ,64, 101-133.\n[14] Martyn, T. (2004). Reconsidering Multi-Dimensional Schemas. ACMs\nSpecial Interest Group On Management of Data , 33,1, 83-88.\n[15] Nguyen, T. M., Tjoa, A. M., and Trujillo, J. (2005). Data Warehousing\nand Knowledge Discovery: A Chronological View of Research\nChallenges. Springer , 530-535.\n[16] Pearson, W. (2008, 1 24). Dimensional Model components: Dimensions\npart 1. Retrieved 11 19, 2008, from Database Journal:\nhttp://www.databasejournal.com/features/mssql/article.php/3723311/Di\nmensional-Model-Components--Dimensions-Part-I.htm\n[17] Phipps, C. and Davis, K.C. (2003). Automating Data warehouse\nconceptual Schema Design and Evaluation. Proceedings of the 4th\ninternational conference on Design and Management of Data\nwarehouses. May 27, Toronto Canada, 23-32\n[18] Pokorny, J. (2003). Modeling stars using XML.\n[19] Riadh, B. M., Omar, B., & Sabine, R. (2004). A new OLAP Aggregation\nBased on the AHC Technique. DOLAP (pp. 65-71). Washington,DC:\nACM.\n[20] UNAIDS. (2008). 2008 Report on the Global AIDS epidemic. Geneva:\nWHO Library Cataloguing-in-Publication Data."]} |
Databáze: | OpenAIRE |
Externí odkaz: |