On the Generation of a Classification Algorithm from DNA Based Microarray Studies

Author: Davies, Robert William
Language: English
Year of publication: 2010
Document type: Master's thesis
DOI: 10.20381/ruor-19342
Description: The purpose of this thesis is to build a classification algorithm from a genome-wide association (GWA) study. Briefly, a GWA study is a case-control study that uses genotypes derived from DNA microarrays for thousands of people. Each microarray can acquire the genotypes of hundreds of thousands of single nucleotide polymorphisms (SNPs) for one person at a time. In this thesis, we first describe the processes necessary to prepare the data for analysis. Next, we introduce the Naive Bayes classification algorithm and a modification in which the effect of each SNP on the disease of interest is weighted by a Bayesian posterior probability of association. We then use data from three coronary artery disease GWA studies, one as a training set and two as test sets, to build and test the classifier. Finally, we discuss the relevance of the results and the generalizability of the method to future studies.
Database: Networked Digital Library of Theses & Dissertations