GOLD standard dataset for Alzheimer genes
Autor: | Anchal Vishnoi, Sushrutha Raj, Alok Kumar Srivastava |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2020 |
Předmět: |
Disease
Computational biology Biology Cross validation System modeling lcsh:Computer applications to medicine. Medical informatics GOLD standard Machine learning medicine False positive paradox Association Class lcsh:Science (General) Decision Science Genetic association Multidisciplinary Alzheimer gene association Gold standard (test) medicine.disease Reference data Alzheimer genes Meta-analysis Text classification lcsh:R858-859.7 Alzheimer's disease Meta analysis lcsh:Q1-390 |
Zdroj: | Data in Brief Data in Brief, Vol 30, Iss, Pp 105439-(2020) |
ISSN: | 2352-3409 |
Popis: | Alzheimer disease is a genetically complex multigenic neurodegenerative disorder, resulting from the interaction between multiple genes. Most of the earlier studies reported only few specific genes that have involvement in Alzheimer. However more than hundreds of susceptible genes have been observed, that have significant role in the development and progression of Alzheimer. Among all the existing data resources, Genetic association database is the most popular data source that contains information about genes, their association classes into positive, negative and neutral class and supporting reference. However, it contains lot of false positives and negatives associations. We have taken this data as reference and performed the double fold cross validation to compile the comprehensive list of Alzheimer genes, their association class viz, positive, negative or ambiguous with the disease and reference sentence confirming the association. The data generated will be used as a GOLD standard reference data set for the training of machine learning classifier to predict the classification of published literature not only in Alzheimer but in other diseases as well. In addition, positive associated genes data can also be used for the system level modelling or meta analysis of Alzheimer. |
Databáze: | OpenAIRE |
Externí odkaz: |