The power of representation: Statistical analysis of diversity in US Alzheimer's disease genetics data.

Authors: Xue D; Institute for Public Health Genetics, University of Washington School of Public Health, Seattle, Washington, USA., Blue EE; Institute for Public Health Genetics, University of Washington School of Public Health, Seattle, Washington, USA.; Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington, USA.; Brotman Baty Institute, Seattle, Washington, USA., Conomos MP; Department of Biostatistics, University of Washington School of Public Health, Seattle, Washington, USA., Fohner AE; Institute for Public Health Genetics, University of Washington School of Public Health, Seattle, Washington, USA.; Department of Epidemiology, University of Washington School of Public Health, Seattle, Washington, USA.
Language: English
Source: Alzheimer's & dementia (New York, N. Y.) [Alzheimers Dement (N Y)] 2024 Mar 18; Vol. 10 (1), pp. e12462. Date of Electronic Publication: 2024 Mar 18 (Print Publication: 2024).
DOI: 10.1002/trc2.12462
Abstract: Introduction: Alzheimer's disease (AD) is a complex disease influenced by genetics and environment. More than 75 susceptibility loci have been linked to late-onset AD, but most of these loci were discovered in genome-wide association studies (GWAS) exclusive to non-Hispanic White individuals. There are wide disparities in AD risk across racially stratified groups, and while these disparities are not due to genetic differences, underrepresentation in genetic research can exacerbate them and contribute to their persistence. We investigated the racial/ethnic representation of participants in United States (US)-based AD genetics research and the statistical implications of current representation.
Methods: We compared racial/ethnic data of participants from array and sequencing studies in US AD genetics databases, including National Institute on Aging Genetics of Alzheimer's Disease Data Storage Site (NIAGADS) and NIAGADS Data Sharing Service (dssNIAGADS), to AD and related dementia (ADRD) prevalence and mortality. We then simulated the statistical power of these datasets to identify risk variants from non-White populations.
Results: There is insufficient statistical power (probability < 80%) to detect single nucleotide polymorphisms (SNPs) with low to moderate effect sizes (odds ratio [OR] < 1.5) using array data from Black and Hispanic participants; studies of Asian participants are not powered to detect variants with OR ≤ 2. Using available and projected sequencing data from Black and Hispanic participants, risk variants with OR = 1.2 are detectable at high allele frequencies. Sample sizes remain insufficient to detect these variants in Asian populations.
Discussion: AD genetics datasets are largely representative of US ADRD burden. However, there is a wide discrepancy between proportional representation and statistically meaningful representation. Most variants identified in GWAS of non-Hispanic White individuals have low to moderate effects. Comparable risk variants in non-White populations are not detectable given current sample sizes, which could lead to disparities in future studies and drug development. We urge AD genetics researchers and institutions to continue investing in the recruitment of diverse participants and to use community-based participatory research practices.
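The relationship between sample size, allele frequency, odds ratio, and the 80% power threshold reported in the Results can be illustrated with a rough analytical sketch. This is not the authors' simulation procedure, which the abstract does not describe; it is a normal-approximation power calculation for a two-sided allelic case-control test. The function names, the genome-wide significance level of 5e-8, and the sample sizes in the usage loop are all assumptions chosen for illustration, not values taken from the study.

```python
from statistics import NormalDist


def case_allele_freq(control_freq: float, odds_ratio: float) -> float:
    """Risk-allele frequency in cases implied by the control frequency and allelic OR."""
    odds = odds_ratio * control_freq / (1.0 - control_freq)
    return odds / (1.0 + odds)


def allele_test_power(
    odds_ratio: float,
    control_freq: float,
    n_cases: int,
    n_controls: int,
    alpha: float = 5e-8,  # assumed genome-wide significance level
) -> float:
    """Approximate power of a two-sided test comparing allele frequencies.

    Uses a normal approximation to the difference in allele frequencies
    between cases and controls; each person contributes two alleles.
    """
    std_normal = NormalDist()
    p_case = case_allele_freq(control_freq, odds_ratio)
    variance = (
        p_case * (1.0 - p_case) / (2 * n_cases)
        + control_freq * (1.0 - control_freq) / (2 * n_controls)
    )
    # Expected value of the z statistic under the alternative hypothesis.
    shift = (p_case - control_freq) / variance ** 0.5
    z_crit = std_normal.inv_cdf(1.0 - alpha / 2.0)
    # Probability that |Z| exceeds the critical value when Z ~ N(shift, 1).
    return std_normal.cdf(shift - z_crit) + std_normal.cdf(-shift - z_crit)


if __name__ == "__main__":
    # Hypothetical balanced designs; not the sample sizes analyzed in the study.
    for n in (2_000, 5_000, 20_000):
        power = allele_test_power(odds_ratio=1.2, control_freq=0.3,
                                  n_cases=n, n_controls=n)
        print(f"n_cases = n_controls = {n}: power = {power:.3f}")
```

Under these assumptions, a variant with OR = 1.2 needs a far larger sample than one with OR = 2 to clear the 80% threshold, which is the gap between proportional and statistically meaningful representation that the Discussion describes.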
Competing Interests: The authors declare no conflicts of interest. Author disclosures are available in the supporting information.
(© 2024 The Authors. Alzheimer's & Dementia: Translational Research & Clinical Interventions published by Wiley Periodicals LLC on behalf of Alzheimer's Association.)
Database: MEDLINE