Single molecule long-read real-time amplicon-based sequencing of CYP2D6: a proof-of-concept with hybrid haplotypes

Autor: Rachael Dong, Megana Thamilselvan, Xiuying Hu, Beatriz Carvalho Henriques, Yabing Wang, Keanna Wallace, Sudhakar Sivapalan, Avery Buchner, Vasyl Yavorskyy, Kristina Martens, Wolfgang Maier, Neven Henigsberg, Joanna Hauser, Annamaria Cattaneo, Ole Mors, Marcella Rietschel, Gerald Pfeffer, Katherine J. Aitchison
Rok vydání: 2022
DOI: 10.1101/2022.08.17.503990
Popis: CYP2D6 is a widely expressed human xenobiotic metabolizing enzyme, best known for its role in the hepatic phase I metabolism of up to 25% of prescribed medications, which is also expressed in other organs including the brain, where its potential role in physiology and mental health traits and disorders is under further investigation. Owing to the presence of homologous pseudogenes in the CYP2D locus and transposable repeat elements in the intergenic regions, the gene encoding the CYP2D6 enzyme, CYP2D6, is one of the most hypervariable known human genes - with more than 140 core haplotypes. Haplotypes include structural variants, with a subtype of these known as fusion genes comprising part of CYP2D6 and part of its adjacent pseudogene, CYP2D7. The fusion genes are particularly challenging to identify. The CYP2D6 enzyme activity corresponding to some of these fusion genes is known, while for others it is unknown. The most recent (high fidelity, or HiFi) version of single molecule real-time (SMRT) long-read sequencing can cover whole CYP2D6 haplotypes in a single continuous sequence read, ideal for structural variant detection. In addition, the accuracy of base calling has increased to a level sufficient for accurate characterization of single nucleotide variants. As new CYP2D6 haplotypes are continuously being discovered, and likely many more remain to be identified in populations that are relatively understudied to date, a method of characterization that employs sequencing with at least this degree of accuracy is required. The aim of the work reported herein was to develop an efficient and accurate HiFi SMRT amplicon-based method capable of detecting the full range of CYP2D6 haplotypes including fusion genes. We report proof-of-concept for 20 amplicons, aligned to fusion gene haplotypes, with prior cross-validation data. Amplicons with CYP2D6-D7 fusion genes aligned to *36, *63, *68, and *4 (*4-like; *4N, or *4.013) hybrid haplotypes. Amplicons with CYP2D7-D6 fusion genes aligned to the *13 subhaplotypes predicted (e.g., *13F, *13A2). Data analysis was efficient, and further method development indicates that this technique could suffice for the characterization of the full range of CYP2D6 haplotypes. Although included in drug labelling by regulatory bodies (the U.S. Food and Drug Administration, the European Medicines Agency, the Pharmaceuticals and Medical Devices Agency) and prescribing recommendations by consortia (Clinical Pharmacogenetics Implementation Consortium and the Dutch Pharmacogenetics Working Group), the identification of CYP2D6 variants is not yet routine in clinical practice. The HiFi sequencing method reported herein is suitable for high throughput, efficient, identification of the full range of known CYP2D6 haplotypes and novel haplotypes, and can be completed in a week or less. Moreover, the method that we have developed could be extended to other complex loci and to other species in a multiplexed high throughput assay.
Databáze: OpenAIRE