Kinetic sequencing (k-Seq) as a massively parallel assay for ribozyme kinetics: utility and critical parameters
Autor: | Evan Janzen, Irene A. Chen, Yuning Shen, Abe Pressman |
---|---|
Rok vydání: | 2020 |
Předmět: |
Accuracy and precision
Sequence analysis AcademicSubjects/SCI00010 Computer science Computational biology 010402 general chemistry 01 natural sciences Models Biological DNA sequencing 03 medical and health sciences Genetics RNA Catalytic Massively parallel Bootstrapping (statistics) 030304 developmental biology 0303 health sciences Sequence biology Bootstrapping Ribozyme High-Throughput Nucleotide Sequencing Experimental data Sequence Analysis DNA 0104 chemical sciences Kinetics Narese/28 Mutation biology.protein Methods Online Identifiability |
Zdroj: | Nucleic Acids Research |
DOI: | 10.1101/2020.12.02.407346 |
Popis: | Characterization of genotype-phenotype relationships of genetically encoded molecules (e.g., ribozymes) requires accurate quantification of activity for a large set of molecules. Kinetic measurement using high-throughput sequencing (e.g., k-Seq) is an emerging assay applicable in various domains that potentially scales up measurement throughput to 105 ~ 106 unique sequences. However, technical challenges introduced by sequence heterogeneity and DNA sequencing must be understood to realize the utility and limitations of such assays. We characterized the k-Seq method in terms of model identifiability, effects of sequencing error, accuracy and precision using simulated datasets and experimental data from a variant pool constructed from previously identified ribozymes. Relative abundance, kinetic coefficients, and measurement noise were found to affect the measurement of each sequence. We introduced bootstrapping to robustly quantify the uncertainty in estimating model parameters and proposed interpretable metrics to quantify model identifiability. These efforts enabled the rigorous reporting of data quality for individual sequences in k-Seq experiments. Critical experimental factors were examined, and general guidelines are proposed to maximize the number of sequences having precisely estimated and identifiable kinetic coefficients from k-Seq data. Practices analogous to those laid out here could be applied to improve the rigor of similar sequencing-based assays. |
Databáze: | OpenAIRE |
Externí odkaz: |