Link Between Individual Codon Frequencies and Protein Expression: Going Beyond Codon Adaptation Index.

Autor: Zaytsev K; Bach Institute of Biochemistry, Federal Research Center of Biotechnology of the Russian Academy of Sciences, Moscow 119071, Russia., Bogatyreva N; Bach Institute of Biochemistry, Federal Research Center of Biotechnology of the Russian Academy of Sciences, Moscow 119071, Russia., Fedorov A; Bach Institute of Biochemistry, Federal Research Center of Biotechnology of the Russian Academy of Sciences, Moscow 119071, Russia.
Jazyk: angličtina
Zdroj: International journal of molecular sciences [Int J Mol Sci] 2024 Oct 29; Vol. 25 (21). Date of Electronic Publication: 2024 Oct 29.
DOI: 10.3390/ijms252111622
Abstrakt: An important role of a particular synonymous codon composition of a gene in its expression level is well known. There are a number of algorithms optimizing codon usage of recombinant genes to maximize their expression in host cells. Nevertheless, the underlying mechanism remains unsolved and is of significant relevance. In the realm of modern biotechnology, directing protein production to a specific level is crucial for metabolic engineering, genome rewriting and a growing number of other applications. In this study, we propose two new simple statistical and empirical methods for predicting the protein expression level from the nucleotide sequence of the corresponding gene: Codon Expression Index Score (CEIS) and Codon Productivity Score (CPS). Both of these methods are based on the influence of each individual codon in the gene on the overall expression level of the encoded protein and the frequencies of isoacceptors in the species. Our predictions achieve a correlation level of up to r = 0.7 with experimentally measured quantitative proteome data of Escherichia coli , which is superior to any previously proposed methods. Our work helps understand how codons determine protein abundances. Based on these methods, it is possible to design proteins optimized for expression in a particular organism.
Databáze: MEDLINE
Nepřihlášeným uživatelům se plný text nezobrazuje