Autor: |
Dingle K; Department of Chemical Engineering and Biotechnology, Cambridge University, Cambridge CB2 1TN, UK.; Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA 91125, USA.; Department of Mathematics and Natural Sciences, Centre for Applied Mathematics and Bioinformatics (CAMB), Gulf University for Science and Technology, 32093, Kuwait., Novev JK; Department of Chemical Engineering and Biotechnology, Cambridge University, Cambridge CB2 1TN, UK., Ahnert SE; Department of Chemical Engineering and Biotechnology, Cambridge University, Cambridge CB2 1TN, UK., Louis AA; Department of Physics, Rudolf Peierls Centre for Theoretical Physics, Oxford University, Oxford OX1 2JD, UK. |
Abstrakt: |
Unravelling the structure of genotype-phenotype (GP) maps is an important problem in biology. Recently, arguments inspired by algorithmic information theory (AIT) and Kolmogorov complexity have been invoked to uncover simplicity bias in GP maps, an exponentially decaying upper bound in phenotype probability with the increasing phenotype descriptional complexity. This means that phenotypes with many genotypes assigned via the GP map must be simple, while complex phenotypes must have few genotypes assigned. Here, we use similar arguments to bound the probability P ( x → y ) that phenotype x , upon random genetic mutation, transitions to phenotype y . The bound is [Formula: see text], where [Formula: see text] is the estimated conditional complexity of y given x , quantifying how much extra information is required to make y given access to x . This upper bound is related to the conditional form of algorithmic probability from AIT. We demonstrate the practical applicability of our derived bound by predicting phenotype transition probabilities (and other related quantities) in simulations of RNA and protein secondary structures. Our work contributes to a general mathematical understanding of GP maps and may facilitate the prediction of transition probabilities directly from examining phenotype themselves, without utilizing detailed knowledge of the GP map. |