Optimizing DNA assembly based on statistical language modelling
Autor: | Gang Fang, Shemin Zhang, Yafei Dong |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2017 |
Předmět: |
0301 basic medicine
Theoretical computer science media_common.quotation_subject Biology BioBrick Bioinformatics 03 medical and health sciences 0302 clinical medicine Genetics media_common Models Statistical Grammar String (computer science) Process (computing) DNA Dynamic programming 030104 developmental biology Probability distribution Methods Online Programming Languages Synthetic Biology Language model Genetic Engineering 030217 neurology & neurosurgery Sentence Algorithms |
Zdroj: | Nucleic Acids Research |
ISSN: | 1362-4962 0305-1048 |
Popis: | By successively assembling genetic parts such as BioBrick according to grammatical models, complex genetic constructs composed of dozens of functional blocks can be built. However, usually every category of genetic parts includes a few or many parts. With increasing quantity of genetic parts, the process of assembling more than a few sets of these parts can be expensive, time consuming and error prone. At the last step of assembling it is somewhat difficult to decide which part should be selected. Based on statistical language model, which is a probability distribution P(s) over strings S that attempts to reflect how frequently a string S occurs as a sentence, the most commonly used parts will be selected. Then, a dynamic programming algorithm was designed to figure out the solution of maximum probability. The algorithm optimizes the results of a genetic design based on a grammatical model and finds an optimal solution. In this way, redundant operations can be reduced and the time and cost required for conducting biological experiments can be minimized. |
Databáze: | OpenAIRE |
Externí odkaz: |