Optimizing DNA assembly based on statistical language modelling

Authors:	Gang Fang, Shemin Zhang, Yafei Dong
Language:	English
Publication year:	2017
Subject:	0301 basic medicine Theoretical computer science media_common.quotation_subject Biology BioBrick Bioinformatics 03 medical and health sciences 0302 clinical medicine Genetics media_common Models Statistical Grammar String (computer science) Process (computing) DNA Dynamic programming 030104 developmental biology Probability distribution Methods Online Programming Languages Synthetic Biology Language model Genetic Engineering 030217 neurology & neurosurgery Sentence Algorithms
Source:	Nucleic Acids Research
ISSN:	1362-4962 0305-1048
Description:	Complex genetic constructs composed of dozens of functional blocks can be built by successively assembling genetic parts, such as BioBrick parts, according to grammatical models. However, each category of genetic parts usually contains several or many parts. As the number of genetic parts increases, assembling more than a few sets of these parts can be expensive, time-consuming and error-prone. At the last step of assembly it can be difficult to decide which part should be selected. Using a statistical language model, which is a probability distribution P(S) over strings S that attempts to reflect how frequently a string S occurs as a sentence, the most commonly used parts are selected. A dynamic programming algorithm was then designed to find the maximum-probability solution. The algorithm optimizes the results of a genetic design based on a grammatical model and finds an optimal solution. In this way, redundant operations can be reduced and the time and cost required for conducting biological experiments can be minimized.
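The selection step described above can be illustrated with a small sketch. This is not the authors' implementation: it assumes a bigram language model (the record does not state the model order), and the function name `best_assembly`, the part names, and all probabilities are hypothetical values chosen only for illustration. Under those assumptions, a Viterbi-style dynamic program picks one part per grammatical category so that the probability of the whole sequence is maximal.

```python
import math


def best_assembly(categories, start_prob, bigram_prob):
    """Choose one part per category, maximizing the bigram probability of the sequence.

    categories  : list of lists, the candidate parts for each position (hypothetical data)
    start_prob  : dict part -> P(part) for the first position
    bigram_prob : dict (prev, cur) -> P(cur | prev); missing pairs count as probability 0
    """
    # best[p] = (log-probability, sequence) of the best partial design ending in part p
    best = {
        p: (math.log(start_prob[p]), [p])
        for p in categories[0]
        if start_prob.get(p, 0) > 0
    }
    for options in categories[1:]:
        nxt = {}
        for cur in options:
            candidates = [
                (score + math.log(bigram_prob[(prev, cur)]), seq + [cur])
                for prev, (score, seq) in best.items()
                if bigram_prob.get((prev, cur), 0) > 0
            ]
            if candidates:
                nxt[cur] = max(candidates, key=lambda c: c[0])
        best = nxt
    if not best:
        return None, 0.0  # no sequence with non-zero probability exists
    score, seq = max(best.values(), key=lambda c: c[0])
    return seq, math.exp(score)


# Hypothetical design: promoter -> RBS -> coding sequence -> terminator
categories = [["pA", "pB"], ["rbs1", "rbs2"], ["gfp"], ["term1"]]
start_prob = {"pA": 0.6, "pB": 0.4}
bigram_prob = {
    ("pA", "rbs1"): 0.3, ("pA", "rbs2"): 0.7,
    ("pB", "rbs1"): 0.9, ("pB", "rbs2"): 0.1,
    ("rbs1", "gfp"): 0.8, ("rbs2", "gfp"): 0.5,
    ("gfp", "term1"): 1.0,
}
seq, prob = best_assembly(categories, start_prob, bigram_prob)
print(seq, round(prob, 3))
```

In this toy data, picking the most frequent part at each step (pA, then rbs2) gives a sequence probability of 0.21, whereas the dynamic program finds pB, rbs1, gfp, term1 with probability 0.288. Working in log space avoids numerical underflow when a design has many positions.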
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6156abd88bf24af2e78f7dda518809f0 http://europepmc.org/articles/PMC5727464