Pattern matching between two non-aligned random sequences.

Authors: Sheng KN (Department of Statistics, Rutgers, State University of New Jersey, New Brunswick 08903); Naus JI
Language: English
Source: Bulletin of mathematical biology [Bull Math Biol] 1994 Nov; Vol. 56 (6), pp. 1143-62.
DOI: 10.1007/BF02460290
Abstract: Given two independent sequences of letters, we seek the probability distribution of the length of the longest matching word. This word can be in different positions in the two sequences and we consider both perfect and nearly perfect matching. We derive bounds and approximations for the probability and compare them with other bounds and approximations. The results can be applied to DNA sequences in molecular biology and generalized matching between two independent random sequences.
Database: MEDLINE
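
Illustration: the statistic described in the abstract, the length of the longest word that matches between two sequences at possibly different positions, can be sketched in a few lines of Python. This is not the method of the paper, which derives analytic bounds and approximations; it is a brute-force computation plus a Monte Carlo estimate of the tail probability. The function names, the `max_mismatches` parameter (one possible reading of "nearly perfect matching"), and the assumption of independent, uniformly distributed letters over `ACGT` are all illustrative choices not taken from the record.

```python
import random


def longest_match_length(a, b, max_mismatches=0):
    """Length of the longest word shared by a and b at any pair of positions,
    allowing at most max_mismatches mismatched letters (0 = perfect match)."""
    n, m = len(a), len(b)
    best = 0
    # Each offset fixes one alignment (diagonal) of b against a.
    for offset in range(-(m - 1), n):
        i = max(offset, 0)      # start index in a
        j = i - offset          # start index in b
        length = min(n - i, m - j)
        left = 0
        mismatches = 0
        # Sliding window along the diagonal with a mismatch budget.
        for right in range(length):
            if a[i + right] != b[j + right]:
                mismatches += 1
            while mismatches > max_mismatches:
                if a[i + left] != b[j + left]:
                    mismatches -= 1
                left += 1
            best = max(best, right - left + 1)
    return best


def estimate_tail_probability(n, threshold, trials=200, max_mismatches=0,
                              alphabet="ACGT", seed=0):
    """Monte Carlo estimate of P(longest match >= threshold) for two
    independent sequences of length n with i.i.d. uniform letters
    (an assumption made for this sketch)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        a = "".join(rng.choice(alphabet) for _ in range(n))
        b = "".join(rng.choice(alphabet) for _ in range(n))
        if longest_match_length(a, b, max_mismatches) >= threshold:
            hits += 1
    return hits / trials
```

For example, `longest_match_length("GATTACA", "GCTTACA")` considers only perfect matches, while passing `max_mismatches=1` allows one mismatched letter inside the matching word. The estimate from `estimate_tail_probability` is only as good as the number of trials and the uniform-letter assumption; it is meant as a sanity check against analytic approximations, not a replacement for them.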