Autor: |
Zi-yang CHEN, Yu-jun HAN, Xuan WANG, Jun-feng ZHOU |
Jazyk: |
čínština |
Rok vydání: |
2014 |
Předmět: |
|
Zdroj: |
Tongxin xuebao, Vol 35, Pp 10-20 (2014) |
Druh dokumentu: |
article |
ISSN: |
1000-436X |
DOI: |
10.3969/j.issn.1000-436x.2014.12.002 |
Popis: |
Computing top-k similar strings based on edit distance,i.e.,given a query string σ and string set S,finding k similar strings to σ based on edit distance from S.Firstly,two adaptive filter strategies based on length-skip index are proposed,such that to reduce the times of edit distance computation between two strings.Then the lower bound of edit distance between query string and unmatched string set is proposed,such that to further reduce the times of edit dis-tance computation when processing strings that do not have common signatures with the query string.Finally efficient algorithms to return top-k similar strings are proposed.Experimental results on three real datasets verify the benefits over the state-of-the-art algorithm. |
Databáze: |
Directory of Open Access Journals |
Externí odkaz: |
|