A methodology for measuring structure similarity of fuzzy XML documents
Autor: | Zhen Zhao, Zongmin Ma |
---|---|
Rok vydání: | 2017 |
Předmět: |
Document Structure Description
computer.internet_protocol Computer science Well-formed document 02 engineering and technology Similarity measure computer.software_genre Theoretical Computer Science Simple API for XML 020204 information systems 0202 electrical engineering electronic engineering information engineering XML schema computer.programming_language Numerical Analysis Information retrieval XML validation Computer Science Applications XML framework Computational Mathematics Computational Theory and Mathematics ComputingMethodologies_DOCUMENTANDTEXTPROCESSING 020201 artificial intelligence & image processing Data mining computer Software XML |
Zdroj: | Computing. 99:493-506 |
ISSN: | 1436-5057 0010-485X |
DOI: | 10.1007/s00607-017-0553-x |
Popis: | Document matching has become a crucial task for data integration. A considerable amount of algorithms for comparing XML documents have been proposed in the literature. Yet, the existing approaches fall short in ability to identify structural similarities of fuzzy XML documents. To fill this gap, in this paper, we provide an integrated comparison approach to cope with structural similarities of the fuzzy XML documents. Firstly, we propose a new fuzzy XML document tree model to represent fuzzy XML document. Secondly, we offer element/attribute features similarity measure approach to identify matching nodes. Thirdly, we present an effective algorithm based on the tree edit distance to detect the structural similarities between fuzzy XML document trees represented with the proposed model. Finally, the experimental results demonstrate that our approach can efficiently perform structural similarity measure of the fuzzy XML documents. |
Databáze: | OpenAIRE |
Externí odkaz: |