Knowledge Graphs as Context Models: Improving the Detection of Cross-Language Plagiarism with Paraphrasing

Autor: Paolo Rosso, Marc Franco-Salvador, Parth Gupta
Jazyk: angličtina
Rok vydání: 2013
Předmět:
Zdroj: RiuNet. Repositorio Institucional de la Universitat Politécnica de Valéncia
instname
Lecture Notes in Computer Science ISBN: 9783642547973
PROMISE Winter School
Popis: Cross-language plagiarism detection attempts to identify and extract automatically plagiarism among documents in different languages. Plagiarized fragments can be translated verbatim copies or may alter their structure to hide the copying, which is known as paraphrasing and is more difficult to detect. In order to improve the paraphrasing detection, we use a knowledge graph-based approach to obtain and compare context models of document fragments in different languages. Experimental results in German-English and Spanish-English cross-language plagiarism detection indicate that our knowledge graph-based approach offers a better performance compared to other state-of-the-art models.
The research has been carried out in the framework of the European Commission WIQ-EIIRSES (no. 269180) and DIANA-APPLICATIONS - Finding Hidden Knowledge in Texts:Applications (TIN2012-38603-C02-01) projects as well as the VLC/CAMPUS Microcluster on Multimodal Interaction in Intelligent Systems.
Databáze: OpenAIRE