An unsupervised method for detecting style breaches in a document

Autor: Seifeddine Mechti, Maryam Elamine, Lamia Hadrich Belguith
Předmět:
Zdroj: Web of Science
AICCSA
Popis: In this paper, we propose an unsupervised method for identifying style breaches in a given document, also known as intrinsic plagiarism detection. In fact, plagiarism is one of the major challenges in various domains. Supervised learning techniques fail to capture the stylistic changes in a text effectively and this is mainly due to the static segmentation of the text. For this reason, we present in this paper our proposed method for intrinsic plagiarism detection, we experimented with the unsupervised algorithm Kmeans and the similarity measure Cosine. Our results are comparable to best systems presented in PAN@CLEF competitive conference.
Databáze: OpenAIRE