Intelligent bar chart plagiarism detection in documents.

Autor: Al-Dabbagh MM; Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia ; Faculty of Computer Sciences and Mathematics, University of Mosul, Mosul, Iraq., Salim N; Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia., Rehman A; MIS Department, CBA, Salman Bin Abdulaziz University, Alkharj, Saudi Arabia., Alkawaz MH; Faculty of Computing, Universiti Teknologi Malaysia, 81310 Skudai, Johor, Malaysia ; Faculty of Computer Sciences and Mathematics, University of Mosul, Mosul, Iraq., Saba T; College of Computer and Information Sciences (CCIS), Prince Sultan University, Riyadh, Saudi Arabia., Al-Rodhaan M; Computer Science Department, College of Computer & Information Sciences, King Saud University, Riyadh, Saudi Arabia., Al-Dhelaan A; Computer Science Department, College of Computer & Information Sciences, King Saud University, Riyadh, Saudi Arabia.
Jazyk: angličtina
Zdroj: TheScientificWorldJournal [ScientificWorldJournal] 2014; Vol. 2014, pp. 612787. Date of Electronic Publication: 2014 Sep 17.
DOI: 10.1155/2014/612787
Abstrakt: This paper presents a novel features mining approach from documents that could not be mined via optical character recognition (OCR). By identifying the intimate relationship between the text and graphical components, the proposed technique pulls out the Start, End, and Exact values for each bar. Furthermore, the word 2-gram and Euclidean distance methods are used to accurately detect and determine plagiarism in bar charts.
Databáze: MEDLINE