Duplicate and Plagiarism Search in Program Code Using Suffix Trees Over Compiled Code
Autor: | Igor Andrianov, Svetlana Rzheutskaya, Alexey Sukonschikov, Dmitry Kochkin, Anatoly Shvetsov, Arseny Sorokin |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2020 |
Předmět: | |
Zdroj: | Proceedings of the XXth Conference of Open Innovations Association FRUCT, Vol 26, Iss 1, Pp 16-22 (2020) |
Druh dokumentu: | article |
ISSN: | 2305-7254 2343-0737 |
DOI: | 10.23919/FRUCT48808.2020.9087465 |
Popis: | The search for duplicate source code allow both to improve the quality of the software being developed and to detect plagiarism. In this paper, it is proposed to use a set of features of modern optimizing compilers to simplify and reduce this task to a search by similarity of text fragments. In this case, many types of cosmetic changes in code do not affect the search result. In order to effectively search by similarity, we use sparse suffix trees built on binary encoded data. Algorithms for constructing such a tree and performing a search are presented. The application of the results to detect cheating in a distance programming workshop is described. |
Databáze: | Directory of Open Access Journals |
Externí odkaz: |