posted on 2025-05-09, 19:04authored byOscar Karnalim, Simon
When using code similarity detection to uncover code plagiarism and collusion, the marker needs to determine whether any detected similarities might be the result of coincidence. But understanding the similarities can be difficult and might be prone to human error, because few tools facilitate the investigation process, and if they do, the similarities are not explicitly explained in human language. This paper presents STRANGE, an investigation module that exclusively explains code similarities in natural language (English and Indonesian). For the purpose of reusability, STRANGE can be embedded in JPlag and other code similarity detection tools. It can also act as a standalone tool for measuring source code similarity. Our evaluation shows that STRANGE is more helpful than JPlag in the investigation process since it explains the similarities in natural language. Further, its effectiveness is comparable to that of JPlag but higher on trivial disguises of the sort that novice students will tend to apply when disguising copied code.
History
Journal title
IEEE Access
Volume
9
Issue
2021
Pagination
59935-59948
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Language
en, English
College/Research Centre
College of Engineering, Science and Environment
School
School of Electrical Engineering and Computer Science
Rights statement
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/