Kitsune: Structurally Aware and Adaptable Plagiarism Detection

Zachary Monroe, A. Bansal
{"title":"Kitsune: Structurally Aware and Adaptable Plagiarism Detection","authors":"Zachary Monroe, A. Bansal","doi":"10.1109/FIE49875.2021.9637364","DOIUrl":null,"url":null,"abstract":"Plagiarism is a huge problem in a learning environment. In programming classes especially, plagiarism can be hard to detect as source codes' appearance can be easily modified without changing the intent through simple formatting changes or refactoring. Many source code plagiarism tools do not support a high number of languages because doing so requires maintaining too large of a codebase. It is also difficult to add support for new languages because each language can be vastly different syntactically. Tools that are more extensible often do so by reducing the features of a language that are encoded and end up closer to text comparison tools than structurally aware program analysis tools [27]. This paper introduces a new tool called Kitsune, a plagiarism detection tool, focused on syntactically and structurally aware yet adaptable plagiarism detection. Kitsune has been evaluated for 10 of the languages in the Antlr4 grammar repository with success and could easily be extended to support all the grammars currently developed by Antlr4 or future grammars which are developed as new languages are written.","PeriodicalId":408497,"journal":{"name":"2021 IEEE Frontiers in Education Conference (FIE)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Frontiers in Education Conference (FIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FIE49875.2021.9637364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Plagiarism is a huge problem in a learning environment. In programming classes especially, plagiarism can be hard to detect as source codes' appearance can be easily modified without changing the intent through simple formatting changes or refactoring. Many source code plagiarism tools do not support a high number of languages because doing so requires maintaining too large of a codebase. It is also difficult to add support for new languages because each language can be vastly different syntactically. Tools that are more extensible often do so by reducing the features of a language that are encoded and end up closer to text comparison tools than structurally aware program analysis tools [27]. This paper introduces a new tool called Kitsune, a plagiarism detection tool, focused on syntactically and structurally aware yet adaptable plagiarism detection. Kitsune has been evaluated for 10 of the languages in the Antlr4 grammar repository with success and could easily be extended to support all the grammars currently developed by Antlr4 or future grammars which are developed as new languages are written.
Kitsune:结构意识和适应性抄袭检测
抄袭是学习环境中的一个大问题。特别是在编程课上,抄袭很难被发现,因为源代码的外观可以很容易地修改,而不需要通过简单的格式更改或重构来改变意图。许多源代码抄袭工具不支持大量的语言,因为这样做需要维护太大的代码库。添加对新语言的支持也很困难,因为每种语言在语法上可能有很大的不同。可扩展性更强的工具通常通过减少编码语言的特征来实现,最终更接近于文本比较工具,而不是结构感知的程序分析工具[27]。本文介绍了一种新的抄袭检测工具Kitsune,该工具专注于句法和结构感知且适应性强的抄袭检测。Kitsune已经成功地为Antlr4语法库中的10种语言进行了评估,并且可以很容易地扩展到支持Antlr4当前开发的所有语法,或者随着新语言的编写而开发的未来语法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信