{"title":"Kitsune:结构意识和适应性抄袭检测","authors":"Zachary Monroe, A. Bansal","doi":"10.1109/FIE49875.2021.9637364","DOIUrl":null,"url":null,"abstract":"Plagiarism is a huge problem in a learning environment. In programming classes especially, plagiarism can be hard to detect as source codes' appearance can be easily modified without changing the intent through simple formatting changes or refactoring. Many source code plagiarism tools do not support a high number of languages because doing so requires maintaining too large of a codebase. It is also difficult to add support for new languages because each language can be vastly different syntactically. Tools that are more extensible often do so by reducing the features of a language that are encoded and end up closer to text comparison tools than structurally aware program analysis tools [27]. This paper introduces a new tool called Kitsune, a plagiarism detection tool, focused on syntactically and structurally aware yet adaptable plagiarism detection. Kitsune has been evaluated for 10 of the languages in the Antlr4 grammar repository with success and could easily be extended to support all the grammars currently developed by Antlr4 or future grammars which are developed as new languages are written.","PeriodicalId":408497,"journal":{"name":"2021 IEEE Frontiers in Education Conference (FIE)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Kitsune: Structurally Aware and Adaptable Plagiarism Detection\",\"authors\":\"Zachary Monroe, A. Bansal\",\"doi\":\"10.1109/FIE49875.2021.9637364\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Plagiarism is a huge problem in a learning environment. In programming classes especially, plagiarism can be hard to detect as source codes' appearance can be easily modified without changing the intent through simple formatting changes or refactoring. Many source code plagiarism tools do not support a high number of languages because doing so requires maintaining too large of a codebase. It is also difficult to add support for new languages because each language can be vastly different syntactically. Tools that are more extensible often do so by reducing the features of a language that are encoded and end up closer to text comparison tools than structurally aware program analysis tools [27]. This paper introduces a new tool called Kitsune, a plagiarism detection tool, focused on syntactically and structurally aware yet adaptable plagiarism detection. Kitsune has been evaluated for 10 of the languages in the Antlr4 grammar repository with success and could easily be extended to support all the grammars currently developed by Antlr4 or future grammars which are developed as new languages are written.\",\"PeriodicalId\":408497,\"journal\":{\"name\":\"2021 IEEE Frontiers in Education Conference (FIE)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE Frontiers in Education Conference (FIE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FIE49875.2021.9637364\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Frontiers in Education Conference (FIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FIE49875.2021.9637364","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Kitsune: Structurally Aware and Adaptable Plagiarism Detection
Plagiarism is a huge problem in a learning environment. In programming classes especially, plagiarism can be hard to detect as source codes' appearance can be easily modified without changing the intent through simple formatting changes or refactoring. Many source code plagiarism tools do not support a high number of languages because doing so requires maintaining too large of a codebase. It is also difficult to add support for new languages because each language can be vastly different syntactically. Tools that are more extensible often do so by reducing the features of a language that are encoded and end up closer to text comparison tools than structurally aware program analysis tools [27]. This paper introduces a new tool called Kitsune, a plagiarism detection tool, focused on syntactically and structurally aware yet adaptable plagiarism detection. Kitsune has been evaluated for 10 of the languages in the Antlr4 grammar repository with success and could easily be extended to support all the grammars currently developed by Antlr4 or future grammars which are developed as new languages are written.