代码可视化的抄袭检测

D. Bernhauer
{"title":"代码可视化的抄袭检测","authors":"D. Bernhauer","doi":"10.1109/SNAMS58071.2022.10062664","DOIUrl":null,"url":null,"abstract":"The use of deep convolutional neural networks is very common in computer graphics. With this, methods for exploiting knowledge in other fields are also developing. Finding plagiarism among student source codes is challenging, especially when students have the same assignment. In this case, we try to find differences between two semantically identical codes at the level of syntax, approach, or just style. This paper aims to visualize binary codes and verify if it is possible to detect plagiarism using deep convolution neural networks. Using the siamese network, we trained a neural network to evaluate the similarity between the two programs. The training data for our network are the ICPC competition submissions for which we can be confident of their authorship. The overall success rate of our model consistently reaches 75 to 80 % accuracy, which mainly shows that the visualization of inherently non-graphical entities (like source code) can be useful in the application of neural networks designed primarily for graphical purposes.","PeriodicalId":371668,"journal":{"name":"2022 Ninth International Conference on Social Networks Analysis, Management and Security (SNAMS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Code Visualization for Plagiarism Detection\",\"authors\":\"D. Bernhauer\",\"doi\":\"10.1109/SNAMS58071.2022.10062664\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of deep convolutional neural networks is very common in computer graphics. With this, methods for exploiting knowledge in other fields are also developing. Finding plagiarism among student source codes is challenging, especially when students have the same assignment. In this case, we try to find differences between two semantically identical codes at the level of syntax, approach, or just style. This paper aims to visualize binary codes and verify if it is possible to detect plagiarism using deep convolution neural networks. Using the siamese network, we trained a neural network to evaluate the similarity between the two programs. The training data for our network are the ICPC competition submissions for which we can be confident of their authorship. The overall success rate of our model consistently reaches 75 to 80 % accuracy, which mainly shows that the visualization of inherently non-graphical entities (like source code) can be useful in the application of neural networks designed primarily for graphical purposes.\",\"PeriodicalId\":371668,\"journal\":{\"name\":\"2022 Ninth International Conference on Social Networks Analysis, Management and Security (SNAMS)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 Ninth International Conference on Social Networks Analysis, Management and Security (SNAMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SNAMS58071.2022.10062664\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Ninth International Conference on Social Networks Analysis, Management and Security (SNAMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SNAMS58071.2022.10062664","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

深度卷积神经网络的使用在计算机图形学中非常普遍。与此同时,其他领域的知识开发方法也在不断发展。在学生的源代码中发现抄袭是很有挑战性的,尤其是当学生有相同的作业时。在这种情况下,我们试图在语法、方法或风格级别上找到两个语义相同的代码之间的差异。本文旨在可视化二进制代码,并验证是否有可能使用深度卷积神经网络检测剽窃。使用暹罗网络,我们训练了一个神经网络来评估两个程序之间的相似性。我们网络的训练数据是ICPC竞赛提交的,我们可以确信它们是作者。我们的模型的总体成功率始终达到75%到80%的准确率,这主要表明固有的非图形实体(如源代码)的可视化在主要为图形目的设计的神经网络的应用中是有用的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Code Visualization for Plagiarism Detection
The use of deep convolutional neural networks is very common in computer graphics. With this, methods for exploiting knowledge in other fields are also developing. Finding plagiarism among student source codes is challenging, especially when students have the same assignment. In this case, we try to find differences between two semantically identical codes at the level of syntax, approach, or just style. This paper aims to visualize binary codes and verify if it is possible to detect plagiarism using deep convolution neural networks. Using the siamese network, we trained a neural network to evaluate the similarity between the two programs. The training data for our network are the ICPC competition submissions for which we can be confident of their authorship. The overall success rate of our model consistently reaches 75 to 80 % accuracy, which mainly shows that the visualization of inherently non-graphical entities (like source code) can be useful in the application of neural networks designed primarily for graphical purposes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信