Authors: Michal Duracík, Emil Krsák, Patrik Hrkút
Venue: 2018 16th International Conference on Emerging eLearning Technologies and Applications (ICETA)
Published: 2018-11-01
DOI: 10.1109/ICETA.2018.8572260 (https://doi.org/10.1109/ICETA.2018.8572260)
Issues with the Detection of Plagiarism in Programming Courses on a Larger Scale
Informatics and programming are currently among the most popular areas of study. The field evolves so quickly that the Internet has become one of the main sources students use during their studies. The variety of resources available online often encourages students to copy material into their own work and thereby commit plagiarism. In this paper, we focus on plagiarism problems in computer science teaching. We compare available tools for detecting plagiarism in source code with an algorithm that we propose. As the test dataset, we use semester projects from courses taught at our university. For this comparison, the proposed algorithm must be customized so that it can handle Java source code. We focus on the accuracy and completeness of each algorithm. We also attempt to eliminate the spurious detection of automatically generated code as plagiarism.