Efficient approach to detect logical equivalence in the paradigm of software plagiarism
Rupayan Dutta
Proceedings of the 2015 Third International Conference on Computer, Communication, Control and Information Technology (C3IT), 2015-03-16
DOI: 10.1109/C3IT.2015.7060114 (https://doi.org/10.1109/C3IT.2015.7060114)
Citations: 2
Abstract
Software plagiarism is one of the major obstacles that complicate the development and deployment of software systems. Legal complications arising from the use of copyright-protected code have far-reaching effects. The challenge is to detect logical equivalence in code that otherwise appears unmatched. This paper describes a novel approach that detects plagiarism only in the logical statements of the code; that is, it ignores copy-pasting in inclusion statements (e.g. #include in C/C++, import in Java), macro definitions (e.g. #define in C/C++), declaration statements, and commented-out code. It also accounts for possible cosmetic changes to the code, e.g. variable renaming, data type alterations, inserted code snippets, or minor modifications. The method performs the comparison in an efficient, minimalistic way, making it scalable to large code bases.
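The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea, not the paper's method. It assumes C-like input with one statement per line, drops comments, inclusion statements, macro definitions and pure declarations, maps every identifier to a placeholder, discards type names, and compares the resulting statement sets with a Jaccard score. All names here (`normalize`, `similarity`, `TYPE_NAMES`, `KEYWORDS`) and the choice of Jaccard similarity are illustrative assumptions.

```python
import re

# Assumption: a small, hand-picked subset of C keywords and built-in types.
KEYWORDS = {"if", "else", "for", "while", "do", "return",
            "switch", "case", "break", "continue", "sizeof"}
TYPE_NAMES = {"int", "long", "short", "float", "double",
              "char", "unsigned", "signed", "void"}

TOKEN_RE = re.compile(r"[A-Za-z_]\w*|\d+(?:\.\d+)?|\S")


def strip_comments(source):
    """Remove /* ... */ and // ... comments (string literals are not handled)."""
    source = re.sub(r"/\*.*?\*/", " ", source, flags=re.DOTALL)
    return re.sub(r"//[^\n]*", " ", source)


def normalize(source):
    """Return the set of normalized logical statements in `source`."""
    statements = set()
    for line in strip_comments(source).splitlines():
        line = line.strip()
        # Skip blank lines, inclusion statements and macro definitions.
        if not line or line.startswith(("#include", "#define", "import ")):
            continue
        tokens = TOKEN_RE.findall(line)
        # Skip pure declarations such as "int i;" (no initializer, no call).
        if tokens[0] in TYPE_NAMES and "=" not in tokens and "(" not in tokens:
            continue
        normalized = []
        for tok in tokens:
            if tok in TYPE_NAMES:
                continue  # data type alterations should not matter
            if re.match(r"[A-Za-z_]", tok) and tok not in KEYWORDS:
                tok = "ID"  # variable renaming should not matter
            normalized.append(tok)
        if normalized:
            statements.add(" ".join(normalized))
    return statements


def similarity(source_a, source_b):
    """Jaccard similarity of the two normalized statement sets."""
    a, b = normalize(source_a), normalize(source_b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


original = """#include <stdio.h>
#define N 10
int sum(int *a, int n) {
    int total = 0;  // accumulator
    int i;
    for (i = 0; i < n; i++) {
        total += a[i];
    }
    return total;
}
"""

suspect = """#include <stdlib.h>
/* totally original work */
long add_all(long *vals, long count) {
    long acc = 0;
    long k;
    for (k = 0; k < count; k++) {
        acc += vals[k];
    }
    return acc;
}
"""

print(similarity(original, suspect))
```

In this toy pair the suspect differs only in identifier names, data types, the included header and a comment, so both sources normalize to the same statement set. A realistic detector would need a real parser, multi-line statement handling, and tolerance for inserted snippets, none of which this sketch attempts.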