{"title":"A hybrid approach (syntactic and textual) to clone detection","authors":"Marco Funaro, Daniele Braga, A. Campi, C. Ghezzi","doi":"10.1145/1808901.1808914","DOIUrl":null,"url":null,"abstract":"Code clone detection has been so far tackled with several approaches (mainly textual, syntactic, and semantic), each performing best w.r.t. specific metrics and against consolidated but specific benchmarks. Hybrid approaches have been recently proposed as well. This paper proposes a novel hybrid (syntactic, textual) approach using the abstract syntax tree to identify clone candidates and textual methods to discard false positives. The novelty of the approach is in the combination of two well-grounded techniques, in a way that has not been explored yet.\n The paper shows the result of empirical evaluation methods and hints of a prototype implementation. Initial experimental results show that the approach is effective and can find clones that were not detected by state of the art approaches.","PeriodicalId":374295,"journal":{"name":"International Workshop on Software Clones","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Workshop on Software Clones","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1808901.1808914","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 19
Abstract
Code clone detection has been so far tackled with several approaches (mainly textual, syntactic, and semantic), each performing best w.r.t. specific metrics and against consolidated but specific benchmarks. Hybrid approaches have been recently proposed as well. This paper proposes a novel hybrid (syntactic, textual) approach using the abstract syntax tree to identify clone candidates and textual methods to discard false positives. The novelty of the approach is in the combination of two well-grounded techniques, in a way that has not been explored yet.
The paper shows the result of empirical evaluation methods and hints of a prototype implementation. Initial experimental results show that the approach is effective and can find clones that were not detected by state of the art approaches.