{"title":"Figure plagiarism detection based on textual features representation","authors":"T. Eisa, N. Salim, Salha M. Alzahrani","doi":"10.1109/ICT-ISPC.2017.8075305","DOIUrl":null,"url":null,"abstract":"In an academic environment, plagiarism is the process of copying someone else's text, idea or data verbatim or without due recognition of the source, which is a serious academic offence. Many techniques have been proposed in the literature for detecting plagiarism in texts, but only a few techniques exist for detecting figure plagiarism. The main problem associated with existing techniques of plagiarism detection is that they are not applicable to non-textual elements of figures in research publications. This paper focuses on detecting plagiarism in scientific figures. Textual-reference representation based figure plagiarism detection techniques are proposed and evaluated, based on existing limitations. The proposed techniques use enhanced feature extraction such as textual features and similarity computation methods such as similarity based on textual-reference of figures. The enhanced feature extraction method was found to be capable of extracting textual references such as captions and description texts. The similarity detection method was capable of categorising a given figure as either plagiarised or non-plagiarised from a source collection of scientific publications, depending on a certain threshold value. Results showed that the proposed technique achieved precision=0.78 and recall=0.67 result in terms of the evaluation measure.","PeriodicalId":377665,"journal":{"name":"2017 6th ICT International Student Project Conference (ICT-ISPC)","volume":"535 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 6th ICT International Student Project Conference (ICT-ISPC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICT-ISPC.2017.8075305","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In an academic environment, plagiarism is the process of copying someone else's text, idea or data verbatim or without due recognition of the source, which is a serious academic offence. Many techniques have been proposed in the literature for detecting plagiarism in texts, but only a few techniques exist for detecting figure plagiarism. The main problem associated with existing techniques of plagiarism detection is that they are not applicable to non-textual elements of figures in research publications. This paper focuses on detecting plagiarism in scientific figures. Textual-reference representation based figure plagiarism detection techniques are proposed and evaluated, based on existing limitations. The proposed techniques use enhanced feature extraction such as textual features and similarity computation methods such as similarity based on textual-reference of figures. The enhanced feature extraction method was found to be capable of extracting textual references such as captions and description texts. The similarity detection method was capable of categorising a given figure as either plagiarised or non-plagiarised from a source collection of scientific publications, depending on a certain threshold value. Results showed that the proposed technique achieved precision=0.78 and recall=0.67 result in terms of the evaluation measure.