利用自由文本指纹分析进行抄袭检测

Mohamed Elkhidir, Mohannad Ibrahim, T. A. Khalid, S. Ibrahim, M. Awadalla
{"title":"利用自由文本指纹分析进行抄袭检测","authors":"Mohamed Elkhidir, Mohannad Ibrahim, T. A. Khalid, S. Ibrahim, M. Awadalla","doi":"10.1109/WSCNIS.2015.7368306","DOIUrl":null,"url":null,"abstract":"Plagiarism generally defined as using other people's ideas or work and representing it as one's own original work. Free-text plagiarism detection is an application based on analyzing the texts contained in researches, thesis, scientific reports and also literary products, these analyzed data will be used to compare a group of documents to find out how much these documents are similar. This paper proposes a Free Text Plagiarism Detection Software (FTPDS); which is a software tool that uses documents' fingerprints to detect the likelihood that the documents are plagiarized from each other. The system is able to detect plagiarism between two given documents, given document and group of local documents, and between given document and online available documents. Agile software methodology was used to develop the software and some open source libraries were manipulated and used to search the internet and read PDF documents respectively. The speed of the detection process, the inaccurate detection of the same file and the lag of online search and downloading are stated as future work aspects. Source in this paper means the suspected document which we want to detect the amount of plagiarized data contained in it. The target is the document which is probably the document where the author plagiarized the data from it and claimed that he\\she owns that data.","PeriodicalId":253256,"journal":{"name":"2015 World Symposium on Computer Networks and Information Security (WSCNIS)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Plagiarism detection using free-text fingerprint analysis\",\"authors\":\"Mohamed Elkhidir, Mohannad Ibrahim, T. A. Khalid, S. Ibrahim, M. Awadalla\",\"doi\":\"10.1109/WSCNIS.2015.7368306\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Plagiarism generally defined as using other people's ideas or work and representing it as one's own original work. Free-text plagiarism detection is an application based on analyzing the texts contained in researches, thesis, scientific reports and also literary products, these analyzed data will be used to compare a group of documents to find out how much these documents are similar. This paper proposes a Free Text Plagiarism Detection Software (FTPDS); which is a software tool that uses documents' fingerprints to detect the likelihood that the documents are plagiarized from each other. The system is able to detect plagiarism between two given documents, given document and group of local documents, and between given document and online available documents. Agile software methodology was used to develop the software and some open source libraries were manipulated and used to search the internet and read PDF documents respectively. The speed of the detection process, the inaccurate detection of the same file and the lag of online search and downloading are stated as future work aspects. Source in this paper means the suspected document which we want to detect the amount of plagiarized data contained in it. The target is the document which is probably the document where the author plagiarized the data from it and claimed that he\\\\she owns that data.\",\"PeriodicalId\":253256,\"journal\":{\"name\":\"2015 World Symposium on Computer Networks and Information Security (WSCNIS)\",\"volume\":\"32 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 World Symposium on Computer Networks and Information Security (WSCNIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WSCNIS.2015.7368306\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 World Symposium on Computer Networks and Information Security (WSCNIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WSCNIS.2015.7368306","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

摘要

抄袭通常被定义为使用他人的想法或作品并将其作为自己的原创作品。自由文本抄袭检测是一种基于分析研究,论文,科学报告和文学产品中包含的文本的应用程序,这些分析的数据将用于比较一组文档,以找出这些文档的相似程度。本文提出了一种自由文本抄袭检测软件(FTPDS);这是一种软件工具,它使用文件的指纹来检测文件相互抄袭的可能性。该系统能够检测两个给定文档之间的抄袭,给定文档和一组本地文档之间的抄袭,以及给定文档和在线可用文档之间的抄袭。软件的开发采用了敏捷软件方法,并对一些开源库进行了操作,分别用于互联网搜索和PDF文档的阅读。检测过程的速度、同一文件的检测不准确以及在线搜索和下载的滞后是今后的工作方向。本文中的来源是指我们要检测其中所包含的剽窃数据量的可疑文件。目标是文件,这可能是作者剽窃其中的数据并声称他/她拥有该数据的文件。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Plagiarism detection using free-text fingerprint analysis
Plagiarism generally defined as using other people's ideas or work and representing it as one's own original work. Free-text plagiarism detection is an application based on analyzing the texts contained in researches, thesis, scientific reports and also literary products, these analyzed data will be used to compare a group of documents to find out how much these documents are similar. This paper proposes a Free Text Plagiarism Detection Software (FTPDS); which is a software tool that uses documents' fingerprints to detect the likelihood that the documents are plagiarized from each other. The system is able to detect plagiarism between two given documents, given document and group of local documents, and between given document and online available documents. Agile software methodology was used to develop the software and some open source libraries were manipulated and used to search the internet and read PDF documents respectively. The speed of the detection process, the inaccurate detection of the same file and the lag of online search and downloading are stated as future work aspects. Source in this paper means the suspected document which we want to detect the amount of plagiarized data contained in it. The target is the document which is probably the document where the author plagiarized the data from it and claimed that he\she owns that data.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信