Plagiarism Detection Algorithm Model Based on NLP Technology

Ahmed A. Elngar, Mohamed Gamal, Amar Fathy, Basma Moustafa, Omar Mahmoud, M. Shaban
Journal of Cybersecurity and Information Management. DOI: 10.54216/jcim.050104 (https://doi.org/10.54216/jcim.050104)
Citations: 1

Abstract

Many of us have probably plagiarized a text without realizing that it was plagiarism. Plagiarism can occur in articles, papers, research, literature, music, software, scientific work, newspapers, websites, master's and PhD theses, and many other fields, and it has become a serious problem for teachers, researchers, and publishers. Opinions diverge on how to define plagiarism and on what makes it serious. Detecting plagiarism is therefore very important. In this survey we explain the concept of "plagiarism", give an overview of the software and tools available for addressing it, and discuss the plagiarism process, its types, and detection methodologies. Briefly, plagiarism can be defined as follows: someone uses someone else's intellectual product (such as their texts, ideas, or privacy). We suggest that what makes plagiarism so reprehensible is that it distorts scientific credit. In addition, intentional plagiarism indicates dishonesty. Moreover, plagiarism has a number of possible negative consequences. We therefore create a framework for external plagiarism detection in which several NLP processes are applied to a set of suspicious and original documents. We classify the different plagiarism detection techniques as based on lexical, semantic, syntactic, and grammar analysis algorithms, all of which are preceded by NLP processing.
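To make the idea of external detection concrete, the following is a minimal sketch of one lexical approach, not the authors' framework: each document goes through a simple NLP preprocessing step (lowercasing, tokenization, stop-word removal) and is then compared to the others by the Jaccard overlap of word 3-grams. The stop-word list, the function names, and the choice of Jaccard similarity are all assumptions made for illustration; the abstract does not specify them.

```python
import re

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch).
STOP_WORDS = {"a", "an", "the", "of", "in", "on", "and", "or", "is", "are", "to", "it"}


def preprocess(text):
    """Lowercase the text, split it into alphanumeric tokens, drop stop words."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def shingles(tokens, n=3):
    """Return the set of word n-grams (as tuples) found in a token list."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def rank_sources(suspicious, sources, n=3):
    """Score a suspicious text against each original document.

    `sources` maps a document name to its text. The result is a list of
    (name, similarity) pairs, most similar first.
    """
    susp = shingles(preprocess(suspicious), n)
    scores = {
        name: jaccard(susp, shingles(preprocess(text), n))
        for name, text in sources.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

Calling `rank_sources(suspicious_text, {"doc1": text1, "doc2": text2})` returns the original documents ordered by lexical overlap with the suspicious one. Semantic, syntactic, or grammar-based techniques would replace the `shingles` and `jaccard` steps while keeping the same preprocessing stage in front.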