The duplicated of partial content detection based on PSO

Qingwei Ye, Dongxing Wu, Yu Zhou, Xiaodong Wang
{"title":"The duplicated of partial content detection based on PSO","authors":"Qingwei Ye, Dongxing Wu, Yu Zhou, Xiaodong Wang","doi":"10.1109/BICTA.2010.5645302","DOIUrl":null,"url":null,"abstract":"It is discussed how to detect the duplicated of partial contents between two documents in this paper. There are some algorithms which can detect similarity among documents. But these algorithms cannot detect the duplicated of partial contents in documents. A new effective algorithm of the duplicated of partial contents detection in documents is put forward in this paper. The new algorithm is using PSO algorithm to search the optimized partial contents which is the most similar in two documents. For PSO algorithm, it provides the encoding of the particles. A new related coefficient of strings is defined for strings similarity. And the new evaluation function of PSO is designed based on the related coefficient function. The hybrid mutation PSO algorithm is used for searching the most similar partial contents quickly and accurately. The simulation experiments indicate that the algorithm can search the most similar partial contents in two documents effectively.","PeriodicalId":302619,"journal":{"name":"2010 IEEE Fifth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA)","volume":"214 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE Fifth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BICTA.2010.5645302","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

It is discussed how to detect the duplicated of partial contents between two documents in this paper. There are some algorithms which can detect similarity among documents. But these algorithms cannot detect the duplicated of partial contents in documents. A new effective algorithm of the duplicated of partial contents detection in documents is put forward in this paper. The new algorithm is using PSO algorithm to search the optimized partial contents which is the most similar in two documents. For PSO algorithm, it provides the encoding of the particles. A new related coefficient of strings is defined for strings similarity. And the new evaluation function of PSO is designed based on the related coefficient function. The hybrid mutation PSO algorithm is used for searching the most similar partial contents quickly and accurately. The simulation experiments indicate that the algorithm can search the most similar partial contents in two documents effectively.
基于粒子群算法的部分内容检测的复制
本文讨论了如何检测两个文档之间部分内容的重复。有一些算法可以检测文档之间的相似度。但是这些算法无法检测到文档中部分内容的重复。提出了一种新的有效的文档部分内容重复检测算法。该算法利用粒子群算法在两个文档中搜索最相似的优化部分内容。对于粒子群算法,提供了粒子的编码。定义了一个新的弦相似度相关系数。在相关系数函数的基础上,设计了新的粒子群评价函数。采用混合变异粒子群算法快速准确地搜索出最相似的部分内容。仿真实验表明,该算法可以有效地搜索到两个文档中最相似的部分内容。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信