Semantic tracking and recommendation using fourfold similarity measure from large scale data using hadoop distributed framework in cloud

IF 0.8 Q4 ROBOTICS
R. Priyadarshini, L. Tamilselvan, N. Rajendran
{"title":"Semantic tracking and recommendation using fourfold similarity measure from large scale data using hadoop distributed framework in cloud","authors":"R. Priyadarshini, L. Tamilselvan, N. Rajendran","doi":"10.1108/ijius-07-2019-0030","DOIUrl":null,"url":null,"abstract":"\nPurpose\nThe purpose of this paper is to propose a fourfold semantic similarity that results in more accuracy compared to the existing literature. The change detection in the URL and the recommendation of the source documents is facilitated by means of a framework in which the fourfold semantic similarity is implied. The latest trends in technology emerge with the continuous growth of resources on the collaborative web. This interactive and collaborative web pretense big challenges in recent technologies like cloud and big data.\n\n\nDesign/methodology/approach\nThe enormous growth of resources should be accessed in a more efficient manner, and this requires clustering and classification techniques. The resources on the web are described in a more meaningful manner.\n\n\nFindings\nIt can be descripted in the form of metadata that is constituted by resource description framework (RDF). Fourfold similarity is proposed compared to three-fold similarity proposed in the existing literature. The fourfold similarity includes the semantic annotation based on the named entity recognition in the user interface, domain-based concept matching and improvised score-based classification of domain-based concept matching based on ontology, sequence-based word sensing algorithm and RDF-based updating of triples. The aggregation of all these similarity measures including the components such as semantic user interface, semantic clustering, and sequence-based classification and semantic recommendation system with RDF updating in change detection.\n\n\nResearch limitations/implications\nThe existing work suggests that linking resources semantically increases the retrieving and searching ability. Previous literature shows that keywords can be used to retrieve linked information from the article to determine the similarity between the documents using semantic analysis.\n\n\nPractical implications\nThese traditional systems also lack in scalability and efficiency issues. The proposed study is to design a model that pulls and prioritizes knowledge-based content from the Hadoop distributed framework. This study also proposes the Hadoop-based pruning system and recommendation system.\n\n\nSocial implications\nThe pruning system gives an alert about the dynamic changes in the article (virtual document). The changes in the document are automatically updated in the RDF document. This helps in semantic matching and retrieval of the most relevant source with the virtual document.\n\n\nOriginality/value\nThe recommendation and detection of changes in the blogs are performed semantically using n-triples and automated data structures. User-focussed and choice-based crawling that is proposed in this system also assists the collaborative filtering. Consecutively collaborative filtering recommends the user focussed source documents. The entire clustering and retrieval system is deployed in multi-node Hadoop in the Amazon AWS environment and graphs are plotted and analyzed.\n","PeriodicalId":42876,"journal":{"name":"International Journal of Intelligent Unmanned Systems","volume":" ","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2019-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1108/ijius-07-2019-0030","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Intelligent Unmanned Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1108/ijius-07-2019-0030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ROBOTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose The purpose of this paper is to propose a fourfold semantic similarity that results in more accuracy compared to the existing literature. The change detection in the URL and the recommendation of the source documents is facilitated by means of a framework in which the fourfold semantic similarity is implied. The latest trends in technology emerge with the continuous growth of resources on the collaborative web. This interactive and collaborative web pretense big challenges in recent technologies like cloud and big data. Design/methodology/approach The enormous growth of resources should be accessed in a more efficient manner, and this requires clustering and classification techniques. The resources on the web are described in a more meaningful manner. Findings It can be descripted in the form of metadata that is constituted by resource description framework (RDF). Fourfold similarity is proposed compared to three-fold similarity proposed in the existing literature. The fourfold similarity includes the semantic annotation based on the named entity recognition in the user interface, domain-based concept matching and improvised score-based classification of domain-based concept matching based on ontology, sequence-based word sensing algorithm and RDF-based updating of triples. The aggregation of all these similarity measures including the components such as semantic user interface, semantic clustering, and sequence-based classification and semantic recommendation system with RDF updating in change detection. Research limitations/implications The existing work suggests that linking resources semantically increases the retrieving and searching ability. Previous literature shows that keywords can be used to retrieve linked information from the article to determine the similarity between the documents using semantic analysis. Practical implications These traditional systems also lack in scalability and efficiency issues. The proposed study is to design a model that pulls and prioritizes knowledge-based content from the Hadoop distributed framework. This study also proposes the Hadoop-based pruning system and recommendation system. Social implications The pruning system gives an alert about the dynamic changes in the article (virtual document). The changes in the document are automatically updated in the RDF document. This helps in semantic matching and retrieval of the most relevant source with the virtual document. Originality/value The recommendation and detection of changes in the blogs are performed semantically using n-triples and automated data structures. User-focussed and choice-based crawling that is proposed in this system also assists the collaborative filtering. Consecutively collaborative filtering recommends the user focussed source documents. The entire clustering and retrieval system is deployed in multi-node Hadoop in the Amazon AWS environment and graphs are plotted and analyzed.
在云环境下使用hadoop分布式框架对大规模数据进行四倍相似度度量的语义跟踪和推荐
本文的目的是提出一种四倍的语义相似度,与现有文献相比,这种相似度可以提高语义的准确性。URL中的变更检测和源文档的推荐是通过隐含四倍语义相似性的框架来实现的。随着协作网络上资源的不断增长,最新的技术趋势也随之出现。这种互动性和协作性的网络对云计算和大数据等最新技术构成了巨大挑战。设计/方法/方法应该以更有效的方式访问大量增长的资源,这需要聚类和分类技术。网络上的资源以一种更有意义的方式被描述。可以用由资源描述框架(RDF)构成的元数据的形式来描述查找结果。与现有文献中提出的三倍相似度相比,提出了四倍相似度。四重相似度包括基于用户界面命名实体识别的语义标注、基于本体的基于领域的概念匹配和基于即兴得分的领域概念匹配分类、基于序列的词感知算法和基于rdf的三元组更新。所有这些相似度度量的集合包括语义用户界面、语义聚类和基于序列的分类和语义推荐系统等组件,并在变化检测中使用RDF更新。研究局限/启示现有的研究表明,语义上的资源链接提高了检索和搜索能力。以前的文献表明,关键词可以用来检索文章中的链接信息,通过语义分析来确定文档之间的相似度。这些传统系统还缺乏可扩展性和效率问题。提出的研究是设计一个模型,从Hadoop分布式框架中提取和优先考虑基于知识的内容。本研究还提出了基于hadoop的剪枝系统和推荐系统。社会意义修剪系统对文章(虚拟文档)中的动态变化发出警报。文档中的更改在RDF文档中自动更新。这有助于语义匹配和检索与虚拟文档最相关的源。原创性/价值使用n-三元组和自动化数据结构在语义上执行博客更改的推荐和检测。该系统提出的以用户为中心和基于选择的爬行也有助于协同过滤。连续协同过滤推荐用户关注的源文档。整个聚类和检索系统部署在Amazon AWS环境下的多节点Hadoop上,并对图进行了绘制和分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.50
自引率
0.00%
发文量
21
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信