Random Walk-Based Semantic Annotation for On-demand Printing Products

Mingxi Zhang, Guanying Su
{"title":"Random Walk-Based Semantic Annotation for On-demand Printing Products","authors":"Mingxi Zhang, Guanying Su","doi":"10.11648/J.AJNNA.20190501.15","DOIUrl":null,"url":null,"abstract":"Nowadays, the scale of real network is increasing day by day, while also brings sparse problems. It is usually necessary to maintain a large number of product information. To organize this product information, a feasible way is to add semantic tags to the information. In this article, we aim to solve the problem of semantic annotation of on-demand printing products. Based on good properties of random walk in global networks, we deal with the sparsity problem by applying it, and then propose an efficient ProRWR algorithm. Firstly, it processes the text description dataset of printed products based on TF-IDF algorithm, and builds “product-term” bipartite network. Secondly, ProRWR builds square matrix using the TF-IDF weight matrix, rewrite the equation of random walk, and use the normalized square matrix as the input of rewrite ProRWR algorithm. By random walks, terms with the highest convergence probability in each product document are selected as the most relevant feature terms of the product. A large number of experiments have been done on Amazon dataset. The results show that the precision and recall of our algorithm are 73.5% and 60%, respectively, indicating that ProRWR has discovered the potential semantic association and implemented the semantic annotation of on-demand printed products.","PeriodicalId":325288,"journal":{"name":"American Journal of Neural Networks and Applications","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Neural Networks and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.11648/J.AJNNA.20190501.15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Nowadays, the scale of real network is increasing day by day, while also brings sparse problems. It is usually necessary to maintain a large number of product information. To organize this product information, a feasible way is to add semantic tags to the information. In this article, we aim to solve the problem of semantic annotation of on-demand printing products. Based on good properties of random walk in global networks, we deal with the sparsity problem by applying it, and then propose an efficient ProRWR algorithm. Firstly, it processes the text description dataset of printed products based on TF-IDF algorithm, and builds “product-term” bipartite network. Secondly, ProRWR builds square matrix using the TF-IDF weight matrix, rewrite the equation of random walk, and use the normalized square matrix as the input of rewrite ProRWR algorithm. By random walks, terms with the highest convergence probability in each product document are selected as the most relevant feature terms of the product. A large number of experiments have been done on Amazon dataset. The results show that the precision and recall of our algorithm are 73.5% and 60%, respectively, indicating that ProRWR has discovered the potential semantic association and implemented the semantic annotation of on-demand printed products.
基于随机行走的按需印刷产品语义标注
如今,真实网络的规模日益增大,同时也带来了稀疏问题。通常需要维护大量的产品信息。为了组织这些产品信息,一种可行的方法是在信息中添加语义标记。本文旨在解决按需印刷产品的语义标注问题。基于随机漫步在全局网络中的良好特性,将其应用于稀疏性问题,提出了一种高效的proorwr算法。首先,基于TF-IDF算法对印刷品文本描述数据集进行处理,构建“产品-术语”二部网络;其次,proorwr利用TF-IDF权矩阵构建方阵,重写随机游走方程,并将归一化方阵作为重写proorwr算法的输入。通过随机漫步,在每个产品文档中选择收敛概率最高的项作为产品最相关的特征项。在Amazon数据集上进行了大量的实验。结果表明,算法的准确率和召回率分别为73.5%和60%,表明proorwr已经发现了潜在的语义关联,并实现了按需印刷产品的语义标注。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信