Information Retrieval Based on Word Semantic Clustering

Chia-Yang Chang, Yan-Ting Lin, Shie-Jue Lee, Chih-Chin Lai
{"title":"Information Retrieval Based on Word Semantic Clustering","authors":"Chia-Yang Chang, Yan-Ting Lin, Shie-Jue Lee, Chih-Chin Lai","doi":"10.1109/CISP-BMEI.2018.8633017","DOIUrl":null,"url":null,"abstract":"Information retrieval is an important topic in the modern age. With the advance of Internet, it is more and more easy to retrieve other people's writings or publications. However, how to retrieve desirable information efficiently is a challenging work. Traditional methods like vector space model or bag-of-words are short of providing a good solution due to the incapability of handling the semantics of words satisfactorily. In this paper, we propose a new method for information retrieval. We use Word2vec to transform the words into word vectors which are able to represent the semantic relationship among different words. By considering the semantic of words and clustering the word vectors into concepts, information retrieval can be done effectively.","PeriodicalId":117227,"journal":{"name":"2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","volume":"83 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 11th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISP-BMEI.2018.8633017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Information retrieval is an important topic in the modern age. With the advance of Internet, it is more and more easy to retrieve other people's writings or publications. However, how to retrieve desirable information efficiently is a challenging work. Traditional methods like vector space model or bag-of-words are short of providing a good solution due to the incapability of handling the semantics of words satisfactorily. In this paper, we propose a new method for information retrieval. We use Word2vec to transform the words into word vectors which are able to represent the semantic relationship among different words. By considering the semantic of words and clustering the word vectors into concepts, information retrieval can be done effectively.
基于词语义聚类的信息检索
信息检索是当今时代的一个重要课题。随着互联网的发展,检索他人的著作或出版物变得越来越容易。然而,如何有效地检索需要的信息是一项具有挑战性的工作。传统的方法如向量空间模型、词袋模型等,由于不能很好地处理词的语义,不能很好地解决问题。本文提出了一种新的信息检索方法。我们使用Word2vec将单词转换成能够表示不同单词之间语义关系的单词向量。通过考虑词的语义,将词向量聚类成概念,可以有效地进行信息检索。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信