{"title":"Chinese keyword extraction based on weighted complex network","authors":"Yin-feng Liang","doi":"10.1109/ISKE.2017.8258737","DOIUrl":null,"url":null,"abstract":"Aiming at the problem of low precision of keyword extraction in traditional complex network method, we propose a keyword extraction method based on an improved weighted complex network, called IWCN algorithm. First, based on the word semantic similarity, we construct a complex network to obtain semantic weight of words. Next, the statistical weight of words is obtained by the introduction of term frequency (TF) and inverse document frequency (IDF). Finally, we combine semantic and statistical weights of words to get keywords. Comparing to traditional complex network approach, the proposed method can avoid the deviations and thus improves extraction accuracy. Simulation results shows that the proposed method achieves higher precision and recall.","PeriodicalId":208009,"journal":{"name":"2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)","volume":"262 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 12th International Conference on Intelligent Systems and Knowledge Engineering (ISKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISKE.2017.8258737","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Aiming at the problem of low precision of keyword extraction in traditional complex network method, we propose a keyword extraction method based on an improved weighted complex network, called IWCN algorithm. First, based on the word semantic similarity, we construct a complex network to obtain semantic weight of words. Next, the statistical weight of words is obtained by the introduction of term frequency (TF) and inverse document frequency (IDF). Finally, we combine semantic and statistical weights of words to get keywords. Comparing to traditional complex network approach, the proposed method can avoid the deviations and thus improves extraction accuracy. Simulation results shows that the proposed method achieves higher precision and recall.