Jinqing Yang , Xingyu Luo , Ruhan Yang , Zhifeng Liu , Shengzhi Huang
{"title":"不同同义词如何影响基于细粒度知识共现网络的实验结果","authors":"Jinqing Yang , Xingyu Luo , Ruhan Yang , Zhifeng Liu , Shengzhi Huang","doi":"10.1016/j.ipm.2025.104311","DOIUrl":null,"url":null,"abstract":"<div><div>Despite efforts to reduce the effects of dissimilar synonyms on the construction of knowledge networks, few researchers have examined the extent to which it affects the results of experiments. In this work, we developed a multi-tiered comparative analysis framework to investigate how the dissimilar synonym issue influences the topology structure and functional dynamics of knowledge networks. Specifically, <em>Pearson</em> correlation analysis was performed to quantify the relationship between topology structure variables in the ontology knowledge network and their counterparts in the raw term-based network. Subsequently, we applied the Levenshtein distance algorithm to assess sequence dissimilarity in the ordinal sequences between variable pairs. Finally, we calculated the difference between the topology variables of the same knowledge node in the two networks. To evaluate the effect of the dissimilar synonym issue, we applied our framework to the scenario of knowledge impact prediction and ranking. The experimental results show that (1) the similarity values of the ordinal sequences of eigenvector centrality, <em>PageRank</em> coefficient, betweenness centrality, and closeness centrality variables are respectively 0.410, 0.404, 0.342, and 0.407, which means the dissimilar synonym issue has a considerable effect on the topology structure calculations of knowledge networks; and (2) higher-ranked knowledge nodes show lower overlap rates between the raw term-based knowledge network and the ontology knowledge network, suggesting that the dissimilar synonym issue influences the reliability of detecting high-impact knowledge. (3) in the scenario of knowledge impact prediction, the dissimilar synonym issue has minimal effect on the task performance.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"63 1","pages":"Article 104311"},"PeriodicalIF":7.4000,"publicationDate":"2025-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"How dissimilar synonyms affect the results of experiments based on fine-grained knowledge co-occurrence networks\",\"authors\":\"Jinqing Yang , Xingyu Luo , Ruhan Yang , Zhifeng Liu , Shengzhi Huang\",\"doi\":\"10.1016/j.ipm.2025.104311\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Despite efforts to reduce the effects of dissimilar synonyms on the construction of knowledge networks, few researchers have examined the extent to which it affects the results of experiments. In this work, we developed a multi-tiered comparative analysis framework to investigate how the dissimilar synonym issue influences the topology structure and functional dynamics of knowledge networks. Specifically, <em>Pearson</em> correlation analysis was performed to quantify the relationship between topology structure variables in the ontology knowledge network and their counterparts in the raw term-based network. Subsequently, we applied the Levenshtein distance algorithm to assess sequence dissimilarity in the ordinal sequences between variable pairs. Finally, we calculated the difference between the topology variables of the same knowledge node in the two networks. To evaluate the effect of the dissimilar synonym issue, we applied our framework to the scenario of knowledge impact prediction and ranking. The experimental results show that (1) the similarity values of the ordinal sequences of eigenvector centrality, <em>PageRank</em> coefficient, betweenness centrality, and closeness centrality variables are respectively 0.410, 0.404, 0.342, and 0.407, which means the dissimilar synonym issue has a considerable effect on the topology structure calculations of knowledge networks; and (2) higher-ranked knowledge nodes show lower overlap rates between the raw term-based knowledge network and the ontology knowledge network, suggesting that the dissimilar synonym issue influences the reliability of detecting high-impact knowledge. (3) in the scenario of knowledge impact prediction, the dissimilar synonym issue has minimal effect on the task performance.</div></div>\",\"PeriodicalId\":50365,\"journal\":{\"name\":\"Information Processing & Management\",\"volume\":\"63 1\",\"pages\":\"Article 104311\"},\"PeriodicalIF\":7.4000,\"publicationDate\":\"2025-07-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Processing & Management\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306457325002523\",\"RegionNum\":1,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325002523","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
How dissimilar synonyms affect the results of experiments based on fine-grained knowledge co-occurrence networks
Despite efforts to reduce the effects of dissimilar synonyms on the construction of knowledge networks, few researchers have examined the extent to which it affects the results of experiments. In this work, we developed a multi-tiered comparative analysis framework to investigate how the dissimilar synonym issue influences the topology structure and functional dynamics of knowledge networks. Specifically, Pearson correlation analysis was performed to quantify the relationship between topology structure variables in the ontology knowledge network and their counterparts in the raw term-based network. Subsequently, we applied the Levenshtein distance algorithm to assess sequence dissimilarity in the ordinal sequences between variable pairs. Finally, we calculated the difference between the topology variables of the same knowledge node in the two networks. To evaluate the effect of the dissimilar synonym issue, we applied our framework to the scenario of knowledge impact prediction and ranking. The experimental results show that (1) the similarity values of the ordinal sequences of eigenvector centrality, PageRank coefficient, betweenness centrality, and closeness centrality variables are respectively 0.410, 0.404, 0.342, and 0.407, which means the dissimilar synonym issue has a considerable effect on the topology structure calculations of knowledge networks; and (2) higher-ranked knowledge nodes show lower overlap rates between the raw term-based knowledge network and the ontology knowledge network, suggesting that the dissimilar synonym issue influences the reliability of detecting high-impact knowledge. (3) in the scenario of knowledge impact prediction, the dissimilar synonym issue has minimal effect on the task performance.
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.