{"title":"Breaking barriers in hotspot mining: a novel approach to reflecting domain characteristics and correlations","authors":"Wei Chen, Zhengtao Yu, Shengxiang Gao, Yantuan Xian","doi":"10.1007/s10489-024-06136-z","DOIUrl":null,"url":null,"abstract":"<div><p>Hotspot mining is essential for acquiring information on hotspots and knowledge in a given domain, and it is also of great value for improving the efficiency and quality of scientific research work in the profession. Previous literature on hotspot mining did not take into account the domain characteristics of the literature and the diverse associations of the domain-specific literature itself. It is a challenging task to reflect the domain characteristics of the literature and use multiple correlations among the literature in the model. In this study, we depict each association link using a heterogeneous network of metallurgical literature and simultaneously fuse metallurgical domain-specific knowledge by aggregating the knowledge graph data of the neighbors into the term nodes of the heterogeneous network of the literature. A proposed heterogeneous academic network metallurgical literature hotspot mining method incorporates domain-specific knowledge. This method reflects various types of associational relation information in the literature via the heterogeneous network. In the meantime, it weights and analyzes the paths in the heterogeneous network, identifies the most critical paths for vectorized representation, and highlights the impact of essential paths and domain knowledge on representation learning, enhancing the information representation of diverse data in the model and improving its accuracy. The suggested model is compared with GCN, the MAGNN standard model, and its ablation model as applied to public and metallurgical literature datasets. The findings on the public dataset show that the proposed method is superior to the other two approaches. In contrast, the results for the metallurgical literature dataset are more conspicuous, with the proposed method exhibiting a more remarkable improvement in HR and NGCC.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 7","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-024-06136-z","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Hotspot mining is essential for acquiring information on hotspots and knowledge in a given domain, and it is also of great value for improving the efficiency and quality of scientific research work in the profession. Previous literature on hotspot mining did not take into account the domain characteristics of the literature and the diverse associations of the domain-specific literature itself. It is a challenging task to reflect the domain characteristics of the literature and use multiple correlations among the literature in the model. In this study, we depict each association link using a heterogeneous network of metallurgical literature and simultaneously fuse metallurgical domain-specific knowledge by aggregating the knowledge graph data of the neighbors into the term nodes of the heterogeneous network of the literature. A proposed heterogeneous academic network metallurgical literature hotspot mining method incorporates domain-specific knowledge. This method reflects various types of associational relation information in the literature via the heterogeneous network. In the meantime, it weights and analyzes the paths in the heterogeneous network, identifies the most critical paths for vectorized representation, and highlights the impact of essential paths and domain knowledge on representation learning, enhancing the information representation of diverse data in the model and improving its accuracy. The suggested model is compared with GCN, the MAGNN standard model, and its ablation model as applied to public and metallurgical literature datasets. The findings on the public dataset show that the proposed method is superior to the other two approaches. In contrast, the results for the metallurgical literature dataset are more conspicuous, with the proposed method exhibiting a more remarkable improvement in HR and NGCC.
期刊介绍:
With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance.
The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.