Heterogeneous hypergraph learning for literature retrieval based on citation intents
Authors: Kaiwen Shi, Kan Liu, Xinyan He
Journal: Scientometrics (impact factor 3.5; JCR Q2, Computer Science, Interdisciplinary Applications)
Publication date: 2024-06-21
DOI: https://doi.org/10.1007/s11192-024-05066-4
Citation count: 0
Abstract
Literature retrieval helps scientists find previous work that is relevant to their own research, or even discover new research ideas. However, most literature retrieval models neglect the discrepancy between retrieval results and the ultimate intention behind a citation. Citation intent refers to the researcher’s motivation for citing a paper. A citation intent graph with homogeneous nodes and heterogeneous hyperedges can represent different types of citation intents. By leveraging the citation intent information contained in a hypergraph, a retrieval model can understand the citation behaviour in the graph and guide researchers on where to cite its retrieval results. We present a ranking model called CitenGL (Citation Intent Graph Learning) that aims to extract citation intent information and textual matching signals. The proposed model consists of a heterogeneous hypergraph encoder and a lightweight deep fusion unit, chosen as an efficiency trade-off. Compared to traditional literature retrieval, our model fills the gap between retrieval results and citation intention and yields an understandable graph-structured output. We evaluated our model on publicly available full-text paper datasets. Experimental results show that CitenGL outperforms most existing neural ranking models that only consider textual information, which illustrates the effectiveness of integrating citation intent information with textual information. Further ablation analyses show how citation intent information complements text-matching signals and citation networks.
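To make the data structure in the abstract concrete, the following is a minimal sketch of a hypergraph with homogeneous nodes (every node is a paper) and heterogeneous hyperedges (every hyperedge carries an intent type). It is not the CitenGL implementation: the class names, the intent labels "background" and "method", and the paper identifiers are all hypothetical, and the sketch only stores and queries the graph without learning anything from it.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Hyperedge:
    intent: str        # hyperedge type; labels here are illustrative assumptions
    papers: frozenset  # ids of the papers this hyperedge connects


class CitationIntentHypergraph:
    """Homogeneous nodes (papers) joined by typed (heterogeneous) hyperedges."""

    def __init__(self):
        self.edges = []
        self.incident = defaultdict(list)  # paper id -> indices of its hyperedges

    def add_edge(self, intent, papers):
        """Add one hyperedge of the given intent type over a set of papers."""
        edge = Hyperedge(intent, frozenset(papers))
        index = len(self.edges)
        self.edges.append(edge)
        for paper in edge.papers:
            self.incident[paper].append(index)
        return index

    def intent_profile(self, paper):
        """Count how many hyperedges of each intent type contain the paper."""
        counts = defaultdict(int)
        for index in self.incident.get(paper, []):
            counts[self.edges[index].intent] += 1
        return dict(counts)

    def neighbours(self, paper, intent=None):
        """Papers sharing a hyperedge with `paper`, optionally of one intent."""
        found = set()
        for index in self.incident.get(paper, []):
            edge = self.edges[index]
            if intent is None or edge.intent == intent:
                found |= edge.papers
        found.discard(paper)
        return found


# Hypothetical toy data: three hyperedges over five papers.
g = CitationIntentHypergraph()
g.add_edge("background", ["P1", "P2", "P3"])
g.add_edge("method", ["P1", "P4"])
g.add_edge("method", ["P2", "P4", "P5"])
```

In this toy graph, `g.neighbours("P1", intent="method")` returns only the papers linked to P1 through method-type hyperedges, which is the kind of intent-specific signal that plain citation links do not expose.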
About the journal:
Scientometrics aims at publishing original studies, short communications, preliminary reports, review papers, letters to the editor and book reviews on scientometrics. The topics covered are results of research concerned with the quantitative features and characteristics of science. Emphasis is placed on investigations in which the development and mechanism of science are studied by means of (statistical) mathematical methods.
The Journal also provides the reader with important up-to-date information about international meetings and events in scientometrics and related fields. Appropriate bibliographic compilations are published as a separate section. Due to its fully interdisciplinary character, Scientometrics is indispensable to research workers and research administrators throughout the world. It provides valuable assistance to librarians and documentalists in central scientific agencies, ministries, research institutes and laboratories.
Scientometrics includes the Journal of Research Communication Studies. Consequently, its aims and scope cover those of the latter, namely, to bring the results of research investigations together in one place, in such a form that they will be of use not only to the investigators themselves but also to the entrepreneurs and research workers who form the object of these studies.