Metapath and attribute-based academic collaborator recommendation in heterogeneous academic networks
Authors: Hui Li, Yaohua Hu
Journal: Scientometrics (JCR Q2, Computer Science, Interdisciplinary Applications; impact factor 3.5)
Publication type: Journal Article
Published: 2024-05-27
DOI: 10.1007/s11192-024-05043-x (https://doi.org/10.1007/s11192-024-05043-x)
Citations: 0
Abstract
Academic collaboration is fundamental to the advancement of scientific research. However, as the number of publications and researchers grows, it becomes increasingly difficult to identify suitable collaborators. Academic collaborator recommendation is a promising solution to this problem. Traditional recommendation methods based on collaborative filtering suffer from severe data sparsity. In recent years, network topology-based methods have shown good recommendation performance while alleviating the data sparsity issue to some extent by exploiting the relationships between nodes and their attributes. Nevertheless, these methods are typically based on homogeneous collaboration networks that consist only of scholar nodes and collaboration relationships, which leads to suboptimal performance. In reality, collaboration involves many different types of nodes and relations that carry multiplex information. To address this issue, we construct a heterogeneous academic information network comprising four types of nodes: scholars, papers, organizations, and publication venues. Based on this network, we design an academic collaborator recommendation model that captures the multi-type attribute features and the network topology features of nodes through metapaths. Specifically, the attribute features of nodes are embedded by a node type-aware embedding method. The topology features are then extracted through node type-aware aggregation and metapath instance aggregation. After that, a metapath aggregation method combines the different types of metapaths, each of which represents a factor that affects collaboration. In this way, both topology information and attribute information are preserved, and multiple types of collaboration factors are covered. Finally, we compute vector similarity to determine collaborators.
Experiments on a large-scale interdisciplinary academic dataset show that the proposed model exhibits outstanding performance in practical applications. Unlike traditional approaches confined to homogeneous collaboration networks, our model mines and leverages diverse node attributes and multiple factors that influence collaboration. This approach significantly enhances the accuracy and effectiveness of collaborator recommendations. Ultimately, we aspire to contribute to a more efficient and accessible platform that simplifies the search for suitable collaborators.
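The pipeline outlined in the abstract (metapath instances, instance aggregation, metapath aggregation, vector similarity) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy network, the node identifiers, the feature vectors, and the three metapaths are hypothetical, and plain mean pooling stands in for the aggregation steps, whose exact form the abstract does not specify.

```python
import math
from collections import defaultdict

# Hypothetical toy network with the four node types named in the abstract.
NODE_TYPE = {
    "s1": "scholar", "s2": "scholar", "s3": "scholar",
    "p1": "paper", "p2": "paper",
    "o1": "org", "v1": "venue",
}
EDGES = [("s1", "p1"), ("s2", "p1"), ("s3", "p2"),
         ("s1", "o1"), ("s3", "o1"), ("p1", "v1"), ("p2", "v1")]

# Hypothetical attribute vectors, standing in for the output of a
# node type-aware embedding step (all mapped into one 2-d space).
FEATURES = {
    "s1": [1.0, 0.0], "s2": [0.9, 0.1], "s3": [0.0, 1.0],
    "p1": [1.0, 0.2], "p2": [0.1, 1.0],
    "o1": [0.5, 0.5], "v1": [0.6, 0.4],
}

# Illustrative metapaths; each is assumed to model one collaboration factor.
METAPATHS = [
    ("scholar", "paper", "scholar"),                    # co-authorship
    ("scholar", "org", "scholar"),                      # shared organization
    ("scholar", "paper", "venue", "paper", "scholar"),  # shared venue
]

ADJ = defaultdict(list)
for u, v in EDGES:
    ADJ[u].append(v)
    ADJ[v].append(u)


def metapath_instances(start, metapath):
    """Enumerate simple paths from `start` whose node types follow `metapath`."""
    paths = [[start]] if NODE_TYPE[start] == metapath[0] else []
    for node_type in metapath[1:]:
        paths = [p + [n] for p in paths for n in ADJ[p[-1]]
                 if NODE_TYPE[n] == node_type and n not in p]
    return paths


def mean(vectors):
    """Element-wise mean of equally sized vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def scholar_embedding(scholar):
    """Mean-pool nodes per instance, instances per metapath, then metapaths."""
    per_metapath = []
    for metapath in METAPATHS:
        instances = metapath_instances(scholar, metapath)
        if not instances:
            continue  # this factor does not apply to the scholar
        instance_vecs = [mean([FEATURES[n] for n in path]) for path in instances]
        per_metapath.append(mean(instance_vecs))
    # Fall back to the scholar's own attributes if no metapath matched.
    return mean(per_metapath) if per_metapath else FEATURES[scholar]


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def recommend(scholar, k=2):
    """Rank the other scholars by cosine similarity of their embeddings."""
    target = scholar_embedding(scholar)
    others = [s for s, t in NODE_TYPE.items() if t == "scholar" and s != scholar]
    scored = [(s, cosine(target, scholar_embedding(s))) for s in others]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:k]


if __name__ == "__main__":
    for name, score in recommend("s1"):
        print(name, round(score, 3))
```

In this toy example `recommend("s1")` ranks `s2` ahead of `s3`, because `s1` and `s2` share a paper and have similar attributes. A learned model would more plausibly replace the mean pooling with trainable, possibly attention-based, aggregators and would be evaluated on held-out collaborations; those details are assumptions here and are not taken from the abstract.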
About the journal:
Scientometrics aims at publishing original studies, short communications, preliminary reports, review papers, letters to the editor and book reviews on scientometrics. The topics covered are results of research concerned with the quantitative features and characteristics of science. Emphasis is placed on investigations in which the development and mechanism of science are studied by means of (statistical) mathematical methods.
The Journal also provides the reader with important up-to-date information about international meetings and events in scientometrics and related fields. Appropriate bibliographic compilations are published as a separate section. Due to its fully interdisciplinary character, Scientometrics is indispensable to research workers and research administrators throughout the world. It provides valuable assistance to librarians and documentalists in central scientific agencies, ministries, research institutes and laboratories.
Scientometrics includes the Journal of Research Communication Studies. Consequently, its aims and scope cover those of the latter, namely, to bring the results of research investigations together in one place, in such a form that they will be of use not only to the investigators themselves but also to the entrepreneurs and research workers who form the object of these studies.