{"title":"Red alert: Millions of “homeless” publications in Scopus should be resettled","authors":"Weishu Liu, Haifeng Wang","doi":"10.1002/asi.25011","DOIUrl":"10.1002/asi.25011","url":null,"abstract":"<p>Scopus is increasingly regarded as a high-quality and reliable data source for research and evaluation of scientific and scholarly activity. However, a puzzling phenomenon has been discovered occasionally: millions of records with author affiliation information collected in Scopus are oddly labeled as “country-undefined” by Scopus, which is rarely detected in its counterpart Web of Science. This huge number of “homeless” records in Scopus will challenge the reliability of various Scopus-based literature retrieval, analysis and evaluation and therefore is unacceptable for a widely used high-quality bibliographic database. By using data from the past 124 years, this article tries to probe these affiliated but country-undefined records in Scopus. Our analysis identifies four primary causes for these “homeless” records: incomplete author affiliation addresses, Scopus' inability to recognize different variants of country/territory names, misspelled country/territory names in author affiliation addresses, and Scopus' insufficiency in correctly splitting and identifying the clean affiliation addresses. To address this pressing issue, we put forward several recommendations to relevant stakeholders, with the aim of resettling millions of “homeless” records in Scopus and reducing its potential impact on Scopus-based literature retrieval, analysis, and evaluation.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 10","pages":"1283-1291"},"PeriodicalIF":4.3,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145062625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Finding “similar” universities using ChatGPT for institutional benchmarking: A large-scale comparison of European universities","authors":"Benedetto Lepori, Lutz Bornmann, Mario Gay","doi":"10.1002/asi.25010","DOIUrl":"10.1002/asi.25010","url":null,"abstract":"<p>The study objective was to evaluate the efficacy of ChatGPT in identifying “similar” institutions for benchmarking the research performance of a university. Benchmarking is deemed a promising approach to compare “similar with similar” as a better alternative to rankings (comparing “different” universities). Current approaches either focus on a limited number of “quantitative” dimensions or are too complex for most users. We conducted large-scale testing by tasking ChatGPT with identifying the most similar European universities in terms of research performance, utilizing the European Tertiary Education Register data. We tested whether the peers suggested by ChatGPT were similar to the focal university on size, research intensity, and subject composition. Additionally, we evaluated whether providing more specific instructions improved the results. The findings offer a nuanced perspective on the potential and risks of using ChatGPT to identify peer institutions for benchmarking. On one hand, solely using ChatGPT would replicate the visibility biases associated with university rankings, thereby undermining the rationale for benchmarking. On the other hand, relying on semantic associations might capture dimensions of university similarity that are relevant and difficult to capture through quantitative methods. We finally reflected on the broader implications for scholars in higher education and science studies research.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 9","pages":"1174-1187"},"PeriodicalIF":4.3,"publicationDate":"2025-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144923835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Framework for assessing the risk to a field from fraudulent researchers: A case study of Alzheimer's disease","authors":"Chaoqun Ni, B. Ian Hutchins","doi":"10.1002/asi.25009","DOIUrl":"10.1002/asi.25009","url":null,"abstract":"<p>Concerns over research integrity are rising, with increasing attention to potential threats from untrustworthy authors. We established a framework to gauge the potential negative influence of researchers potentially engaged in misconduct. The field of Alzheimer's disease (AD) research has been a focal point of these worries. This study aims to assess the risk posed by questionable studies or individuals potentially engaging in fraudulent science in research by examining citation relationships among papers, taking AD research as an illustrative example. Analysis of citation network structure can elucidate the potential propagation of misinformation arising at the author level. Our analysis revealed that there aren't any single authors or papers whose citation connections jeopardize a major portion of the field's literature. This indicates a low probability of single entities undermining the majority of works in this area. However, our findings suggest that attention to the research integrity of the most influential scientists is warranted. Some scientists can reach a sizable minority of the literature through citations to their work. Emphasizing oversight of the integrity of these authors is crucial, given their influence on the field. Our study introduces an analytical framework adaptable across various fields and disciplines to evaluate potential risks from fraudulence.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 9","pages":"1162-1173"},"PeriodicalIF":4.3,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://asistdl.onlinelibrary.wiley.com/doi/epdf/10.1002/asi.25009","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144923633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jocelyn Cranefield, Matthew Lewellen, Spencer Lilley, Gillian Oliver
{"title":"Envisaging Data Nirvana: A Delphi study of ideal data culture","authors":"Jocelyn Cranefield, Matthew Lewellen, Spencer Lilley, Gillian Oliver","doi":"10.1002/asi.25008","DOIUrl":"10.1002/asi.25008","url":null,"abstract":"<p>In recent decades, the proliferation of data and advances in information technology have led organizations to value data more highly and aim to build a data culture that is suitable for promoting and sustaining data-related strategic outcomes. However, what a “good” data culture comprises is often expressed abstractly and there is no consensus about how such a culture should manifest in practice. This study explores the key dimensions and attributes of an ideal data culture, as perceived by expert practitioners in large, data-rich public sector organizations. Using a two-stage Delphi method, we engaged with 14 data management experts from Aotearoa New Zealand to understand their views on achieving “Data Nirvana” in practice, focusing on the attributes that explain an ideal data culture. Five categories of ideal data culture are identified: strategic agility, ethical use, human centricity, capability, and controls and discipline. These are linked through two unifying themes: trust and trustworthiness, and value integration. The resulting framework for data culture comprises seven elements. The study provides insights into the aspirational potential of data and the realities of organizational data practice, contributing to a deeper understanding of data culture.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 9","pages":"1147-1161"},"PeriodicalIF":4.3,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://asistdl.onlinelibrary.wiley.com/doi/epdf/10.1002/asi.25008","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144923697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"SciConNav: Knowledge navigation through contextual learning of extensive scientific research trajectories","authors":"Shibing Xiang, Xin Jiang, Bing Liu, Yurui Huang, Chaolin Tian, Yifang Ma","doi":"10.1002/asi.25005","DOIUrl":"10.1002/asi.25005","url":null,"abstract":"<p>New knowledge builds upon existing foundations, which means an interdependent relationship exists between knowledge, manifested in the historical records of the scientific system for hundreds of years. By leveraging natural language processing techniques, this study introduces the Scientific Concept Navigator, an embedding-based navigation model to infer the “knowledge pathway” from the research trajectories of millions of scholars. We validate that the learned representations effectively delineate disciplinary boundaries and capture the intricate relationships between diverse concepts. Utility of the navigation space is showcased through multiple applications. Firstly, we demonstrate the multi-step analogy inferences between concepts from various disciplines. Secondly, we formulate the cross-domain conceptual dimensions of knowledge, observing the distributional shifts of 19 disciplines along these conceptual dimensions, including “Theoretical” to “Applied,” and “Societal” to “Economic,” highlighting the evolution of functional attributes across diverse domains. Lastly, by analyzing the knowledge network structure, we find that knowledge connects with shorter global pathways, and interdisciplinary concepts play a critical role in enhancing accessibility. Our framework offers a novel approach to mining knowledge inheritance pathways from extensive scientific literature, which is of great significance for understanding scientific progression patterns, tailoring scientific learning trajectories, and accelerating scientific progress.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 10","pages":"1308-1339"},"PeriodicalIF":4.3,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145062522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Wenceslao Arroyo-Machado, Enrique Herrera-Viedma, Daniel Torres-Salinas
{"title":"The botization of science? Large-scale study of the presence and impact of Twitter bots in science dissemination","authors":"Wenceslao Arroyo-Machado, Enrique Herrera-Viedma, Daniel Torres-Salinas","doi":"10.1002/asi.24998","DOIUrl":"10.1002/asi.24998","url":null,"abstract":"<p>The rise of social media has brought new dynamics to the dissemination of scientific research, with Twitter (X) playing a significant role. This study focuses on the role of social bots—automated accounts designed to mimic human behavior and amplify content—in scientific communication. By analyzing over 3.7 million papers published between 2017 and 2021 and their 51 million Twitter mentions. Using a novel hybrid method that includes BotometerLite and specific activity parameters, with verification via a robustness check, it was found that 0.23% of accounts were bots. Despite their small numbers, these bots contributed to 4.72% of all mentions, indicating a significant presence, but with varied impact. Bots were particularly active in Mathematics, Physics, and Space Sciences, where they generated over 70% of tweets in some cases. Automated accounts disproportionately influence the visibility and perceived impact of research in these disciplines, which underscores the need for discipline-specific analysis when considering Twitter's role in scientific communication. This large-scale study highlights the potential for bots to skew altmetric indicators, misleading stakeholders about true engagement.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 8","pages":"1105-1122"},"PeriodicalIF":4.3,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144767970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"How does scientific research influence policymaking? A study of four types of citation pathways between research articles and AI policy documents","authors":"Zhe Cao, Lin Zhang, Ying Huang, Gunnar Sivertsen","doi":"10.1002/asi.25006","DOIUrl":"10.1002/asi.25006","url":null,"abstract":"<p>The importance of evidence-based policymaking is widely recognized, but how science influences policy remains insufficiently explored. This study aims to examine how policy documents cite research articles, thereby tracing the complex impact process of scientific research on policymaking. A conceptual model is proposed to classify four types of citation pathways by distinguishing between direct and indirect impacts and observing whether a reinforcement effect is present. To operationalize this model, we collected nearly 10 thousand policy documents related to artificial intelligence (AI) and over 1.6 million links between these policies and their referenced articles. A large-scale data analysis and a case study were conducted. Results exhibit distinct citation pathways among specific types of institutions, geopolitical areas, and policy areas. Indirect influences emerge as an important mechanism. Research articles from EU countries primarily serve the policymaking of inter-governmental organizations (IGOs) and the EU, while research articles from the USA significantly support both domestic and foreign policymaking. Notably, IGOs serve as key intermediaries, facilitating the indirect influence of research on policymaking. In addition, while the knowledge from the social sciences provides substantial support for policies in various areas, an increasing involvement of the natural sciences in the development of AI-related policies is found.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 10","pages":"1340-1356"},"PeriodicalIF":4.3,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145062859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Who funds whom exactly? A study of funding acknowledgments","authors":"Anna Panova, Nataliya Matveeva, Ivan Sterligov","doi":"10.1002/asi.25004","DOIUrl":"10.1002/asi.25004","url":null,"abstract":"<p>Research funding plays a crucial role in the production of knowledge, and its nature varies considerably from country to country. Numerous studies have analyzed research funding from a bibliometric perspective. However, the role of individual authors in attracting funding remains understudied, and it may be crucial for many actors. We propose a new approach that provides a more accurate picture and test it on post-Soviet countries with low scientific production. We analyze the funding sources of the most visible part of the natural sciences by focusing on the funding acknowledgments of their papers in Nature Index journals published in 2017–2021. Both the country of origin and types of sources are accounted for. Our approach reveals marked differences between traditionally used paper-level and proposed author-level funding links. The shares of funding sources measured in this way are very different, especially with regard to foreign sources and the role of specific countries. This is particularly important when studying international papers and the roles of the countries involved, even more so for the countries with lower research capacity. Utilizing a case-driven funding sources classification, we paint a rich picture of diverging post-Soviet funding landscapes, mostly driven by national grants and EU-wide programmes.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 10","pages":"1292-1307"},"PeriodicalIF":4.3,"publicationDate":"2025-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145062456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bayesian and frequentist statistical models to predict publishing output and article processing charge totals","authors":"Philip M. Dixon, Eric Schares","doi":"10.1002/asi.24981","DOIUrl":"10.1002/asi.24981","url":null,"abstract":"<p>Academic libraries, institutions, and publishers are interested in predicting future publishing output to help evaluate publishing agreements. Current predictive models are overly simplistic and provide inaccurate predictions. This paper presents Bayesian and frequentist statistical models to predict future article counts and costs. These models use the past year's counts of corresponding authored peer-reviewed articles to predict the distribution of the number of articles in a future year. Article counts for each journal and year are modeled as a log-linear function of year with journal-specific coefficients. Journal-specific predictions are summed to predict the distribution of total paper count and combined with journal-specific costs to predict the distribution of total cost. We fit models to three data sets: 366 Wiley journals for 2016–2020, 376 Springer-Nature journals from 2017 to 2021, and 313 Wiley journals from 2017 to 2021. For each dataset, we compared predictions for the subsequent year to actual counts. The model predicts two datasets better than using either the annual mean count or a linear trend regression. For the third, no method predicts output well. A Bayesian model provides prediction uncertainties that account for all modeled sources of uncertainty. Better estimates of future publishing activity and costs provide critical, independent information for open publishing negotiations.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 6","pages":"917-932"},"PeriodicalIF":4.3,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asi.24981","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143901014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bringing information science perspectives to data science: Opportunities and gaps","authors":"Matthew S. Mayernik","doi":"10.1002/asi.25000","DOIUrl":"10.1002/asi.25000","url":null,"abstract":"<p>Data science has many articulation points with information science, both in academic research contexts and in professional situations. Several recent journal special issues show the need for reflexivity in identifying and further building out these articulation points. In this brief communication, I outline aspects of data science that were not extensively discussed in detail within these special issues and deserve more attention from the <i>JASIST</i> community. I discuss how the information science community has important roles in building stronger theoretical understanding of data and data science, developing a more detailed understanding of the data science publishing landscape, and in mapping different manifestations of data science across societal sectors. Information science-informed work in these areas will enable further understanding of data and data science as academic and societal phenomena.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 8","pages":"1047-1051"},"PeriodicalIF":4.3,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144767914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}