Red alert: Millions of “homeless” publications in Scopus should be resettled

IF 4.3 2区 管理学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS
Weishu Liu, Haifeng Wang
{"title":"Red alert: Millions of “homeless” publications in Scopus should be resettled","authors":"Weishu Liu,&nbsp;Haifeng Wang","doi":"10.1002/asi.25011","DOIUrl":null,"url":null,"abstract":"<p>Scopus is increasingly regarded as a high-quality and reliable data source for research and evaluation of scientific and scholarly activity. However, a puzzling phenomenon has been discovered occasionally: millions of records with author affiliation information collected in Scopus are oddly labeled as “country-undefined” by Scopus, which is rarely detected in its counterpart Web of Science. This huge number of “homeless” records in Scopus will challenge the reliability of various Scopus-based literature retrieval, analysis and evaluation and therefore is unacceptable for a widely used high-quality bibliographic database. By using data from the past 124 years, this article tries to probe these affiliated but country-undefined records in Scopus. Our analysis identifies four primary causes for these “homeless” records: incomplete author affiliation addresses, Scopus' inability to recognize different variants of country/territory names, misspelled country/territory names in author affiliation addresses, and Scopus' insufficiency in correctly splitting and identifying the clean affiliation addresses. To address this pressing issue, we put forward several recommendations to relevant stakeholders, with the aim of resettling millions of “homeless” records in Scopus and reducing its potential impact on Scopus-based literature retrieval, analysis, and evaluation.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 10","pages":"1283-1291"},"PeriodicalIF":4.3000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Association for Information Science and Technology","FirstCategoryId":"91","ListUrlMain":"https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.25011","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Scopus is increasingly regarded as a high-quality and reliable data source for research and evaluation of scientific and scholarly activity. However, a puzzling phenomenon has been discovered occasionally: millions of records with author affiliation information collected in Scopus are oddly labeled as “country-undefined” by Scopus, which is rarely detected in its counterpart Web of Science. This huge number of “homeless” records in Scopus will challenge the reliability of various Scopus-based literature retrieval, analysis and evaluation and therefore is unacceptable for a widely used high-quality bibliographic database. By using data from the past 124 years, this article tries to probe these affiliated but country-undefined records in Scopus. Our analysis identifies four primary causes for these “homeless” records: incomplete author affiliation addresses, Scopus' inability to recognize different variants of country/territory names, misspelled country/territory names in author affiliation addresses, and Scopus' insufficiency in correctly splitting and identifying the clean affiliation addresses. To address this pressing issue, we put forward several recommendations to relevant stakeholders, with the aim of resettling millions of “homeless” records in Scopus and reducing its potential impact on Scopus-based literature retrieval, analysis, and evaluation.

Abstract Image

Abstract Image

Abstract Image

红色警报:Scopus中数百万“无家可归”的出版物应该重新安置
Scopus越来越被认为是科学和学术活动研究和评估的高质量和可靠的数据源。然而,偶尔会发现一个令人困惑的现象:Scopus中收集的数百万条带有作者归属信息的记录被Scopus奇怪地标记为“未定义国家”,而这在对应的Web of Science中很少被发现。Scopus中如此庞大的“无家可归”记录将挑战各种基于Scopus的文献检索、分析和评估的可靠性,因此对于广泛使用的高质量书目数据库来说是不可接受的。通过使用过去124年的数据,本文试图探索Scopus中这些相关但未定义国家的记录。我们的分析确定了这些“无家可归”记录的四个主要原因:不完整的作者归属地址,Scopus无法识别不同变体的国家/地区名称,作者归属地址中拼写错误的国家/地区名称,以及Scopus在正确分割和识别干净的归属地址方面的不足。为了解决这一紧迫问题,我们向相关利益相关者提出了几项建议,旨在重新安置Scopus中数百万“无家可归”的记录,并减少其对基于Scopus的文献检索、分析和评估的潜在影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.30
自引率
8.60%
发文量
115
期刊介绍: The Journal of the Association for Information Science and Technology (JASIST) is a leading international forum for peer-reviewed research in information science. For more than half a century, JASIST has provided intellectual leadership by publishing original research that focuses on the production, discovery, recording, storage, representation, retrieval, presentation, manipulation, dissemination, use, and evaluation of information and on the tools and techniques associated with these processes. The Journal welcomes rigorous work of an empirical, experimental, ethnographic, conceptual, historical, socio-technical, policy-analytic, or critical-theoretical nature. JASIST also commissions in-depth review articles (“Advances in Information Science”) and reviews of print and other media.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信