{"title":"Bayesian and frequentist statistical models to predict publishing output and article processing charge totals","authors":"Philip M. Dixon, Eric Schares","doi":"10.1002/asi.24981","DOIUrl":"https://doi.org/10.1002/asi.24981","url":null,"abstract":"<p>Academic libraries, institutions, and publishers are interested in predicting future publishing output to help evaluate publishing agreements. Current predictive models are overly simplistic and provide inaccurate predictions. This paper presents Bayesian and frequentist statistical models to predict future article counts and costs. These models use the past year's counts of corresponding authored peer-reviewed articles to predict the distribution of the number of articles in a future year. Article counts for each journal and year are modeled as a log-linear function of year with journal-specific coefficients. Journal-specific predictions are summed to predict the distribution of total paper count and combined with journal-specific costs to predict the distribution of total cost. We fit models to three data sets: 366 Wiley journals for 2016–2020, 376 Springer-Nature journals from 2017 to 2021, and 313 Wiley journals from 2017 to 2021. For each dataset, we compared predictions for the subsequent year to actual counts. The model predicts two datasets better than using either the annual mean count or a linear trend regression. For the third, no method predicts output well. A Bayesian model provides prediction uncertainties that account for all modeled sources of uncertainty. Better estimates of future publishing activity and costs provide critical, independent information for open publishing negotiations.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 6","pages":"917-932"},"PeriodicalIF":2.8,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asi.24981","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143901014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lucía Céspedes, Diego Kozlowski, Carolina Pradier, Maxime Holmberg Sainte-Marie, Natsumi Solange Shokida, Pierre Benz, Constance Poitras, Anton Boudreau Ninkov, Saeideh Ebrahimy, Philips Ayeni, Sarra Filali, Bing Li, Vincent Larivière
{"title":"Evaluating the linguistic coverage of OpenAlex: An assessment of metadata accuracy and completeness","authors":"Lucía Céspedes, Diego Kozlowski, Carolina Pradier, Maxime Holmberg Sainte-Marie, Natsumi Solange Shokida, Pierre Benz, Constance Poitras, Anton Boudreau Ninkov, Saeideh Ebrahimy, Philips Ayeni, Sarra Filali, Bing Li, Vincent Larivière","doi":"10.1002/asi.24979","DOIUrl":"https://doi.org/10.1002/asi.24979","url":null,"abstract":"<p>Clarivate's Web of Science (WoS) and Elsevier's Scopus have been for decades the main sources of bibliometric information. Although highly curated, these closed, proprietary databases are largely biased toward English-language publications, underestimating the use of other languages in research dissemination. Launched in 2022, OpenAlex promised comprehensive, inclusive, and open-source research information. While already in use by scholars and research institutions, the quality of its metadata is currently still being assessed. This paper contributes to this literature by assessing the completeness and accuracy of OpenAlex's metadata related to language, through a comparison with WoS, as well as an in-depth manual validation of a sample of 6836 articles. Results show that OpenAlex exhibits a far more balanced linguistic coverage than WoS. However, language metadata are not always accurate, which leads OpenAlex to overestimate the place of English while underestimating that of other languages. If used critically, OpenAlex can provide comprehensive and representative analyses of languages used for scholarly publishing, but more work is needed at infrastructural level to ensure the quality of metadata on language.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 6","pages":"884-895"},"PeriodicalIF":2.8,"publicationDate":"2025-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asi.24979","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143901110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Rona Nisa Sofia Amriza, Tzu-Chuan Chou, Wiwit Ratnasari
{"title":"Understanding the shifting nature of fake news research: Consumption, dissemination, and detection","authors":"Rona Nisa Sofia Amriza, Tzu-Chuan Chou, Wiwit Ratnasari","doi":"10.1002/asi.24980","DOIUrl":"https://doi.org/10.1002/asi.24980","url":null,"abstract":"<p>Fake news on social media spreads faster and has become a major societal concern, prompting numerous publications and knowledge sharing among researchers. This research aims to understand the shifting nature of fake news by investigating the citation relationships between significant publications using key route main path analysis (MPA). The process involves generating keywords, collecting and selecting relevant data, and conducting MPA on fake news in social media. The study analyzes 4.057 publications from 2010 to 2023, identifying 27 influential works shaping the knowledge diffusion in fake news research. Findings reveal two main phases: understanding fake news consumption patterns and analyzing its dissemination and detection mechanisms. Through multiple-global MPA, five research trends are identified: health misinformation, fact-checking, sharing behavior, fake news recognition, and physiological interventions. The study shows a continuous rise in publications and citations, with current trends focusing on health-related misinformation. This analysis offers insights into the development and diffusion of fake news topics on social media, emphasizing the importance of historical development in guiding future research by uncovering current trends. Highlighting the historical progression of research provides valuable context, enabling a more nuanced understanding of the field.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 6","pages":"896-916"},"PeriodicalIF":2.8,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143901108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Death by AI: Will large language models diminish Wikipedia?","authors":"Christian Wagner, Ling Jiang","doi":"10.1002/asi.24975","DOIUrl":"https://doi.org/10.1002/asi.24975","url":null,"abstract":"<p>We argue that advances in large language models (LLMs) and generative Artificial Intelligence (AI) will diminish the value of Wikipedia, due to a withdrawal by human content producers, who will withhold their efforts, perceiving less need for their efforts and increased “AI competition.” We believe the greatest threat to Wikipedia stems from the fact that Wikipedia is a user-generated product, relying on the “selfish altruism” of its human contributors. Contributors who reduce their contribution efforts as AI pervades the platform, will thus leave Wikipedia increasingly dependent on additional AI activity. This, combined with a dynamic where readership creates authorship and readers being disintermediated, will inevitably cause a vicious cycle leading to a staling of the content and diminishing value of this venerable knowledge resource.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 5","pages":"743-751"},"PeriodicalIF":2.8,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asi.24975","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143801501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"The use of bibliometrics for ranking the all-time greatest music artists","authors":"Timothy L. Urban","doi":"10.1002/asi.24976","DOIUrl":"https://doi.org/10.1002/asi.24976","url":null,"abstract":"<p>This brief communication presents a novel adaptation of common bibliometric measures to provide a quantitative assessment of an artist's music catalog that incorporates both impact and productivity. Data from <i>Billboard</i>'s weekly Hot 100™ music charts are used to rank the all-time greatest artists. Since the sorted data are increasing in value—that is, a number 1 hit is best—a transformation is applied to provide a convex, monotonically decreasing curve. Furthermore, since conventional bibliometrics result in several artists with identical measures, metrics inspired by the multidimensional <span></span><math>\u0000 <mrow>\u0000 <mi>h</mi>\u0000 </mrow></math>- and <span></span><math>\u0000 <mrow>\u0000 <mi>g</mi>\u0000 </mrow></math>-indices are used to rank the artists. We find that this approach provides a simple, yet unbiased, approach for ranking the all-time greatest music artists.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 6","pages":"843-847"},"PeriodicalIF":2.8,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143900975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mohammad Hasan Payandeh, Orland Hoeber, Miriam Boon, Dale Storie, Veronica Ramshaw
{"title":"A study of drag-and-drop query refinement and query history visualization for mobile exploratory search","authors":"Mohammad Hasan Payandeh, Orland Hoeber, Miriam Boon, Dale Storie, Veronica Ramshaw","doi":"10.1002/asi.24977","DOIUrl":"https://doi.org/10.1002/asi.24977","url":null,"abstract":"<p>When undertaking complex search scenarios, the underlying information need cannot be satisfied by finding a single optimal resource; instead, searchers need to engage in exploratory search processes to find multiple resources by iteratively revising and reformulation their queries. This process of query refinement is particularly challenging when using a mobile device, where typing is difficult. Furthermore, in mobile search contexts interruptions can lead to searchers losing track of what they were doing. To address these challenges, we designed a public digital library search interface for mobile devices that includes two novel features: drag-and-drop query refinement and query history visualization. To assess the value of this interface compared to a typical baseline, we conducted a controlled laboratory study with 32 participants that included pursuing complex search scenarios, being interrupted in the midst of the search, and resuming the search after the interruption. While participants took more time, they generated longer queries and reported positive subjective opinions about the usability of the exploratory search and task resumption features, along with a greater increase in certainty. These findings show the value of leveraging new touch-based interaction mechanisms within mobile search contexts, and the benefits that visualization can bring to supporting search task resumption.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 6","pages":"848-866"},"PeriodicalIF":2.8,"publicationDate":"2024-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asi.24977","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143900976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Spoken conversational search: Evaluating the effect of system clarifications on user experience through Wizard-of-Oz study","authors":"Souvick Ghosh, Chirag Shah","doi":"10.1002/asi.24974","DOIUrl":"https://doi.org/10.1002/asi.24974","url":null,"abstract":"<p>Prior research in human–computer interaction suggests that system-level clarifications are necessary for understanding user intent and communicating effectively with the user. Such clarifications or explanations could contain the system's abstract knowledge of the search or a functional description of the search process (queries and information sources employed). While these interactions may aid the user and the agent in better understanding each other, very few studies have explored the influence of such clarifications on the users' search experience. This research examines whether and how system-level clarifications (or explanations) affect the user experience when searching through spoken dialogues. We analyzed user satisfaction and preferences in systems with and without explicit clarifications in a within-subjects Wizard-of-Oz user study. We recruited 25 participants and collected user–system interaction data for 50 search sessions. The user feedback was collected using pre- and post-task surveys and exit interviews. Statistical and qualitative analysis of user responses yielded some interesting findings. While Wilcoxon Signed Rank Test found that using explicit system-level clarifications had no positive influence on the user's search experience, the overall search experience degraded with system clarifications (<i>Z</i> = −2.066, <i>p</i> = 0.04). The user interview data provided valuable insights into how and when clarifications should be offered to the user.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 5","pages":"819-839"},"PeriodicalIF":2.8,"publicationDate":"2024-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143801495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Wonchan Choi, Hyerin Bak, Jiaxin An, Yan Zhang, Besiki Stvilia
{"title":"College students' credibility assessments of GenAI-generated information for academic tasks: An interview study","authors":"Wonchan Choi, Hyerin Bak, Jiaxin An, Yan Zhang, Besiki Stvilia","doi":"10.1002/asi.24978","DOIUrl":"https://doi.org/10.1002/asi.24978","url":null,"abstract":"<p>The study explored college students' use of generative artificial intelligence (GenAI) tools, such as ChatGPT, for academic tasks and their perceptions and behaviors in assessing the credibility of GenAI-generated information. Semistructured interviews were conducted with 25 college students in the United States. Interview transcripts were analyzed using the qualitative content analysis method. The study identified various types of academic tasks for which students used ChatGPT, including writing, programming, and learning. Guided by two models of credibility assessment Hilligoss and Rieh (2008); Metzger (2007), six factors influencing students' motivation and ability to assess the credibility of GenAI-generated information were identified (e.g., task salience, social pressure). We also identified 9 constructs (e.g., refinedness, explainability), 5 heuristics (e.g., inter- and intrasystem consistency heuristics), and 10 cues (e.g., version and tone) used by students to assess the credibility of GenAI-generated information. This study provides theoretical and empirical findings regarding students' use of GenAI tools in the academic context and credibility evaluation of the system outputs using rich, qualitative interview data.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 6","pages":"867-883"},"PeriodicalIF":2.8,"publicationDate":"2024-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143900974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Using the S-DIKW framework to transform data visualization into data storytelling","authors":"Angelica Lo Duca, Kate McDowell","doi":"10.1002/asi.24973","DOIUrl":"https://doi.org/10.1002/asi.24973","url":null,"abstract":"<p>Communicating insights from data effectively requires design skills, technical knowledge, and experience. Data must be accurately represented with aesthetically pleasing visuals and engaging text to effectively communicate to the intended audience. Data storytelling has received much attention lately, but as of yet, it does not have a theoretical and practical foundation in information science. A data story adds context, narrative, and structure to the visual representation of data, providing audiences with character, plot, and a holistic experience of narrative. This paper proposes a methodological approach to transform a data visualization into a data story based on the Data-Information-Knowledge-Wisdom (DIKW) pyramid and the S-DIKW Framework. Starting from the bottom of the pyramid, the proposed approach defines a strategy to represent insights extracted from data. Data is then turned into information by identifying character(s) facing a problem, adding textual and graphic content; information is turned into knowledge by organizing what happens as a plot. Finally, a call to wise action—always informed by cultural and community values—completes the storytelling transformation to create a data story. This article contributes to the theoretical understanding of data stories as emerging information forms, supporting richer understandings of a story as information in the information sciences.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 5","pages":"803-818"},"PeriodicalIF":2.8,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/asi.24973","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143801665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Beyond decomposition: Hierarchical dependency management in multi-document question answering","authors":"Xiaoyan Zheng, Zhi Li, Qianglong Chen, Yin Zhang","doi":"10.1002/asi.24971","DOIUrl":"https://doi.org/10.1002/asi.24971","url":null,"abstract":"<p>When using retrieval-augmented generation (RAG) to handle multi-document question answering (MDQA) tasks, it is beneficial to decompose complex queries into multiple simpler ones to enhance retrieval results. However, previous strategies always employ a one-shot approach of question decomposition, overlooking subquestions dependency problem and failing to ensure that the derived subqueries are single-hop. To overcome this challenge, we introduce a novel framework called DSRC-QCS. Decompose-solve-renewal-cycle (DSRC) is an iterative multi-hop question processing module. The key idea of DSRC involves using a unique symbol to achieve hierarchical dependency management and employing a cyclical process of question decomposition, solving, and renewal to continuously generate and resolve all single-hop subquestions. Query-chain selector (QCS) functions as a voting mechanism that effectively utilizes the reasoning process of DSRC to assess and select solutions. We compare DSRC-QCS against five RAG approaches across three datasets and three LLMs. DSRC-QCS demonstrates superior performance. Compared to the Direct Retrieval method, DSRC-QCS improves the average F1 score by 17.36% with Alpaca-7b, 10.83% with LLaMa2-Chat-7b, and 11.88% with GPT-3.5-Turbo. We also conduct ablation studies to validate the performance of both DSRC and QCS and explore factors influencing the effectiveness of DSRC. We have included all prompts in the Appendix.</p>","PeriodicalId":48810,"journal":{"name":"Journal of the Association for Information Science and Technology","volume":"76 5","pages":"770-789"},"PeriodicalIF":2.8,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143801466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}