Yang Zhao, Mengting Zhang, Xiaoli Chen, Zhixiong Zhang
{"title":"Early identification of scientific breakthroughs through outlier analysis based on research entities","authors":"Yang Zhao, Mengting Zhang, Xiaoli Chen, Zhixiong Zhang","doi":"10.2478/jdis-2024-0027","DOIUrl":"https://doi.org/10.2478/jdis-2024-0027","url":null,"abstract":"Purpose To address the “anomalies” that occur when scientific breakthroughs emerge, this study focuses on identifying early signs and nascent stages of breakthrough innovations from the perspective of outliers, aiming to achieve early identification of scientific breakthroughs in papers. Design/methodology/approach This study utilizes semantic technology to extract research entities from the titles and abstracts of papers to represent each paper’s research content. Outlier detection methods are then employed to measure and analyze the anomalies in breakthrough papers during their early stages. The development and evolution process are traced using literature time tags. Finally, a case study is conducted using the key publications of the 2021 Nobel Prize laureates in Physiology or Medicine. Findings Through manual analysis of all identified outlier papers, the effectiveness of the proposed method for early identifying potential scientific breakthroughs is verified. Research limitations The study’s applicability has only been empirically tested in the biomedical field. More data from various fields are needed to validate the robustness and generalizability of the method. Practical implications This study provides a valuable supplement to current methods for early identification of scientific breakthroughs, effectively supporting technological intelligence decision-making and services. Originality/Value The study introduces a novel approach to early identification of scientific breakthroughs by leveraging outlier analysis of research entities, offering a more sensitive, precise, and fine-grained alternative method compared to traditional citation-based evaluations, which enhances the ability to identify nascent breakthrough innovations.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"49 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142203538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Community detection on elite mathematicians’ collaboration network","authors":"Yurui Huang, Zimo Wang, Chaolin Tian, Yifang Ma","doi":"10.2478/jdis-2024-0026","DOIUrl":"https://doi.org/10.2478/jdis-2024-0026","url":null,"abstract":"Purpose This study focuses on understanding the collaboration relationships among mathematicians, particularly those esteemed as elites, to reveal the structures of their communities and evaluate their impact on the field of mathematics. Design/methodology/approach Two community detection algorithms, namely Greedy Modularity Maximization and Infomap, are utilized to examine collaboration patterns among mathematicians. We conduct a comparative analysis of mathematicians’ centrality, emphasizing the influence of award-winning individuals in connecting network roles such as Betweenness, Closeness, and Harmonic centrality. Additionally, we investigate the distribution of elite mathematicians across communities and their relationships within different mathematical sub-fields. Findings The study identifies the substantial influence exerted by award-winning mathematicians in connecting network roles. The elite distribution across the network is uneven, with a concentration within specific communities rather than being evenly dispersed. Secondly, the research identifies a positive correlation between distinct mathematical sub-fields and the communities, indicating collaborative tendencies among scientists engaged in related domains. Lastly, the study suggests that reduced research diversity within a community might lead to a higher concentration of elite scientists within that specific community. Research limitations The study’s limitations include its narrow focus on mathematicians, which may limit the applicability of the findings to broader scientific fields. Issues with manually collected data affect the reliability of conclusions about collaborative networks. Practical implications This study offers valuable insights into how elite mathematicians collaborate and how knowledge is disseminated within mathematical circles. Understanding these collaborative behaviors could aid in fostering better collaboration strategies among mathematicians and institutions, potentially enhancing scientific progress in mathematics. Originality/value The study adds value to understanding collaborative dynamics within the realm of mathematics, offering a unique angle for further exploration and research.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"4 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142203534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data-enhanced revealing of trends in Geoscience","authors":"Yu Zhao, Meng Wang, Jiaxin Ding, Jiexing Qi, Lyuwen Wu, Sibo Zhang, Luoyi Fu, Xinbing Wang, Li Cheng","doi":"10.2478/jdis-2024-0023","DOIUrl":"https://doi.org/10.2478/jdis-2024-0023","url":null,"abstract":"Purpose This article presents an in-depth analysis of global research trends in Geosciences from 2014 to 2023. By integrating bibliometric analysis with expert insights from the Deeptime Digital Earth (DDE) initiative, this article identifies key emerging themes shaping the landscape of Earth Sciences<jats:sup>①</jats:sup>. Design/methodology/approach The identification process involved a meticulous analysis of over 400,000 papers from 466 Geosciences journals and approximately 5,800 papers from 93 interdisciplinary journals sourced from the Web of Science and Dimensions database. To map relationships between articles, citation networks were constructed, and spectral clustering algorithms were then employed to identify groups of related research, resulting in 407 clusters. Relevant research terms were extracted using the Log-Likelihood Ratio (LLR) algorithm, followed by statistical analyses on the volume of papers, average publication year, and average citation count within each cluster. Additionally, expert knowledge from DDE Scientific Committee was utilized to select top 30 trends based on their representation, relevance, and impact within Geosciences, and finalize naming of these top trends with consideration of the content and implications of the associated research. This comprehensive approach in systematically delineating and characterizing the trends in a way which is understandable to geoscientists. Findings Thirty significant trends were identified in the field of Geosciences, spanning five domains: deep space, deep time, deep Earth, habitable Earth, and big data. These topics reflect the latest trends and advancements in Geosciences and have the potential to address real-world problems that are closely related to society, science, and technology. Research limitations The analyzed data of this study only contain those were included in the Web of Science. Practical implications This study will strongly support the organizations and individual scientists to understand the modern frontier of earth science, especially on solid earth. The organizations such as the surveys or natural science fund could map out areas for future exploration and analyze the hot topics reference to this study. Originality/value This paper integrates bibliometric analysis with expert insights to highlight the most significant trends on earth science and reach the individual scientist and public by global voting.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"80 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141882818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Navigating interdisciplinary research: Historical progression and contemporary challenges","authors":"Xiaoqiang Li, Fen Cai, Jintao Bao, Yuqing Jian, Zehui Sun, Xin Xie","doi":"10.2478/jdis-2024-0025","DOIUrl":"https://doi.org/10.2478/jdis-2024-0025","url":null,"abstract":"Interdisciplinary research plays a crucial role in addressing complex problems by integrating knowledge from multiple disciplines. This integration fosters innovative solutions and enhances understanding across various fields. This study explores the historical and sociological development of interdisciplinary research and maps its evolution through three distinct phases: pre-disciplinary, disciplinary, and post-disciplinary. It identifies key internal dynamics, such as disciplinary diversification, reorganization, and innovation, as primary drivers of this evolution. Additionally, this study highlights how external factors, particularly the urgency of World War II and the subsequent political and economic changes, have accelerated its advancement. The rise of interdisciplinary research has significantly reshaped traditional educational paradigms, promoting its integration across different educational levels. However, the inherent contradictions within interdisciplinary research present cognitive, emotional, and institutional challenges for researchers. Meanwhile, finding a balance between the breadth and depth of knowledge remains a critical challenge in interdisciplinary education.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"187 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141882813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ziyan Xu, Hongqi Han, Linna Li, Junsheng Zhang, Zexu Zhou
{"title":"Identifying multidisciplinary problems from scientific publications based on a text generation method","authors":"Ziyan Xu, Hongqi Han, Linna Li, Junsheng Zhang, Zexu Zhou","doi":"10.2478/jdis-2024-0021","DOIUrl":"https://doi.org/10.2478/jdis-2024-0021","url":null,"abstract":"Purpose A text generation based multidisciplinary problem identification method is proposed, which does not rely on a large amount of data annotation. Design/methodology/approach The proposed method first identifies the research objective types and disciplinary labels of papers using a text classification technique; second, it generates abstractive titles for each paper based on abstract and research objective types using a generative pre-trained language model; third, it extracts problem phrases from generated titles according to regular expression rules; fourth, it creates problem relation networks and identifies the same problems by exploiting a weighted community detection algorithm; finally, it identifies multidisciplinary problems based on the disciplinary labels of papers. Findings Experiments in the “Carbon Peaking and Carbon Neutrality” field show that the proposed method can effectively identify multidisciplinary research problems. The disciplinary distribution of the identified problems is consistent with our understanding of multidisciplinary collaboration in the field. Research limitations It is necessary to use the proposed method in other multidisciplinary fields to validate its effectiveness. Practical implications Multidisciplinary problem identification helps to gather multidisciplinary forces to solve complex real-world problems for the governments, fund valuable multidisciplinary problems for research management authorities, and borrow ideas from other disciplines for researchers. Originality/value This approach proposes a novel multidisciplinary problem identification method based on text generation, which identifies multidisciplinary problems based on generative abstractive titles of papers without data annotation required by standard sequence labeling techniques.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"39 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141777492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Publication behaviour and (dis)qualification of chief editors in Turkish national Social Sciences journals","authors":"Lokman Tutuncu","doi":"10.2478/jdis-2024-0022","DOIUrl":"https://doi.org/10.2478/jdis-2024-0022","url":null,"abstract":"Purpose This study investigated the publication behaviour of 573 chief editors managing 432 Social Sciences journals in Turkey. Direct inquiries into editorial qualifications are rare, and this research aims to shed light on editors’ scientific leadership capabilities. Design/methodology/approach This study contrasts insider publication behaviour in national journals with international articles in journals indexed by the Web of Science (WOS) and Scopus. It argues that editors demonstrating a consistent ability to publish in competitive WOS and Scopus indexed journals signal high qualifications, while editors with persistent insider behaviour and strong local orientation signal low qualification. Scientific leadership capability is measured by first-authored publications. Correlation and various regression tests are conducted to identify significant determinants of publication behaviour. Findings International publications are rare and concentrated on a few individuals, while insider publications are endemic and constitute nearly 40% of all national articles. Editors publish 3.2 insider papers and 8.1 national papers for every SSCI article. 62% (58%) of the editors have no SSCI (Scopus) article, 53% (63%) do not have a single lead-authored WOS (Scopus) article, and 89% publish at least one insider paper. Only a minority consistently publish in international journals; a fifth of the editors have three or more SSCI publications, and a quarter have three or more Scopus articles. Editors with foreign Ph.D. degrees are the most qualified and internationally oriented, whereas non-mobile editors are the most underqualified and underperform other editors by every measure. Illustrating the overall lack of qualification, nearly half of the professor editors and the majority of the WOS and Scopus indexed journal editors have no record of SSCI or Scopus publications. Research limitations This research relies on local settings that encourage national publications at the expense of international journals. Findings should be evaluated in light of this setting and bearing in mind that narrow localities are more prone to peer favouritism. Practical implications Incompetent and nepotistic editors pose an imminent threat to Turkish national literature. A lasting solution would likely include the dismissal and replacement of unqualified editors, as well as delisting and closure of dozens of journals that operate in questionable ways and serve little scientific purpose. Originality/value To my knowledge, this is the first study to document the publication behaviour of national journal chief editors.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"45 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141777528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xintong Zhao, Kyle Langlois, Jacob Furst, Yuan An, Xiaohua Hu, Diego Gomez Gualdron, Fernando Uribe-Romo, Jane Greenberg
{"title":"Research evolution of metal organic frameworks: A scientometric approach with human-in-the-loop","authors":"Xintong Zhao, Kyle Langlois, Jacob Furst, Yuan An, Xiaohua Hu, Diego Gomez Gualdron, Fernando Uribe-Romo, Jane Greenberg","doi":"10.2478/jdis-2024-0019","DOIUrl":"https://doi.org/10.2478/jdis-2024-0019","url":null,"abstract":"Purpose This paper reports on a scientometric analysis bolstered by human-in-the-loop, domain experts, to examine the field of metal-organic frameworks (MOFs) research. Scientometric analyses reveal the intellectual landscape of a field. The study engaged MOF scientists in the design and review of our research workflow. MOF materials are an essential component in next-generation renewable energy storage and biomedical technologies. The research approach demonstrates how engaging experts, via human-in-the-loop processes, can help develop a comprehensive view of a field’s research trends, influential works, and specialized topics. Design/methodology/approach A scientometric analysis was conducted, integrating natural language processing (NLP), topic modeling, and network analysis methods. The analytical approach was enhanced through a human-in-the-loop iterative process involving MOF research scientists at selected intervals. MOF researcher feedback was incorporated into our method. The data sample included 65,209 MOF research articles. Python3 and software tool <jats:italic>VOSviewer</jats:italic> were used to perform the analysis. Findings The findings demonstrate the value of including domain experts in research workflows, refinement, and interpretation of results. At each stage of the analysis, the MOF researchers contributed to interpreting the results and method refinements targeting our focus on MOF research. This study identified influential works and their themes. Our findings also underscore four main MOF research directions and applications. Research limitations This study is limited by the sample (articles identified and referenced by the Cambridge Structural Database) that informed our analysis. Practical implications Our findings contribute to addressing the current gap in fully mapping out the comprehensive landscape of MOF research. Additionally, the results will help domain scientists target future research directions. Originality/value To the best of our knowledge, the number of publications collected for analysis exceeds those of previous studies. This enabled us to explore a more extensive body of MOF research compared to previous studies. Another contribution of our work is the iterative engagement of domain scientists, who brought in-depth, expert interpretation to the data analysis, helping hone the study.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"61 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141737413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Tracking direct and indirect impact on technology and policy of transformative research via ego citation network","authors":"Xian Li, Xiaojun Hu","doi":"10.2478/jdis-2024-0018","DOIUrl":"https://doi.org/10.2478/jdis-2024-0018","url":null,"abstract":"Purpose The disseminating of academic knowledge to nonacademic audiences partly relies on the transition of subsequent citing papers. This study aims to investigate direct and indirect impact on technology and policy originating from transformative research based on ego citation network. Design/methodology/approach Key Nobel Prize-winning publications (NPs) in fields of gene engineering and astrophysics are regarded as a proxy for transformative research. In this contribution, we introduce a network-structural indicator of citing patents to measure technological impact of a target article and use policy citations as a preliminary tool for policy impact. Findings The results show that the impact on technology and policy of NPs are higher than that of their subsequent citation generations in gene engineering but not in astrophysics. Research limitations The selection of Nobel Prizes is not balanced and the database used in this study, <jats:italic>Dimensions</jats:italic>, suffers from incompleteness and inaccuracy of citation links. Practical implications Our findings provide useful clues to better understand the characteristics of transformative research in technological and policy impact. Originality/value This study proposes a new framework to explore the direct and indirect impact on technology and policy originating from transformative research.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"138 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141745706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Detecting LLM-assisted writing in scientific communication: Are we there yet?","authors":"Teddy Lazebnik, Ariel Rosenfeld","doi":"10.2478/jdis-2024-0020","DOIUrl":"https://doi.org/10.2478/jdis-2024-0020","url":null,"abstract":"Large Language Models (LLMs), exemplified by ChatGPT, have significantly reshaped text generation, particularly in the realm of writing assistance. While ethical considerations underscore the importance of transparently acknowledging LLM use, especially in scientific communication, genuine acknowledgment remains infrequent. A potential avenue to encourage accurate acknowledging of LLM-assisted writing involves employing automated detectors. Our evaluation of four cutting-edge LLM-generated text detectors reveals their suboptimal performance compared to a simple ad-hoc detector designed to identify abrupt writing style changes around the time of LLM proliferation. We contend that the development of specialized detectors exclusively dedicated to LLM-assisted writing detection is necessary. Such detectors could play a crucial role in fostering more authentic recognition of LLM involvement in scientific communication, addressing the current challenges in acknowledgment practices.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"26 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141577787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Beyond authorship: Analyzing contributions in PLOS ONE and the challenges of appropriate attribution","authors":"Abdelghani Maddi, Jaime A. Teixeira da Silva","doi":"10.2478/jdis-2024-0015","DOIUrl":"https://doi.org/10.2478/jdis-2024-0015","url":null,"abstract":"Purpose This study aims to evaluate the accuracy of authorship attributions in scientific publications, focusing on the fairness and precision of individual contributions within academic works. Design/methodology/approach The study analyzes 81,823 publications from the journal <jats:italic>PLOS ONE</jats:italic>, covering the period from January 2018 to June 2023. It examines the authorship attributions within these publications to try and determine the prevalence of inappropriate authorship. It also investigates the demographic and professional profiles of affected authors, exploring trends and potential factors contributing to inaccuracies in authorship. Findings Surprisingly, 9.14% of articles feature at least one author with inappropriate authorship, affecting over 14,000 individuals (2.56% of the sample). Inappropriate authorship is more concentrated in Asia, Africa, and specific European countries like Italy. Established researchers with significant publication records and those affiliated with companies or nonprofits show higher instances of potential monetary authorship. Research limitations Our findings are based on contributions as declared by the authors, which implies a degree of trust in their transparency. However, this reliance on self-reporting may introduce biases or inaccuracies into the dataset. Further research could employ additional verification methods to enhance the reliability of the findings. Practical implications These findings have significant implications for journal publishers, highlighting the necessity for robust control mechanisms to ensure the integrity of authorship attributions. Moreover, researchers must exercise discernment in determining when to acknowledge a contributor and when to include them in the author list. Addressing these issues is crucial for maintaining the credibility and fairness of academic publications. Originality/value This study contributes to an understanding of critical issues within academic authorship, shedding light on the prevalence and impact of inappropriate authorship attributions. By calling for a nuanced approach to ensure accurate credit is given where it is due, the study underscores the importance of upholding ethical standards in scholarly publishing.","PeriodicalId":44622,"journal":{"name":"Journal of Data and Information Science","volume":"75 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141187893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}