The published role of artificial intelligence in drug discovery and development: a bibliometric and social network analysis from 1990 to 2023

IF 7.1 2区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Murat Koçak, Zafer Akçalı
{"title":"The published role of artificial intelligence in drug discovery and development: a bibliometric and social network analysis from 1990 to 2023","authors":"Murat Koçak,&nbsp;Zafer Akçalı","doi":"10.1186/s13321-025-00988-4","DOIUrl":null,"url":null,"abstract":"<div><p>Today, drug discovery and development is one of the fields where Artificial Intelligence (AI) is used extensively. Therefore, this study aims to systematically analyze the scientific literature on the application of AI in drug discovery and development to understand the evolution, trends, and key contributors within this rapidly growing field. By leveraging various bibliometric indicators and visualization techniques, we seek to explore the growth patterns, influential authors and institutions, collaboration networks, and emerging research trends within this domain. Bibliometric and network analysis methods (co-occurrence, co-authorship, and collaboration, etc.) were used to achieve this goal. Bibliometric visualization tools such as Bibliometrix R package software, VOSviewer, and Litmaps were used for comprehensive data analysis. Scientific publications on AI in drug discovery and development were retrieved from the Web of Science Core Collection (WoS CC) database covering 1990–2023. In addition to visualization programs, the InCites database was also used for analysis and visualization. A total of 4059 scientific publications written by 13,932 authors and published in 1071 journals were included in the analysis. The results reveal that the most prolific authors are Ekins (n = 67), Schneider (n = 52), Hou Tj (n = 43), and Cao Ds (n = 34), while the most active institutions are the “Chinese Academy of Science” and “University of California.” The leading scientific journals are “Journal of Chemical Information and Modelling,” “Briefings in Bioinformatics,” and “Journal of Cheminformatics.” The most frequently used author keywords include “protein folding,” “QSAR,” “gene expression data,” “coronavirus,” and “genome rearrangement.” The average number of citations per scientific publication is 28.62, indicating a high impact of research in this field. A significant increase in publications was observed after 2014, with a peak in 2022, followed by a slight decline. International collaboration accounts for 28.06% of the publications, with the USA and China leading in both productivity and influence. The study also identifies key funding organizations, such as the National Natural Science Foundation of China (NSFC) and the United States Department of Health &amp; Human Services, which have significantly supported advancements in this field. In conclusion, this study highlights the transformative role of AI in drug discovery and development, showcasing its potential to accelerate innovation and improve efficiency. The findings provide valuable insights into the current state of research, emerging trends, and future directions, offering a roadmap for researchers, industry professionals, and policymakers to further explore and leverage AI technologies in this domain.</p><p><b>Scientific contribution</b>This study provides a comprehensive bibliometric analysis of 4,059 scientific publications (1990–2023) to map the evolution, trends, and key contributors in AI-driven drug discovery, identifying prolific authors (e.g., Ekins, Schneider), leading institutions (e.g., Chinese Academy of Sciences, University of California), and high-impact journals (<i>Journal of Chemical Information and Modelling</i>). It reveals critical collaboration patterns (28.06% international co-authorships), dominant funding sources (e.g., NSFC, NIH), and emerging research hotspots (e.g.,<i> protein folding, QSAR, coronavirus</i>), while highlighting the transformative role of deep learning post-2014. By synthesizing these insights, the study offers a strategic roadmap for researchers and policymakers to optimize AI applications in drug development, addressing both current challenges and future opportunities in the field.</p></div>","PeriodicalId":617,"journal":{"name":"Journal of Cheminformatics","volume":"17 1","pages":""},"PeriodicalIF":7.1000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://jcheminf.biomedcentral.com/counter/pdf/10.1186/s13321-025-00988-4","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cheminformatics","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1186/s13321-025-00988-4","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Today, drug discovery and development is one of the fields where Artificial Intelligence (AI) is used extensively. Therefore, this study aims to systematically analyze the scientific literature on the application of AI in drug discovery and development to understand the evolution, trends, and key contributors within this rapidly growing field. By leveraging various bibliometric indicators and visualization techniques, we seek to explore the growth patterns, influential authors and institutions, collaboration networks, and emerging research trends within this domain. Bibliometric and network analysis methods (co-occurrence, co-authorship, and collaboration, etc.) were used to achieve this goal. Bibliometric visualization tools such as Bibliometrix R package software, VOSviewer, and Litmaps were used for comprehensive data analysis. Scientific publications on AI in drug discovery and development were retrieved from the Web of Science Core Collection (WoS CC) database covering 1990–2023. In addition to visualization programs, the InCites database was also used for analysis and visualization. A total of 4059 scientific publications written by 13,932 authors and published in 1071 journals were included in the analysis. The results reveal that the most prolific authors are Ekins (n = 67), Schneider (n = 52), Hou Tj (n = 43), and Cao Ds (n = 34), while the most active institutions are the “Chinese Academy of Science” and “University of California.” The leading scientific journals are “Journal of Chemical Information and Modelling,” “Briefings in Bioinformatics,” and “Journal of Cheminformatics.” The most frequently used author keywords include “protein folding,” “QSAR,” “gene expression data,” “coronavirus,” and “genome rearrangement.” The average number of citations per scientific publication is 28.62, indicating a high impact of research in this field. A significant increase in publications was observed after 2014, with a peak in 2022, followed by a slight decline. International collaboration accounts for 28.06% of the publications, with the USA and China leading in both productivity and influence. The study also identifies key funding organizations, such as the National Natural Science Foundation of China (NSFC) and the United States Department of Health & Human Services, which have significantly supported advancements in this field. In conclusion, this study highlights the transformative role of AI in drug discovery and development, showcasing its potential to accelerate innovation and improve efficiency. The findings provide valuable insights into the current state of research, emerging trends, and future directions, offering a roadmap for researchers, industry professionals, and policymakers to further explore and leverage AI technologies in this domain.

Scientific contributionThis study provides a comprehensive bibliometric analysis of 4,059 scientific publications (1990–2023) to map the evolution, trends, and key contributors in AI-driven drug discovery, identifying prolific authors (e.g., Ekins, Schneider), leading institutions (e.g., Chinese Academy of Sciences, University of California), and high-impact journals (Journal of Chemical Information and Modelling). It reveals critical collaboration patterns (28.06% international co-authorships), dominant funding sources (e.g., NSFC, NIH), and emerging research hotspots (e.g., protein folding, QSAR, coronavirus), while highlighting the transformative role of deep learning post-2014. By synthesizing these insights, the study offers a strategic roadmap for researchers and policymakers to optimize AI applications in drug development, addressing both current challenges and future opportunities in the field.

人工智能在药物发现和开发中的作用:1990年至2023年的文献计量学和社会网络分析
今天,药物发现和开发是人工智能(AI)广泛应用的领域之一。因此,本研究旨在系统分析有关人工智能在药物发现和开发中的应用的科学文献,以了解这一快速发展领域的演变、趋势和关键贡献者。通过利用各种文献计量指标和可视化技术,我们试图探索该领域的增长模式、有影响力的作者和机构、合作网络和新兴研究趋势。文献计量学和网络分析方法(共现、合著、协作等)被用来实现这一目标。文献计量可视化工具(如Bibliometrix R软件包软件、VOSviewer和Litmaps)用于综合数据分析。从Web of Science Core Collection (WoS CC)数据库中检索1990-2023年间有关人工智能在药物发现和开发中的科学出版物。除了可视化程序,InCites数据库也用于分析和可视化。共有13932位作者在1071种期刊上发表的4059篇科学论文被纳入分析。结果显示,最高产的作者是Ekins (n = 67)、Schneider (n = 52)、Hou Tj (n = 43)和Cao Ds (n = 34),而最活跃的机构是“中国科学院”和“加州大学”。主要的科学期刊有《化学信息与建模》、《生物信息学简报》和《化学信息学杂志》。最常用的作者关键词包括“蛋白质折叠”、“QSAR”、“基因表达数据”、“冠状病毒”和“基因组重排”。每份科学出版物的平均引用次数为28.62次,表明该领域的研究具有很高的影响力。2014年之后,论文发表量显著增加,2022年达到峰值,随后略有下降。国际合作论文占28.06%,美国和中国在生产率和影响力方面都处于领先地位。该研究还确定了主要资助组织,如中国国家自然科学基金委员会(NSFC)和美国卫生与公众服务部,它们大力支持了这一领域的进展。总之,本研究强调了人工智能在药物发现和开发中的变革性作用,展示了其加速创新和提高效率的潜力。这些发现为研究现状、新兴趋势和未来方向提供了有价值的见解,为研究人员、行业专业人士和政策制定者进一步探索和利用该领域的人工智能技术提供了路线图。科学贡献本研究对4059份科学出版物(1990-2023年)进行了全面的文献计量分析,以绘制人工智能驱动药物发现的演变、趋势和主要贡献者,确定了高产作者(如Ekins, Schneider)、领先机构(如中国科学院、加州大学)和高影响力期刊(《化学信息与建模杂志》)。它揭示了关键的合作模式(28.06%的国际合作作者)、主要的资金来源(如美国国家自然科学基金委员会、美国国立卫生研究院)和新兴的研究热点(如蛋白质折叠、QSAR、冠状病毒),同时突出了2014年后深度学习的变革作用。通过综合这些见解,该研究为研究人员和政策制定者提供了一个战略路线图,以优化人工智能在药物开发中的应用,应对该领域当前的挑战和未来的机遇。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Cheminformatics
Journal of Cheminformatics CHEMISTRY, MULTIDISCIPLINARY-COMPUTER SCIENCE, INFORMATION SYSTEMS
CiteScore
14.10
自引率
7.00%
发文量
82
审稿时长
3 months
期刊介绍: Journal of Cheminformatics is an open access journal publishing original peer-reviewed research in all aspects of cheminformatics and molecular modelling. Coverage includes, but is not limited to: chemical information systems, software and databases, and molecular modelling, chemical structure representations and their use in structure, substructure, and similarity searching of chemical substance and chemical reaction databases, computer and molecular graphics, computer-aided molecular design, expert systems, QSAR, and data mining techniques.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信