Christine Ann Withers, Amina Mardiyyah Rufai, Aravind Venkatesan, Santosh Tirunagari, Sebastian Lobentanzer, Melissa Harrison, Barbara Zdrazil
{"title":"Natural language processing in drug discovery: bridging the gap between text and therapeutics with artificial intelligence.","authors":"Christine Ann Withers, Amina Mardiyyah Rufai, Aravind Venkatesan, Santosh Tirunagari, Sebastian Lobentanzer, Melissa Harrison, Barbara Zdrazil","doi":"10.1080/17460441.2025.2490835","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>The field of Natural Language Processing (NLP) within the life sciences has exploded in its capacity to aid the extraction and analysis of data from scientific texts in recent years through the advancement of Artificial Intelligence (AI). Drug discovery pipelines have been innovated and accelerated by the uptake of AI/Machine Learning (ML) techniques.</p><p><strong>Areas covered: </strong>The authors provide background on Named Entity Recognition (NER) in text - from tagging terms in text using ontologies to entity identification via ML models. They also explore the use of Knowledge Graphs (KGs) in biological data ingestion, manipulation, and extraction, leading into the modern age of Large Language Models (LLMs) and their ability to maneuver complex and abundant data. The authors also cover the main strengths and weaknesses of the many methods available when undertaking NLP tasks in drug discovery. Literature was derived from searches utilizing Europe PMC, ResearchRabbit and SciSpace.</p><p><strong>Expert opinion: </strong>The mass of scientific data that is now produced each year is both a huge positive for potential innovation in drug discovery and a new hurdle for researchers to overcome. Notably, methods should be selected to fit a use case and the data available, as each method performs optimally under different conditions.</p>","PeriodicalId":12267,"journal":{"name":"Expert Opinion on Drug Discovery","volume":" ","pages":"765-783"},"PeriodicalIF":6.0000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Opinion on Drug Discovery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/17460441.2025.2490835","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/30 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: The field of Natural Language Processing (NLP) within the life sciences has exploded in its capacity to aid the extraction and analysis of data from scientific texts in recent years through the advancement of Artificial Intelligence (AI). Drug discovery pipelines have been innovated and accelerated by the uptake of AI/Machine Learning (ML) techniques.
Areas covered: The authors provide background on Named Entity Recognition (NER) in text - from tagging terms in text using ontologies to entity identification via ML models. They also explore the use of Knowledge Graphs (KGs) in biological data ingestion, manipulation, and extraction, leading into the modern age of Large Language Models (LLMs) and their ability to maneuver complex and abundant data. The authors also cover the main strengths and weaknesses of the many methods available when undertaking NLP tasks in drug discovery. Literature was derived from searches utilizing Europe PMC, ResearchRabbit and SciSpace.
Expert opinion: The mass of scientific data that is now produced each year is both a huge positive for potential innovation in drug discovery and a new hurdle for researchers to overcome. Notably, methods should be selected to fit a use case and the data available, as each method performs optimally under different conditions.
期刊介绍:
Expert Opinion on Drug Discovery (ISSN 1746-0441 [print], 1746-045X [electronic]) is a MEDLINE-indexed, peer-reviewed, international journal publishing review articles on novel technologies involved in the drug discovery process, leading to new leads and reduced attrition rates. Each article is structured to incorporate the author’s own expert opinion on the scope for future development.
The Editors welcome:
Reviews covering chemoinformatics; bioinformatics; assay development; novel screening technologies; in vitro/in vivo models; structure-based drug design; systems biology
Drug Case Histories examining the steps involved in the preclinical and clinical development of a particular drug
The audience consists of scientists and managers in the healthcare and pharmaceutical industry, academic pharmaceutical scientists and other closely related professionals looking to enhance the success of their drug candidates through optimisation at the preclinical level.