David Meijer, Mehdi A Beniddir, Connor W Coley, Yassine M Mejri, Meltem Öztürk, Justin J J van der Hooft, Marnix H Medema, Adam Skiredj
{"title":"Empowering natural product science with AI: leveraging multimodal data and knowledge graphs.","authors":"David Meijer, Mehdi A Beniddir, Connor W Coley, Yassine M Mejri, Meltem Öztürk, Justin J J van der Hooft, Marnix H Medema, Adam Skiredj","doi":"10.1039/d4np00008k","DOIUrl":null,"url":null,"abstract":"<p><p>Artificial intelligence (AI) is accelerating how we conduct science, from folding proteins with AlphaFold and summarizing literature findings with large language models, to annotating genomes and prioritizing newly generated molecules for screening using specialized software. However, the application of AI to emulate human cognition in natural product research and its subsequent impact has so far been limited. One reason for this limited impact is that available natural product data is multimodal, unbalanced, unstandardized, and scattered across many data repositories. This makes natural product data challenging to use with existing deep learning architectures that consume fairly standardized, often non-relational, data. It also prevents models from learning overarching patterns in natural product science. In this Viewpoint, we address this challenge and support ongoing initiatives aimed at democratizing natural product data by collating our collective knowledge into a knowledge graph. By doing so, we believe there will be an opportunity to use such a knowledge graph to develop AI models that can truly mimic natural product scientists' decision-making.</p>","PeriodicalId":94,"journal":{"name":"Natural Product Reports","volume":" ","pages":""},"PeriodicalIF":10.2000,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11327853/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Product Reports","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1039/d4np00008k","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Artificial intelligence (AI) is accelerating how we conduct science, from folding proteins with AlphaFold and summarizing literature findings with large language models, to annotating genomes and prioritizing newly generated molecules for screening using specialized software. However, the application of AI to emulate human cognition in natural product research and its subsequent impact has so far been limited. One reason for this limited impact is that available natural product data is multimodal, unbalanced, unstandardized, and scattered across many data repositories. This makes natural product data challenging to use with existing deep learning architectures that consume fairly standardized, often non-relational, data. It also prevents models from learning overarching patterns in natural product science. In this Viewpoint, we address this challenge and support ongoing initiatives aimed at democratizing natural product data by collating our collective knowledge into a knowledge graph. By doing so, we believe there will be an opportunity to use such a knowledge graph to develop AI models that can truly mimic natural product scientists' decision-making.
期刊介绍:
Natural Product Reports (NPR) serves as a pivotal critical review journal propelling advancements in all facets of natural products research, encompassing isolation, structural and stereochemical determination, biosynthesis, biological activity, and synthesis.
With a broad scope, NPR extends its influence into the wider bioinorganic, bioorganic, and chemical biology communities. Covering areas such as enzymology, nucleic acids, genetics, chemical ecology, carbohydrates, primary and secondary metabolism, and analytical techniques, the journal provides insightful articles focusing on key developments shaping the field, rather than offering exhaustive overviews of all results.
NPR encourages authors to infuse their perspectives on developments, trends, and future directions, fostering a dynamic exchange of ideas within the natural products research community.