知识组织网络(NEKO):用于合成生物学研究的人工智能知识挖掘工作流程。

IF 6.8 1区 生物学 Q1 BIOTECHNOLOGY & APPLIED MICROBIOLOGY
Zhengyang Xiao, Himadri B Pakrasi, Yixin Chen, Yinjie J Tang
{"title":"知识组织网络(NEKO):用于合成生物学研究的人工智能知识挖掘工作流程。","authors":"Zhengyang Xiao, Himadri B Pakrasi, Yixin Chen, Yinjie J Tang","doi":"10.1016/j.ymben.2024.11.006","DOIUrl":null,"url":null,"abstract":"<p><p>Large language models (LLMs) can complete general scientific question-and-answer, yet they are constrained by their pretraining cut-off dates and lack the ability to provide specific, cited scientific knowledge. Here, we introduce Network for Knowledge Organization (NEKO), a workflow that uses LLM Qwen to extract knowledge through scientific literature text mining. When user inputs a keyword of interest, NEKO can generate knowledge graphs to link bioinformation entities and perform comprehensive summaries from PubMed search. NEKO significantly enhance LLM ability and has immediate applications in daily academic tasks such as education of young scientists, literature review, paper writing, experiment planning/troubleshooting, and new ideas/hypothesis generation. We exemplified this workflow's applicability through several case studies on yeast fermentation and cyanobacterial biorefinery. NEKO's output is more informative, specific, and actionable than GPT-4's zero-shot Q&A. NEKO offers flexible, lightweight local deployment options. NEKO democratizes artificial intelligence (AI) tools, making scientific foundation model more accessible to researchers without excessive computational power.</p>","PeriodicalId":18483,"journal":{"name":"Metabolic engineering","volume":" ","pages":""},"PeriodicalIF":6.8000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Network for Knowledge Organization (NEKO): an AI knowledge mining workflow for synthetic biology research.\",\"authors\":\"Zhengyang Xiao, Himadri B Pakrasi, Yixin Chen, Yinjie J Tang\",\"doi\":\"10.1016/j.ymben.2024.11.006\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Large language models (LLMs) can complete general scientific question-and-answer, yet they are constrained by their pretraining cut-off dates and lack the ability to provide specific, cited scientific knowledge. Here, we introduce Network for Knowledge Organization (NEKO), a workflow that uses LLM Qwen to extract knowledge through scientific literature text mining. When user inputs a keyword of interest, NEKO can generate knowledge graphs to link bioinformation entities and perform comprehensive summaries from PubMed search. NEKO significantly enhance LLM ability and has immediate applications in daily academic tasks such as education of young scientists, literature review, paper writing, experiment planning/troubleshooting, and new ideas/hypothesis generation. We exemplified this workflow's applicability through several case studies on yeast fermentation and cyanobacterial biorefinery. NEKO's output is more informative, specific, and actionable than GPT-4's zero-shot Q&A. NEKO offers flexible, lightweight local deployment options. NEKO democratizes artificial intelligence (AI) tools, making scientific foundation model more accessible to researchers without excessive computational power.</p>\",\"PeriodicalId\":18483,\"journal\":{\"name\":\"Metabolic engineering\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":6.8000,\"publicationDate\":\"2024-11-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Metabolic engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1016/j.ymben.2024.11.006\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOTECHNOLOGY & APPLIED MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Metabolic engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.ymben.2024.11.006","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

大型语言模型(LLM)可以完成一般的科学问答,但它们受到预训练截止日期的限制,缺乏提供具体的引用科学知识的能力。在此,我们介绍了知识组织网络(NEKO),这是一种利用 LLM Qwen 通过科学文献文本挖掘来提取知识的工作流程。当用户输入感兴趣的关键词时,NEKO 可以生成知识图谱来链接生物信息实体,并对 PubMed 搜索结果进行全面总结。NEKO 大大提高了 LLM 的能力,并可立即应用于日常学术工作,如青年科学家教育、文献综述、论文写作、实验计划/故障排除以及新想法/新假设的产生。我们通过几个关于酵母发酵和蓝藻生物炼制的案例研究来说明这一工作流程的适用性。与 GPT-4 的零点问答相比,NEKO 的输出信息更丰富、更具体、更可操作。NEKO 提供灵活、轻量级的本地部署选项。NEKO 实现了人工智能(AI)工具的民主化,使没有过多计算能力的研究人员更容易获得科学基础模型。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Network for Knowledge Organization (NEKO): an AI knowledge mining workflow for synthetic biology research.

Large language models (LLMs) can complete general scientific question-and-answer, yet they are constrained by their pretraining cut-off dates and lack the ability to provide specific, cited scientific knowledge. Here, we introduce Network for Knowledge Organization (NEKO), a workflow that uses LLM Qwen to extract knowledge through scientific literature text mining. When user inputs a keyword of interest, NEKO can generate knowledge graphs to link bioinformation entities and perform comprehensive summaries from PubMed search. NEKO significantly enhance LLM ability and has immediate applications in daily academic tasks such as education of young scientists, literature review, paper writing, experiment planning/troubleshooting, and new ideas/hypothesis generation. We exemplified this workflow's applicability through several case studies on yeast fermentation and cyanobacterial biorefinery. NEKO's output is more informative, specific, and actionable than GPT-4's zero-shot Q&A. NEKO offers flexible, lightweight local deployment options. NEKO democratizes artificial intelligence (AI) tools, making scientific foundation model more accessible to researchers without excessive computational power.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Metabolic engineering
Metabolic engineering 工程技术-生物工程与应用微生物
CiteScore
15.60
自引率
6.00%
发文量
140
审稿时长
44 days
期刊介绍: Metabolic Engineering (MBE) is a journal that focuses on publishing original research papers on the directed modulation of metabolic pathways for metabolite overproduction or the enhancement of cellular properties. It welcomes papers that describe the engineering of native pathways and the synthesis of heterologous pathways to convert microorganisms into microbial cell factories. The journal covers experimental, computational, and modeling approaches for understanding metabolic pathways and manipulating them through genetic, media, or environmental means. Effective exploration of metabolic pathways necessitates the use of molecular biology and biochemistry methods, as well as engineering techniques for modeling and data analysis. MBE serves as a platform for interdisciplinary research in fields such as biochemistry, molecular biology, applied microbiology, cellular physiology, cellular nutrition in health and disease, and biochemical engineering. The journal publishes various types of papers, including original research papers and review papers. It is indexed and abstracted in databases such as Scopus, Embase, EMBiology, Current Contents - Life Sciences and Clinical Medicine, Science Citation Index, PubMed/Medline, CAS and Biotechnology Citation Index.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信