SeqImprove:机器学习辅助基因回路序列信息的整理

IF 3.9 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Jeanet Mante, Zach Sents, Duncan Britt, William Mo, Chunxiao Liao, Ryan Greer and Chris J. Myers*, 
{"title":"SeqImprove:机器学习辅助基因回路序列信息的整理","authors":"Jeanet Mante,&nbsp;Zach Sents,&nbsp;Duncan Britt,&nbsp;William Mo,&nbsp;Chunxiao Liao,&nbsp;Ryan Greer and Chris J. Myers*,&nbsp;","doi":"10.1021/acssynbio.4c0039210.1021/acssynbio.4c00392","DOIUrl":null,"url":null,"abstract":"<p >The progress and utility of synthetic biology is currently hindered by the lengthy process of studying literature and replicating poorly documented work. Reconstruction of crucial design information through post hoc curation is highly noisy and error-prone. To combat this, author participation during the curation process is crucial. To encourage author participation without overburdening them, an ML-assisted curation tool called SeqImprove has been developed. Using named entity recognition, called entity normalization, and sequence matching, SeqImprove creates machine-accessible sequence data and metadata annotations, which authors can then review and edit before submitting a final sequence file. SeqImprove makes it easier for authors to submit sequence data that is FAIR (findable, accessible, interoperable, and reusable).</p>","PeriodicalId":26,"journal":{"name":"ACS Synthetic Biology","volume":"13 9","pages":"3051–3055 3051–3055"},"PeriodicalIF":3.9000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SeqImprove: Machine-Learning-Assisted Curation of Genetic Circuit Sequence Information\",\"authors\":\"Jeanet Mante,&nbsp;Zach Sents,&nbsp;Duncan Britt,&nbsp;William Mo,&nbsp;Chunxiao Liao,&nbsp;Ryan Greer and Chris J. Myers*,&nbsp;\",\"doi\":\"10.1021/acssynbio.4c0039210.1021/acssynbio.4c00392\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >The progress and utility of synthetic biology is currently hindered by the lengthy process of studying literature and replicating poorly documented work. Reconstruction of crucial design information through post hoc curation is highly noisy and error-prone. To combat this, author participation during the curation process is crucial. To encourage author participation without overburdening them, an ML-assisted curation tool called SeqImprove has been developed. Using named entity recognition, called entity normalization, and sequence matching, SeqImprove creates machine-accessible sequence data and metadata annotations, which authors can then review and edit before submitting a final sequence file. SeqImprove makes it easier for authors to submit sequence data that is FAIR (findable, accessible, interoperable, and reusable).</p>\",\"PeriodicalId\":26,\"journal\":{\"name\":\"ACS Synthetic Biology\",\"volume\":\"13 9\",\"pages\":\"3051–3055 3051–3055\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Synthetic Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acssynbio.4c00392\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Synthetic Biology","FirstCategoryId":"99","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acssynbio.4c00392","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

目前,研究文献和复制记录不全的工作过程漫长,阻碍了合成生物学的发展和应用。通过事后整理重建关键的设计信息非常嘈杂,而且容易出错。要解决这个问题,作者在策划过程中的参与至关重要。为了在不增加作者负担的情况下鼓励他们参与,我们开发了一种名为 SeqImprove 的人工智能辅助整理工具。SeqImprove 利用命名实体识别、命名实体规范化和序列匹配,创建机器可访问的序列数据和元数据注释,作者可以在提交最终序列文件之前对其进行审查和编辑。SeqImprove 使作者更容易提交 FAIR(可查找、可访问、可互操作和可重用)序列数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

SeqImprove: Machine-Learning-Assisted Curation of Genetic Circuit Sequence Information

SeqImprove: Machine-Learning-Assisted Curation of Genetic Circuit Sequence Information

The progress and utility of synthetic biology is currently hindered by the lengthy process of studying literature and replicating poorly documented work. Reconstruction of crucial design information through post hoc curation is highly noisy and error-prone. To combat this, author participation during the curation process is crucial. To encourage author participation without overburdening them, an ML-assisted curation tool called SeqImprove has been developed. Using named entity recognition, called entity normalization, and sequence matching, SeqImprove creates machine-accessible sequence data and metadata annotations, which authors can then review and edit before submitting a final sequence file. SeqImprove makes it easier for authors to submit sequence data that is FAIR (findable, accessible, interoperable, and reusable).

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
8.00
自引率
10.60%
发文量
380
审稿时长
6-12 weeks
期刊介绍: The journal is particularly interested in studies on the design and synthesis of new genetic circuits and gene products; computational methods in the design of systems; and integrative applied approaches to understanding disease and metabolism. Topics may include, but are not limited to: Design and optimization of genetic systems Genetic circuit design and their principles for their organization into programs Computational methods to aid the design of genetic systems Experimental methods to quantify genetic parts, circuits, and metabolic fluxes Genetic parts libraries: their creation, analysis, and ontological representation Protein engineering including computational design Metabolic engineering and cellular manufacturing, including biomass conversion Natural product access, engineering, and production Creative and innovative applications of cellular programming Medical applications, tissue engineering, and the programming of therapeutic cells Minimal cell design and construction Genomics and genome replacement strategies Viral engineering Automated and robotic assembly platforms for synthetic biology DNA synthesis methodologies Metagenomics and synthetic metagenomic analysis Bioinformatics applied to gene discovery, chemoinformatics, and pathway construction Gene optimization Methods for genome-scale measurements of transcription and metabolomics Systems biology and methods to integrate multiple data sources in vitro and cell-free synthetic biology and molecular programming Nucleic acid engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信