机器学习驱动的蛋白质工程:计算药物发现的案例研究

Engineering biology Pub Date : 2020-03-16 DOI:10.1049/enb.2019.0019

Harry F. Rickerby, Katya Putintseva, Christopher Cozens

{"title":"机器学习驱动的蛋白质工程:计算药物发现的案例研究","authors":"Harry F. Rickerby, Katya Putintseva, Christopher Cozens","doi":"10.1049/enb.2019.0019","DOIUrl":null,"url":null,"abstract":"<div>\n <p>Research and development in drug discovery will need to find significant efficiency gains if the industry is to continue generating novel drugs. There is great expectation for machine learning (ML) to provide this boost in R&D productivity, but to harness the full potential of ML, the generation of new, high-quality datasets will be necessary. Here, the authors present a platform that combines high-throughput display and selection data generation with ML. More specifically, deep learning is used to inform the directed evolution of novel biotherapeutics using DNA library synthesis, ultra-high throughput selections, and next generation sequencing. By combining the learnings of multiple <i>in silico</i> models, their platform enables multi-parameter optimisation across multiple important protein characteristics. They also present a model for benchmarking these ML-driven drug discovery platforms according to the accuracy of their underlying <i>in silico</i> models, in conjunction with the throughput of their empirical experimentation.</p>\n </div>","PeriodicalId":72921,"journal":{"name":"Engineering biology","volume":"4 1","pages":"7-9"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1049/enb.2019.0019","citationCount":"3","resultStr":"{\"title\":\"Machine learning-driven protein engineering: a case study in computational drug discovery\",\"authors\":\"Harry F. Rickerby, Katya Putintseva, Christopher Cozens\",\"doi\":\"10.1049/enb.2019.0019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n <p>Research and development in drug discovery will need to find significant efficiency gains if the industry is to continue generating novel drugs. There is great expectation for machine learning (ML) to provide this boost in R&D productivity, but to harness the full potential of ML, the generation of new, high-quality datasets will be necessary. Here, the authors present a platform that combines high-throughput display and selection data generation with ML. More specifically, deep learning is used to inform the directed evolution of novel biotherapeutics using DNA library synthesis, ultra-high throughput selections, and next generation sequencing. By combining the learnings of multiple <i>in silico</i> models, their platform enables multi-parameter optimisation across multiple important protein characteristics. They also present a model for benchmarking these ML-driven drug discovery platforms according to the accuracy of their underlying <i>in silico</i> models, in conjunction with the throughput of their empirical experimentation.</p>\\n </div>\",\"PeriodicalId\":72921,\"journal\":{\"name\":\"Engineering biology\",\"volume\":\"4 1\",\"pages\":\"7-9\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-03-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1049/enb.2019.0019\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/enb.2019.0019\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering biology","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/enb.2019.0019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

如果制药行业要继续生产新药，那么药物研发将需要找到显著的效率提升。人们对机器学习(ML)提供研发生产力的极大期望，但要充分利用ML的潜力，生成新的高质量数据集将是必要的。在这里，作者提出了一个将高通量显示和选择数据生成与ML相结合的平台。更具体地说，深度学习用于通过DNA文库合成、超高通量选择和下一代测序来指导新型生物治疗药物的定向进化。通过结合多个计算机模型的学习，他们的平台可以跨多个重要蛋白质特征进行多参数优化。他们还提出了一个模型，根据其基础计算机模型的准确性，结合其经验实验的吞吐量，对这些机器学习驱动的药物发现平台进行基准测试。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Machine learning-driven protein engineering: a case study in computational drug discovery

查看原文本刊更多论文

Machine learning-driven protein engineering: a case study in computational drug discovery

Research and development in drug discovery will need to find significant efficiency gains if the industry is to continue generating novel drugs. There is great expectation for machine learning (ML) to provide this boost in R&D productivity, but to harness the full potential of ML, the generation of new, high-quality datasets will be necessary. Here, the authors present a platform that combines high-throughput display and selection data generation with ML. More specifically, deep learning is used to inform the directed evolution of novel biotherapeutics using DNA library synthesis, ultra-high throughput selections, and next generation sequencing. By combining the learnings of multiple in silico models, their platform enables multi-parameter optimisation across multiple important protein characteristics. They also present a model for benchmarking these ML-driven drug discovery platforms according to the accuracy of their underlying in silico models, in conjunction with the throughput of their empirical experimentation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Engineering biology

自引率

0.00%

发文量

审稿时长

22 weeks