Harry F. Rickerby, Katya Putintseva, Christopher Cozens
{"title":"机器学习驱动的蛋白质工程:计算药物发现的案例研究","authors":"Harry F. Rickerby, Katya Putintseva, Christopher Cozens","doi":"10.1049/enb.2019.0019","DOIUrl":null,"url":null,"abstract":"<div>\n <p>Research and development in drug discovery will need to find significant efficiency gains if the industry is to continue generating novel drugs. There is great expectation for machine learning (ML) to provide this boost in R&D productivity, but to harness the full potential of ML, the generation of new, high-quality datasets will be necessary. Here, the authors present a platform that combines high-throughput display and selection data generation with ML. More specifically, deep learning is used to inform the directed evolution of novel biotherapeutics using DNA library synthesis, ultra-high throughput selections, and next generation sequencing. By combining the learnings of multiple <i>in silico</i> models, their platform enables multi-parameter optimisation across multiple important protein characteristics. They also present a model for benchmarking these ML-driven drug discovery platforms according to the accuracy of their underlying <i>in silico</i> models, in conjunction with the throughput of their empirical experimentation.</p>\n </div>","PeriodicalId":72921,"journal":{"name":"Engineering biology","volume":"4 1","pages":"7-9"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1049/enb.2019.0019","citationCount":"3","resultStr":"{\"title\":\"Machine learning-driven protein engineering: a case study in computational drug discovery\",\"authors\":\"Harry F. Rickerby, Katya Putintseva, Christopher Cozens\",\"doi\":\"10.1049/enb.2019.0019\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n <p>Research and development in drug discovery will need to find significant efficiency gains if the industry is to continue generating novel drugs. There is great expectation for machine learning (ML) to provide this boost in R&D productivity, but to harness the full potential of ML, the generation of new, high-quality datasets will be necessary. Here, the authors present a platform that combines high-throughput display and selection data generation with ML. More specifically, deep learning is used to inform the directed evolution of novel biotherapeutics using DNA library synthesis, ultra-high throughput selections, and next generation sequencing. By combining the learnings of multiple <i>in silico</i> models, their platform enables multi-parameter optimisation across multiple important protein characteristics. They also present a model for benchmarking these ML-driven drug discovery platforms according to the accuracy of their underlying <i>in silico</i> models, in conjunction with the throughput of their empirical experimentation.</p>\\n </div>\",\"PeriodicalId\":72921,\"journal\":{\"name\":\"Engineering biology\",\"volume\":\"4 1\",\"pages\":\"7-9\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-03-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1049/enb.2019.0019\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/enb.2019.0019\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering biology","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/enb.2019.0019","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Machine learning-driven protein engineering: a case study in computational drug discovery
Research and development in drug discovery will need to find significant efficiency gains if the industry is to continue generating novel drugs. There is great expectation for machine learning (ML) to provide this boost in R&D productivity, but to harness the full potential of ML, the generation of new, high-quality datasets will be necessary. Here, the authors present a platform that combines high-throughput display and selection data generation with ML. More specifically, deep learning is used to inform the directed evolution of novel biotherapeutics using DNA library synthesis, ultra-high throughput selections, and next generation sequencing. By combining the learnings of multiple in silico models, their platform enables multi-parameter optimisation across multiple important protein characteristics. They also present a model for benchmarking these ML-driven drug discovery platforms according to the accuracy of their underlying in silico models, in conjunction with the throughput of their empirical experimentation.