Enabling data-limited chemical bioactivity predictions through deep neural network transfer learning

Ruifeng Liu, Srinivas Laxminarayan, Jaques Reifman, Anders Wallqvist

Journal of Computer-Aided Molecular Design
DOI: 10.1007/s10822-022-00486-x
Published: 2022-10-22
URL: https://link.springer.com/article/10.1007/s10822-022-00486-x
Citations: 1
Abstract
The main limitation in developing deep neural network (DNN) models to predict bioactivity properties of chemicals is the lack of sufficient assay data to train the network’s classification layers. Focusing on feedforward DNNs that use atom- and bond-based structural fingerprints as input, we examined whether layers of a fully trained DNN based on large amounts of data to predict one property could be used to develop DNNs to predict other related or unrelated properties based on limited amounts of data. Hence, we assessed if and under what conditions the dense layers of a pre-trained DNN could be transferred and used for the development of another DNN associated with limited training data. We carried out a quantitative study employing more than 400 pairs of assay datasets, where we used fully trained layers from a large dataset to augment the training of a small dataset. We found that the higher the correlation r between two assay datasets, the more efficient the transfer learning is in reducing prediction errors associated with the smaller dataset DNN predictions. The reduction in mean squared prediction errors ranged from 10 to 20% for every 0.1 increase in r² between the datasets, with the bulk of the error reductions associated with transfers of the first dense layer. Transfer of other dense layers did not result in additional benefits, suggesting that deeper, dense layers conveyed more specialized and assay-specific information. Importantly, depending on the dataset correlation, training sample size could be reduced by up to tenfold without any loss of prediction accuracy.
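The abstract describes transferring the fully trained first dense layer of a fingerprint-based feedforward DNN into a second DNN that is then trained on a much smaller assay dataset. The sketch below illustrates that layer-transfer workflow in Keras under stated assumptions: the fingerprint length (2048 bits), layer widths, optimizer settings, and the dataset variables X_large/y_large and X_small/y_small are illustrative placeholders, not the architecture or data reported in the paper.

```python
# Hypothetical sketch of dense-layer transfer between two fingerprint-based DNNs.
# Layer sizes, fingerprint length, and dataset variables are illustrative assumptions.
from tensorflow import keras
from tensorflow.keras import layers

FP_BITS = 2048  # assumed length of the atom-/bond-based structural fingerprint


def build_regressor(fp_bits=FP_BITS):
    """Feedforward DNN mapping a fingerprint vector to a single activity value."""
    return keras.Sequential([
        layers.Input(shape=(fp_bits,)),
        layers.Dense(1024, activation="relu", name="dense_1"),
        layers.Dense(512, activation="relu", name="dense_2"),
        layers.Dense(1, name="output"),
    ])


# 1. Fully train a source model on the large assay dataset (X_large, y_large assumed given).
source = build_regressor()
source.compile(optimizer="adam", loss="mse")
# source.fit(X_large, y_large, epochs=100, batch_size=128)

# 2. Build the target model for the small assay dataset and copy over the first dense layer.
target = build_regressor()
target.get_layer("dense_1").set_weights(source.get_layer("dense_1").get_weights())
target.get_layer("dense_1").trainable = False  # freeze the transferred layer

# 3. Train only the remaining layers on the limited data (X_small, y_small assumed given).
target.compile(optimizer="adam", loss="mse")
# target.fit(X_small, y_small, epochs=100, batch_size=32)
```

Freezing only the first dense layer mirrors the paper's finding that most of the error reduction comes from transferring that layer, while deeper layers carry more assay-specific information and are left to train on the small dataset.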
Journal Description
The Journal of Computer-Aided Molecular Design provides a forum for disseminating information on both the theory and the application of computer-based methods in the analysis and design of molecules. The scope of the journal encompasses papers that report new and original research and applications in the following areas:
- theoretical chemistry;
- computational chemistry;
- computer and molecular graphics;
- molecular modeling;
- protein engineering;
- drug design;
- expert systems;
- general structure-property relationships;
- molecular dynamics;
- chemical database development and usage.