{"title":"3DSGIMD: An accurate and interpretable molecular property prediction method using 3D spatial graph focusing network and structure-based feature fusion","authors":"","doi":"10.1016/j.future.2024.07.004","DOIUrl":null,"url":null,"abstract":"<div><p>A comprehensive representation of molecular structure is essential for establishing accurate and reliable molecular property prediction models. However, fully extracting and learning intrinsic molecular structure information, especially spatial structure features, remains a challenging task, leading that many molecular property prediction models still have no enough accuracy for the real application. In this study, we developed an innovative and interpretable deep learning method, termed 3DSGIMD, which predicted the molecular properties by integrating and learning the spatial structure and substructure information of molecules at multiple levels, and generated the focusing weights by aggregating spatial and adjacency information of molecules to improve understanding of prediction results. We evaluated the model on 10 public datasets and 14 cell-based phenotypic screening datasets. Extensive experimental results indicated that 3DSGIMD achieved superior or comparable predictive performance compared with some existing models, and the individually designed components contributed significantly to the advanced performance of the model. In addition, we also provided insight into the interpretability of our model via visualizing the focusing weights and perturbation analysis, and the results showed that 3DSGIMD can pinpoint crucial local structures and bits of molecular descriptors associated with the predicted properties. In summary, 3DSGIMD is a competitive molecular property prediction method that holds the potential to aid drug design and optimization.</p></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":null,"pages":null},"PeriodicalIF":6.2000,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X24003662","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
A comprehensive representation of molecular structure is essential for establishing accurate and reliable molecular property prediction models. However, fully extracting and learning intrinsic molecular structure information, especially spatial structure features, remains a challenging task, leading that many molecular property prediction models still have no enough accuracy for the real application. In this study, we developed an innovative and interpretable deep learning method, termed 3DSGIMD, which predicted the molecular properties by integrating and learning the spatial structure and substructure information of molecules at multiple levels, and generated the focusing weights by aggregating spatial and adjacency information of molecules to improve understanding of prediction results. We evaluated the model on 10 public datasets and 14 cell-based phenotypic screening datasets. Extensive experimental results indicated that 3DSGIMD achieved superior or comparable predictive performance compared with some existing models, and the individually designed components contributed significantly to the advanced performance of the model. In addition, we also provided insight into the interpretability of our model via visualizing the focusing weights and perturbation analysis, and the results showed that 3DSGIMD can pinpoint crucial local structures and bits of molecular descriptors associated with the predicted properties. In summary, 3DSGIMD is a competitive molecular property prediction method that holds the potential to aid drug design and optimization.
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.