Molecular Informatics最新文献

筛选
英文 中文
Data-driven approaches for identifying hyperparameters in multi-step retrosynthesis. 用于识别多步骤逆转录合成中的超参数的数据驱动方法。
IF 3.6 4区 医学
Molecular Informatics Pub Date : 2023-11-01 Epub Date: 2023-09-27 DOI: 10.1002/minf.202300128
Annie M Westerlund, Bente Barge, Lewis Mervin, Samuel Genheden
{"title":"Data-driven approaches for identifying hyperparameters in multi-step retrosynthesis.","authors":"Annie M Westerlund, Bente Barge, Lewis Mervin, Samuel Genheden","doi":"10.1002/minf.202300128","DOIUrl":"10.1002/minf.202300128","url":null,"abstract":"<p><p>The multi-step retrosynthesis problem can be solved by a search algorithm, such as Monte Carlo tree search (MCTS). The performance of multistep retrosynthesis, as measured by a trade-off in search time and route solvability, therefore depends on the hyperparameters of the search algorithm. In this paper, we demonstrated the effect of three MCTS hyperparameters (number of iterations, tree depth, and tree width) on metrics such as Linear integrated speed-accuracy score (LISAS) and Inverse efficiency score which consider both route solvability and search time. This exploration was conducted by employing three data-driven approaches, namely a systematic grid search, Bayesian optimization over an ensemble of molecules to obtain static MCTS hyperparameters, and a machine learning approach to dynamically predict optimal MCTS hyperparameters given an input target molecule. With the obtained results on the internal dataset, we demonstrated that it is possible to identify a hyperparameter set which outperforms the current AiZynthFinder default setting. It appeared optimal across a variety of target input molecules, both on proprietary and public datasets. The settings identified with the in-house dataset reached a solvability of 93 % and median search time of 151 s for the in-house dataset, and a 74 % solvability and 114 s for the ChEMBL dataset. These numbers can be compared to the current default settings which solved 85 % and 73 % during a median time of 110s and 84 s, for in-house and ChEMBL, respectively.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":null,"pages":null},"PeriodicalIF":3.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10553301","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Phenothiazine-based virtual screening, molecular docking, and molecular dynamics of new trypanothione reductase inhibitors of Trypanosoma cruzi. 新锥虫锥硫酮还原酶抑制剂基于吩噻嗪的虚拟筛选、分子对接和分子动力学。
IF 3.6 4区 医学
Molecular Informatics Pub Date : 2023-10-01 Epub Date: 2023-08-21 DOI: 10.1002/minf.202300069
Gildardo Rivera, Alonzo González-González, Citlali Vázquez, Rusely Encalada, Emma Saavedra, Lenci K Vázquez-Jiménez, Eyra Ortiz-Pérez, Maria Bolognesi
{"title":"Phenothiazine-based virtual screening, molecular docking, and molecular dynamics of new trypanothione reductase inhibitors of Trypanosoma cruzi.","authors":"Gildardo Rivera,&nbsp;Alonzo González-González,&nbsp;Citlali Vázquez,&nbsp;Rusely Encalada,&nbsp;Emma Saavedra,&nbsp;Lenci K Vázquez-Jiménez,&nbsp;Eyra Ortiz-Pérez,&nbsp;Maria Bolognesi","doi":"10.1002/minf.202300069","DOIUrl":"10.1002/minf.202300069","url":null,"abstract":"<p><p>Phenothiazine derivatives can unselectively inhibit the trypanothione-dependent antioxidant system enzyme trypanothione reductase (TR). A virtual screening of 2163 phenothiazine derivatives from the ZINC15 and PubChem databases docked on the active site of T. cruzi TR showed that 285 compounds have higher affinity than the natural ligand trypanothione disulfide. 244 compounds showed higher affinity toward the parasite's enzyme than to its human homolog glutathione reductase. Protein-ligand interaction profiling predicted that the main interactions for the top scored compounds were with residues important for trypanothione disulfide binding: Phe396, Pro398, Leu399, His461, Glu466, and Glu467, particularly His461, which participates in catalysis. Two compounds with the desired profiles, ZINC1033681 (Zn_C687) and ZINC10213096 (Zn_C216), decreased parasite growth by 20 % and 50 %, respectively. They behaved as mixed-type inhibitors of recombinant TR, with Ki values of 59 and 47 μM, respectively. This study provides a further understanding of the potential of phenothiazine derivatives as TR inhibitors.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":null,"pages":null},"PeriodicalIF":3.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10020855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conjugated quantitative structure-property relationship models: Prediction of kinetic characteristics linked by the Arrhenius equation. 共轭定量结构-性质关系模型:由阿伦尼斯方程联系的动力学特性预测。
IF 3.6 4区 医学
Molecular Informatics Pub Date : 2023-10-01 Epub Date: 2023-08-21 DOI: 10.1002/minf.202200275
Dmitry Zankov, Timur Madzhidov, Igor Baskin, Alexandre Varnek
{"title":"Conjugated quantitative structure-property relationship models: Prediction of kinetic characteristics linked by the Arrhenius equation.","authors":"Dmitry Zankov,&nbsp;Timur Madzhidov,&nbsp;Igor Baskin,&nbsp;Alexandre Varnek","doi":"10.1002/minf.202200275","DOIUrl":"10.1002/minf.202200275","url":null,"abstract":"<p><p>Conjugated QSPR models for reactions integrate fundamental chemical laws expressed by mathematical equations with machine learning algorithms. Herein we present a methodology for building conjugated QSPR models integrated with the Arrhenius equation. Conjugated QSPR models were used to predict kinetic characteristics of cycloaddition reactions related by the Arrhenius equation: rate constant <math> <semantics><mrow><mi>l</mi> <mi>o</mi> <mi>g</mi> <mi>k</mi></mrow> <annotation>${{rm l}{rm o}{rm g}k}$</annotation> </semantics> </math> , pre-exponential factor <math> <semantics><mrow><mi>l</mi> <mi>o</mi> <mi>g</mi> <mi>A</mi></mrow> <annotation>${{rm l}{rm o}{rm g}A}$</annotation> </semantics> </math> , and activation energy <math> <semantics><msub><mi>E</mi> <mi>a</mi></msub> <annotation>${{E}_{{rm a}}}$</annotation> </semantics> </math> . They were benchmarked against single-task (individual and equation-based models) and multi-task models. In individual models, all characteristics were modeled separately, while in multi-task models <math> <semantics><mrow><mi>l</mi> <mi>o</mi> <mi>g</mi> <mi>k</mi></mrow> <annotation>${{rm l}{rm o}{rm g}k}$</annotation> </semantics> </math> , <math> <semantics><mrow><mi>l</mi> <mi>o</mi> <mi>g</mi> <mi>A</mi></mrow> <annotation>${{rm l}{rm o}{rm g}A}$</annotation> </semantics> </math> and <math> <semantics><msub><mi>E</mi> <mi>a</mi></msub> <annotation>${{E}_{{rm a}}}$</annotation> </semantics> </math> were treated cooperatively. An equation-based model assessed <math> <semantics><mrow><mi>l</mi> <mi>o</mi> <mi>g</mi> <mi>k</mi></mrow> <annotation>${{rm l}{rm o}{rm g}k}$</annotation> </semantics> </math> using the Arrhenius equation and <math> <semantics><mrow><mi>l</mi> <mi>o</mi> <mi>g</mi> <mi>A</mi></mrow> <annotation>${{rm l}{rm o}{rm g}A}$</annotation> </semantics> </math> and <math> <semantics><msub><mi>E</mi> <mi>a</mi></msub> <annotation>${{E}_{{rm a}}}$</annotation> </semantics> </math> values predicted by individual models. It has been demonstrated that the conjugated QSPR models can accurately predict the reaction rate constants at extreme temperatures, at which reaction rate constants hardly can be measured experimentally. Also, in the case of small training sets conjugated models are more robust than related single-task approaches.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":null,"pages":null},"PeriodicalIF":3.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10029968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A multi-tier computational screening framework to effectively search the mutational space of SARS-CoV-2 receptor binding motif to identify mutants with enhanced ACE2 binding abilities. 一种有效搜索严重急性呼吸系统综合征冠状病毒2型变异空间的多层计算筛选框架 受体结合基序以鉴定具有增强的ACE2结合能力的突变体。
IF 3.6 4区 医学
Molecular Informatics Pub Date : 2023-10-01 Epub Date: 2023-08-31 DOI: 10.1002/minf.202300055
Sandipan Chakraborty, Chiranjeet Saha
{"title":"A multi-tier computational screening framework to effectively search the mutational space of SARS-CoV-2 receptor binding motif to identify mutants with enhanced ACE2 binding abilities.","authors":"Sandipan Chakraborty,&nbsp;Chiranjeet Saha","doi":"10.1002/minf.202300055","DOIUrl":"10.1002/minf.202300055","url":null,"abstract":"<p><p>SARS-CoV-2 gained crucial mutations at the receptor binding domain (RBD) that often changed the course of the pandemic leading to new waves with increased case fatality. Variants are observed with enhanced transmission and immune invasion abilities. Thus, predicting future variants with enhanced transmission ability is a problem of utmost research interest. Here, we have developed a multi-tier exhaustive SARS-CoV-2 mutation screening platform combining MM/GBSA, extensive molecular dynamics simulations, and steered molecular dynamics to identify RBD mutants with enhanced ACE2 binding capability. We have identified four RBM mutations (F490K, S494K, G504F, and the P499L) with significantly higher ACE2 binding abilities than wild-type RBD. Compared to wild-type RBD, they all form stable complexes with more hydrogen bonds and salt-bridge interactions with ACE2. Our simulation data suggest that these mutations allosterically alter the packing of the RBM interface of the RBD-ACE2 complex. As a result, the rupture force required to break the RBD-ACE2 contacts is significantly higher for these mutants.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":null,"pages":null},"PeriodicalIF":3.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10491472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature importance-based interpretation of UMAP-visualized polymer space. 基于特征重要性的UMAP可视化聚合物空间解释。
IF 3.6 4区 医学
Molecular Informatics Pub Date : 2023-08-01 Epub Date: 2023-06-16 DOI: 10.1002/minf.202300061
Takuya Ehiro
{"title":"Feature importance-based interpretation of UMAP-visualized polymer space.","authors":"Takuya Ehiro","doi":"10.1002/minf.202300061","DOIUrl":"10.1002/minf.202300061","url":null,"abstract":"<p><p>Dimensionality reduction (DR) techniques are used for various purposes such as exploratory data analysis. A commonly employed linear DR technique is principal component analysis (PCA), which is one of the most popular methods for DR. Owing to its linear nature, PCA enables the determination of axes in a low-dimensional space and the calculation of corresponding loading vectors. However, PCA cannot necessarily extract important features of non-linearly distributed data. This study presents a technique aimed at aiding the interpretation of data reduced through non-linear DR methods. In the proposed method, non-linear dimensionally reduced data was clustered via a density-based clustering method. Thereafter, the obtained cluster labels were classified by random forest (RF) classifiers. Further, feature importance (FI) of RF classifiers and Spearman's rank correlation coefficients between predictive probabilities to obtained clusters and original feature values were utilized for characterizing the visualized dimensionally reduced data. The results revealed that the proposed method can provide the interpretable FI-based images of the handwritten digits dataset. Moreover, the proposed method was also applied to the polymer dataset. The study found that incorporating signed FI was advantageous in achieving a meaningful interpretation. Furthermore, Gaussian process regression was utilized to produce intuitive FI-based heatmaps on a 2-dimensional space for greater ease of understanding. Additionally, to enhance the interpretability of the obtained clusters, a feature selection technique called Boruta was applied. The Boruta feature selection method worked effectively to interpret the obtained clusters with limited and commonly important features. Additionally, the study suggested that computing FI solely from substructure-based descriptors could further enhance the interpretability of the results. Finally, the automation of the proposed method was investigated, and through maximizing the target score based on the quality of both the DR and clustering, indicative results were automatically obtained for both the handwritten digits and polymer datasets.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":null,"pages":null},"PeriodicalIF":3.6,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10257338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computer-aided design of muscarinic acetylcholine receptor M3 inhibitors: Promising compounds among trifluoromethyl containing hexahydropyrimidinones/thiones. 毒蕈碱乙酰胆碱受体M3抑制剂的计算机辅助设计:含三氟甲基的六氢嘧啶酮/硫酮中有前途的化合物。
IF 3.6 4区 医学
Molecular Informatics Pub Date : 2023-08-01 Epub Date: 2023-08-09 DOI: 10.1002/minf.202300006
Alex Nyporko, Olga Tsymbalyuk, Ivan Voiteshenko, Sergiy Starosyla, Mykola Protopopov, Volodymyr Bdzhola
{"title":"Computer-aided design of muscarinic acetylcholine receptor M3 inhibitors: Promising compounds among trifluoromethyl containing hexahydropyrimidinones/thiones.","authors":"Alex Nyporko,&nbsp;Olga Tsymbalyuk,&nbsp;Ivan Voiteshenko,&nbsp;Sergiy Starosyla,&nbsp;Mykola Protopopov,&nbsp;Volodymyr Bdzhola","doi":"10.1002/minf.202300006","DOIUrl":"10.1002/minf.202300006","url":null,"abstract":"<p><p>The new high selective mAChRs M3 inhibitors with IC<sub>50</sub> in nanomolecular ranges, which can be the prototypes for effective COPD and asthma treatment drugs, were discovered with computational approaches among trifluoromethyl containing hexahydropyrimidinones/thiones. Compounds [6-(4-ethoxy-3-methoxy-phenyl)-4-hydroxy-2-thioxo-4-(trifluoromethyl)hexahydropyrimidin-5-yl]-phenyl-methanone (THPT-1) and 5-benzoyl-6-(3,4-dimethoxyphenyl)-4-hydroxy-4-(trifluoromethyl)hexahydropyrimidin-2-one (THPO-4) have been proved to be a highly effective (with IC<sub>50</sub> values of 1.62 ⋅ 10<sup>-7</sup>  M and 3.09 ⋅ 10<sup>-9</sup>  M, respectively) at the same concentrations significantly competitive inhibit the signal conduction through mAChR3 in comparison with ipratropium bromide, without significant effect on mAChR2, nicotinic cholinergic and adrenergic receptors.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":null,"pages":null},"PeriodicalIF":3.6,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10227176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Novel Inhibitors of androgen receptor's DNA binding domain identified using an ultra-large virtual screening. 使用超大型虚拟筛选鉴定的雄激素受体DNA结合域的新型抑制剂。
IF 3.6 4区 医学
Molecular Informatics Pub Date : 2023-08-01 Epub Date: 2023-07-19 DOI: 10.1002/minf.202300026
Mariia Radaeva, Helene Morin, Mohit Pandey, Fuqiang Ban, Maria Guo, Eric LeBlanc, Nada Lallous, Artem Cherkasov
{"title":"Novel Inhibitors of androgen receptor's DNA binding domain identified using an ultra-large virtual screening.","authors":"Mariia Radaeva,&nbsp;Helene Morin,&nbsp;Mohit Pandey,&nbsp;Fuqiang Ban,&nbsp;Maria Guo,&nbsp;Eric LeBlanc,&nbsp;Nada Lallous,&nbsp;Artem Cherkasov","doi":"10.1002/minf.202300026","DOIUrl":"10.1002/minf.202300026","url":null,"abstract":"<p><p>Androgen receptor (AR) inhibition remains the primary strategy to combat the progression of prostate cancer (PC). However, all clinically used AR inhibitors target the ligand-binding domain (LBD), which is highly susceptible to truncations through splicing or mutations that confer drug resistance. Thus, there exists an urgent need for AR inhibitors with novel modes of action. We thus launched a virtual screening of an ultra-large chemical library to find novel inhibitors of the AR DNA-binding domain (DBD) at two sites: protein-DNA interface (P-box) and dimerization site (D-box). The compounds selected through vigorous computational filtering were then experimentally validated. We identified several novel chemotypes that effectively suppress transcriptional activity of AR and its splice variant V7. The identified compounds represent previously unexplored chemical scaffolds with a mechanism of action that evades the conventional drug resistance manifested through LBD mutations. Additionally, we describe the binding features required to inhibit AR DBD at both P-box and D-box target sites.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":null,"pages":null},"PeriodicalIF":3.6,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10218099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deimos: A novel automated methodology for optimal grouping. Application to nanoinformatics case studies. Deimos:一种用于优化分组的新型自动化方法。应用于纳米信息学案例研究。
IF 3.6 4区 医学
Molecular Informatics Pub Date : 2023-08-01 Epub Date: 2023-08-21 DOI: 10.1002/minf.202300019
Dimitra-Danai Varsou, Haralambos Sarimveis
{"title":"Deimos: A novel automated methodology for optimal grouping. Application to nanoinformatics case studies.","authors":"Dimitra-Danai Varsou,&nbsp;Haralambos Sarimveis","doi":"10.1002/minf.202300019","DOIUrl":"10.1002/minf.202300019","url":null,"abstract":"<p><p>In this study we present deimos, a computational methodology for optimal grouping, applied on the read-across prediction of engineered nanomaterials' (ENMs) toxicity-related properties. The method is based on the formulation and the solution of a mixed-integer optimization program (MILP) problem that automatically and simultaneously performs feature selection, defines the grouping boundaries according to the response variable and develops linear regression models in each group. For each group/region, the characteristic centroid is defined in order to allocate untested ENMs to the groups. The deimos MILP problem is integrated in a broader optimization workflow that selects the best performing methodology between the standard multiple linear regression (MLR), the least absolute shrinkage and selection operator (LASSO) models and the proposed deimos multiple-region model. The performance of the suggested methodology is demonstrated through the application to benchmark ENMs datasets and comparison with other predictive modelling approaches. However, the proposed method can be applied to property prediction of other than ENM chemical entities and it is not limited to ENMs toxicity prediction.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":null,"pages":null},"PeriodicalIF":3.6,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10575617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
De novo drug design based on patient gene expression profiles via deep learning. 通过深度学习基于患者基因表达谱的从头药物设计。
IF 3.6 4区 医学
Molecular Informatics Pub Date : 2023-08-01 Epub Date: 2023-08-21 DOI: 10.1002/minf.202300064
Chikashige Yamanaka, Shunya Uki, Kazuma Kaitoh, Michio Iwata, Yoshihiro Yamanishi
{"title":"De novo drug design based on patient gene expression profiles via deep learning.","authors":"Chikashige Yamanaka,&nbsp;Shunya Uki,&nbsp;Kazuma Kaitoh,&nbsp;Michio Iwata,&nbsp;Yoshihiro Yamanishi","doi":"10.1002/minf.202300064","DOIUrl":"10.1002/minf.202300064","url":null,"abstract":"<p><p>Computational de novo drug design is a challenging issue in medicine, and it is desirable to consider all of the relevant information of the biological systems in a disease state. Here, we propose a novel computational method to generate drug candidate molecular structures from patient gene expression profiles via deep learning, which we call DRAGONET. Our model can generate new molecules that are likely to counteract disease-specific gene expression patterns in patients, which is made possible by exploring the latent space constructed by a transformer-based variational autoencoder and integrating the substructures of disease-correlated molecules. We applied DRAGONET to generate drug candidate molecules for gastric cancer, atopic dermatitis, and Alzheimer's disease, and demonstrated that the newly generated molecules were chemically similar to registered drugs for each disease. This approach is applicable to diseases with unknown therapeutic target proteins and will make a significant contribution to the field of precision medicine.</p>","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":null,"pages":null},"PeriodicalIF":3.6,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10576146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Natural‐Language Processing (NLP) based feature extraction technique in Deep‐Learning model to predict the Blood‐Brain‐Barrier permeability of molecules 基于自然语言处理(NLP)的特征提取技术在深度学习模型中预测分子的血脑屏障通透性
IF 3.6 4区 医学
Molecular Informatics Pub Date : 2023-07-04 DOI: 10.1002/minf.202200271
Ashok Kumar, Ravi Singh, Powsali Ghosh, Ankit Ganeshpurkar, *. Asha, Rayala Swetha, Ravi Singh, Dileep Kumar, Sudheer Kumar Singh
{"title":"Natural‐Language Processing (NLP) based feature extraction technique in Deep‐Learning model to predict the Blood‐Brain‐Barrier permeability of molecules","authors":"Ashok Kumar, Ravi Singh, Powsali Ghosh, Ankit Ganeshpurkar, *. Asha, Rayala Swetha, Ravi Singh, Dileep Kumar, Sudheer Kumar Singh","doi":"10.1002/minf.202200271","DOIUrl":"https://doi.org/10.1002/minf.202200271","url":null,"abstract":"Blood‐Brain‐Barrier (BBB) permeability is one of the critical factors in the success and failure of CNS drug development. The most accurate method of measuring BBB permeability involves clinical experiments, which are labour‐intensive and time‐consuming. Thus, numerous efforts were made to use artificial intelligence (AI) to predict molecules′ BBB permeability. Most of the previous models are based on calculated descriptors and molecular fingerprints. In the present work, we have developed an NLP‐based feature extraction technique in Deep‐Learning models to predict BBB permeability. We have used the B3DB database and generated SELFIES to extract features from the molecules. We have employed word level and N‐gram tokenization to represent words into numeric vectors. The extracted features were fed into several Artificial Neural Network (ANN) and Bi‐directional Long Short‐Term Memory (LSTM) models. The model, ANN‐10 built using ANN and 6‐gram tokenization, performed best on the independent test set. The accuracy, precision, recall, F1, specificity and AUC of ROC scores were found to be 0.89, 0.91, 0.91, 0.91, 0.85 and 0.90. Thus, the developed model can be used for the early screening of CNS drugs.","PeriodicalId":18853,"journal":{"name":"Molecular Informatics","volume":null,"pages":null},"PeriodicalIF":3.6,"publicationDate":"2023-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41857571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信