Qiushi Feng, Zehua Dong, Rongfang Nie, Xiaosheng Wang
{"title":"Identifying Diffuse Glioma Subtypes Based on Pathway Enrichment Evaluation","authors":"Qiushi Feng, Zehua Dong, Rongfang Nie, Xiaosheng Wang","doi":"10.1007/s12539-024-00627-w","DOIUrl":"https://doi.org/10.1007/s12539-024-00627-w","url":null,"abstract":"<p>Gliomas are highly heterogeneous in molecular, histology, and microenvironment. However, a classification of gliomas by integrating different tumor microenvironment (TME) components remains unexplored. Based on the enrichment scores of 17 pathways involved in immune, stromal, DNA repair, and nervous system signatures in diffuse gliomas, we performed consensus clustering to uncover novel subtypes of gliomas. Consistently in three glioma datasets (TCGA-glioma, CGGA325, and CGGA301), we identified three subtypes: Stromal-enriched (Str-G), Nerve-enriched (Ner-G), and mixed (Mix-G). Ner-G was charactered by low immune infiltration levels, stromal contents, tumor mutation burden, copy number alterations, DNA repair activity, cell proliferation, epithelial-mesenchymal transformation, stemness, intratumor heterogeneity, androgen receptor expression and <i>EGFR</i>, <i>PTEN</i>, <i>NF1</i> and <i>MUC16</i> mutation rates, while high enrichment of neurons and nervous system pathways, and high tumor purity, estrogen receptor expression, <i>IDH1</i> and <i>CIC</i> mutation rates, temozolomide response rate and overall and disease-free survival rates. In contrast, Str-G displayed contrastive characteristics to Ner-G. Our analysis indicates that the heterogeneity between glioma cells and neurons is lower than that between glioma cells and immune and stromal cells. Furthermore, the abundance of neurons is positively associated with clinical outcomes in gliomas, while the enrichment of immune and stromal cells has a negative association with them. Our classification method provides new insights into the tumor biology of gliomas, as well as clinical implications for the precise management of this disease.</p><h3 data-test=\"abstract-sub-heading\">Graphic Abstract</h3>\u0000","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":"78 1","pages":""},"PeriodicalIF":4.8,"publicationDate":"2024-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140624134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"AMPFLDAP: Adaptive Message Passing and Feature Fusion on Heterogeneous Network for LncRNA-Disease Associations Prediction","authors":"","doi":"10.1007/s12539-024-00610-5","DOIUrl":"https://doi.org/10.1007/s12539-024-00610-5","url":null,"abstract":"<h3>Abstract</h3> <p>Exploration of the intricate connections between long noncoding RNA (lncRNA) and diseases, referred to as lncRNA-disease associations (LDAs), plays a pivotal and indispensable role in unraveling the underlying molecular mechanisms of diseases and devising practical treatment approaches. It is imperative to employ computational methods for predicting lncRNA-disease associations to circumvent the need for superfluous experimental endeavors. Graph-based learning models have gained substantial popularity in predicting these associations, primarily because of their capacity to leverage node attributes and relationships within the network. Nevertheless, there remains much room for enhancing the performance of these techniques by incorporating and harmonizing the node attributes more effectively. In this context, we introduce a novel model, i.e., Adaptive Message Passing and Feature Fusion (AMPFLDAP), for forecasting lncRNA-disease associations within a heterogeneous network. Firstly, we constructed a heterogeneous network involving lncRNA, microRNA (miRNA), and diseases based on established associations and employing Gaussian interaction profile kernel similarity as a measure. Then, an adaptive topological message passing mechanism is suggested to address the information aggregation for heterogeneous networks. The topological features of nodes in the heterogeneous network were extracted based on the adaptive topological message passing mechanism. Moreover, an attention mechanism is applied to integrate both topological and semantic information to achieve the multimodal features of biomolecules, which are further used to predict potential LDAs. The experimental results demonstrated that the performance of the proposed AMPFLDAP is superior to seven state-of-the-art methods. Furthermore, to validate its efficacy in practical scenarios, we conducted detailed case studies involving three distinct diseases, which conclusively demonstrated AMPFLDAP’s effectiveness in the prediction of LDAs.</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":"21 1","pages":""},"PeriodicalIF":4.8,"publicationDate":"2024-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140570563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Haoze Du, Jiahao Xu, Zhiyong Du, Lihui Chen, Shaohui Ma, Dongqing Wei, Xianfang Wang
{"title":"MF-MNER: Multi-models Fusion for MNER in Chinese Clinical Electronic Medical Records","authors":"Haoze Du, Jiahao Xu, Zhiyong Du, Lihui Chen, Shaohui Ma, Dongqing Wei, Xianfang Wang","doi":"10.1007/s12539-024-00624-z","DOIUrl":"https://doi.org/10.1007/s12539-024-00624-z","url":null,"abstract":"<p>To address the problem of poor entity recognition performance caused by the lack of Chinese annotation in clinical electronic medical records, this paper proposes a multi-medical entity recognition method F-MNER using a fusion technique combining BART, Bi-LSTM, and CRF. First, after cleaning, encoding, and segmenting the electronic medical records, the obtained semantic representations are dynamically fused using a bidirectional autoregressive transformer (BART) model. Then, sequential information is captured using a bidirectional long short-term memory (Bi-LSTM) network. Finally, the conditional random field (CRF) is used to decode and output multi-task entity recognition. Experiments are performed on the CCKS2019 dataset, with <i>micro avg Precision</i>, <i>macro avg Recall</i>, <i>weighted avg Precision</i> reaching 0.880, 0.887, and 0.883, and <i>micro avg F1-score</i>, <i>macro avg F1-score</i>, <i>weighted avg F1-score</i> reaching 0.875, 0.876, and 0.876 respectively. Compared with existing models, our method outperforms the existing literature in three evaluation metrics (<i>micro average</i>, <i>macro average</i>, <i>weighted average</i>) under the same dataset conditions. In the case of weighted average, the <i>Precision</i>, <i>Recall</i>, and <i>F1-score</i> are 19.64%, 15.67%, and 17.58% higher than the existing BERT-BiLSTM-CRF model respectively. Experiments are performed on the actual clinical dataset with our MF-MNER, the <i>Precision</i>, <i>Recall</i>, and <i>F1-score</i> are 0.638, 0.825, and 0.719 under the micro-avg evaluation mechanism. The <i>Precision</i>, <i>Recall</i>, and <i>F1-score</i> are 0.685, 0.800, and 0.733 under the macro-avg evaluation mechanism. The <i>Precision</i>, <i>Recall</i>, and <i>F1-score</i> are 0.647, 0.825, and 0.722 under the <i>weighted avg</i> evaluation mechanism. The above results show that our method MF-MNER can integrate the advantages of BART, Bi-LSTM, and CRF layers, significantly improving the performance of downstream named entity recognition tasks with a small amount of annotation, and achieving excellent performance in terms of recall score, which has certain practical significance. Source code and datasets to reproduce the results in this paper are available at https://github.com/xfwang1969/MF-MNER.</p><h3 data-test=\"abstract-sub-heading\">Graphical Abstract</h3><p>Illustration of the proposed MF-MNER. The method mainly includes four steps: (1) medical electronic medical records need to be cleared, coded, and segmented. (2) The semantic representation obtained by dynamic fusion of the bidirectional autoregressive converter (BART) model. (3) The sequence information is captured by a bi-directional short-term memory (Bi-LSTM) network. (4) the multi-task entity recognition is decoded and output by conditional random field (CRF).\u0000</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":"2014 1","pages":""},"PeriodicalIF":4.8,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140570548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Huilian Zhang, Jie Chen, Bo Liao, Fang-xiang Wu, Xia-an Bi
{"title":"Deep Canonical Correlation Fusion Algorithm Based on Denoising Autoencoder for ASD Diagnosis and Pathogenic Brain Region Identification","authors":"Huilian Zhang, Jie Chen, Bo Liao, Fang-xiang Wu, Xia-an Bi","doi":"10.1007/s12539-024-00625-y","DOIUrl":"https://doi.org/10.1007/s12539-024-00625-y","url":null,"abstract":"<p>Autism Spectrum Disorder (ASD) is defined as a neurodevelopmental condition distinguished by unconventional neural activities. Early intervention is key to managing the progress of ASD, and current research primarily focuses on the use of structural magnetic resonance imaging (sMRI) or resting-state functional magnetic resonance imaging (rs-fMRI) for diagnosis. Moreover, the use of autoencoders for disease classification has not been sufficiently explored. In this study, we introduce a new framework based on autoencoder, the Deep Canonical Correlation Fusion algorithm based on Denoising Autoencoder (DCCF-DAE), which proves to be effective in handling high-dimensional data. This framework involves efficient feature extraction from different types of data with an advanced autoencoder, followed by the fusion of these features through the DCCF model. Then we utilize the fused features for disease classification. DCCF integrates functional and structural data to help accurately diagnose ASD and identify critical Regions of Interest (ROIs) in disease mechanisms. We compare the proposed framework with other methods by the Autism Brain Imaging Data Exchange (ABIDE) database and the results demonstrate its outstanding performance in ASD diagnosis. The superiority of DCCF-DAE highlights its potential as a crucial tool for early ASD diagnosis and monitoring.</p><h3 data-test=\"abstract-sub-heading\">Graphical Abstract</h3>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":"40 1","pages":""},"PeriodicalIF":4.8,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140570556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mingeun Ji, Yejin Kan, Dongyeon Kim, Seungmin Lee, Gangman Yi
{"title":"DeepPI: Alignment-Free Analysis of Flexible Length Proteins Based on Deep Learning and Image Generator","authors":"Mingeun Ji, Yejin Kan, Dongyeon Kim, Seungmin Lee, Gangman Yi","doi":"10.1007/s12539-024-00618-x","DOIUrl":"https://doi.org/10.1007/s12539-024-00618-x","url":null,"abstract":"<p>With the rapid development of NGS technology, the number of protein sequences has increased exponentially. Computational methods have been introduced in protein functional studies because the analysis of large numbers of proteins through biological experiments is costly and time-consuming. In recent years, new approaches based on deep learning have been proposed to overcome the limitations of conventional methods. Although deep learning-based methods effectively utilize features of protein function, they are limited to sequences of fixed-length and consider information from adjacent amino acids. Therefore, new protein analysis tools that extract functional features from proteins of flexible length and train models are required. We introduce DeepPI, a deep learning-based tool for analyzing proteins in large-scale database. The proposed model that utilizes Global Average Pooling is applied to proteins of flexible length and leads to reduced information loss compared to existing algorithms that use fixed sizes. The image generator converts a one-dimensional sequence into a distinct two-dimensional structure, which can extract common parts of various shapes. Finally, filtering techniques automatically detect representative data from the entire database and ensure coverage of large protein databases. We demonstrate that DeepPI has been successfully applied to large databases such as the Pfam-A database. Comparative experiments on four types of image generators illustrated the impact of structure on feature extraction. The filtering performance was verified by varying the parameter values and proved to be applicable to large databases. Compared to existing methods, DeepPI outperforms in family classification accuracy for protein function inference.</p><h3 data-test=\"abstract-sub-heading\">Graphical Abstract</h3>\u0000","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":"51 1","pages":""},"PeriodicalIF":4.8,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140570559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lun Zhu, Liuyang Wang, Zexi Yang, Piao Xu, Sen Yang
{"title":"PPSNO: A Feature-Rich SNO Sites Predictor by Stacking Ensemble Strategy from Protein Sequence-Derived Information.","authors":"Lun Zhu, Liuyang Wang, Zexi Yang, Piao Xu, Sen Yang","doi":"10.1007/s12539-023-00595-7","DOIUrl":"10.1007/s12539-023-00595-7","url":null,"abstract":"<p><p>The protein S-nitrosylation (SNO) is a significant post-translational modification that affects the stability, activity, cellular localization, and function of proteins. Therefore, highly accurate prediction of SNO sites aids in grasping biological function mechanisms. In this document, we have constructed a predictor, named PPSNO, forecasting protein SNO sites using stacked integrated learning. PPSNO integrates multiple machine learning techniques into an ensemble model, enhancing its predictive accuracy. First, we established benchmark datasets by collecting SNO sites from various sources, including literature, databases, and other predictors. Second, various techniques for feature extraction are applied to derive characteristics from protein sequences, which are subsequently amalgamated into the PPSNO predictor for training. Five-fold cross-validation experiments show that PPSNO outperformed existing predictors, such as PSNO, PreSNO, pCysMod, DeepNitro, RecSNO, and Mul-SNO. The PPSNO predictor achieved an impressive accuracy of 92.8%, an area under the curve (AUC) of 96.1%, a Matthews correlation coefficient (MCC) of 81.3%, an F1-score of 85.6%, an SN of 79.3%, an SP of 97.7%, and an average precision (AP) of 92.2%. We also employed ROC curves, PR curves, and radar plots to show the superior performance of PPSNO. Our study shows that fused protein sequence features and two-layer stacked ensemble models can improve the accuracy of predicting SNO sites, which can aid in comprehending cellular processes and disease mechanisms. The codes and data are available at https://github.com/serendipity-wly/PPSNO .</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":"192-217"},"PeriodicalIF":4.8,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139416998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cervical Cancer Classification From Pap Smear Images Using Deep Convolutional Neural Network Models.","authors":"Sher Lyn Tan, Ganeshsree Selvachandran, Weiping Ding, Raveendran Paramesran, Ketan Kotecha","doi":"10.1007/s12539-023-00589-5","DOIUrl":"10.1007/s12539-023-00589-5","url":null,"abstract":"<p><p>As one of the most common female cancers, cervical cancer often develops years after a prolonged and reversible pre-cancerous stage. Traditional classification algorithms used for detection of cervical cancer often require cell segmentation and feature extraction techniques, while convolutional neural network (CNN) models demand a large dataset to mitigate over-fitting and poor generalization problems. To this end, this study aims to develop deep learning models for automated cervical cancer detection that do not rely on segmentation methods or custom features. Due to limited data availability, transfer learning was employed with pre-trained CNN models to directly operate on Pap smear images for a seven-class classification task. Thorough evaluation and comparison of 13 pre-trained deep CNN models were performed using the publicly available Herlev dataset and the Keras package in Google Collaboratory. In terms of accuracy and performance, DenseNet-201 is the best-performing model. The pre-trained CNN models studied in this paper produced good experimental results and required little computing time.</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":"16-38"},"PeriodicalIF":4.8,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10881721/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92153676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Application of Semi-supervised Fuzzy Clustering Based on Knowledge Weighting and Cluster Center Learning to Mammary Molybdenum Target Image Segmentation.","authors":"Peng Peng, Danping Wu, Li-Jun Huang, Jianqiang Wang, Li Zhang, Yue Wu, Yizhang Jiang, Zhihua Lu, Khin Wee Lai, Kaijian Xia","doi":"10.1007/s12539-023-00580-0","DOIUrl":"10.1007/s12539-023-00580-0","url":null,"abstract":"<p><p>Breast cancer is commonly diagnosed with mammography. Using image segmentation algorithms to separate lesion areas in mammography can facilitate diagnosis by doctors and reduce their workload, which has important clinical significance. Because large, accurately labeled medical image datasets are difficult to obtain, traditional clustering algorithms are widely used in medical image segmentation as an unsupervised model. Traditional unsupervised clustering algorithms have limited learning knowledge. Moreover, some semi-supervised fuzzy clustering algorithms cannot fully mine the information of labeled samples, which results in insufficient supervision. When faced with complex mammography images, the above algorithms cannot accurately segment lesion areas. To address this, a semi-supervised fuzzy clustering based on knowledge weighting and cluster center learning (WSFCM_V) is presented. According to prior knowledge, three learning modes are proposed: a knowledge weighting method for cluster centers, Euclidean distance weights for unlabeled samples, and learning from the cluster centers of labeled sample sets. These strategies improve the clustering performance. On real breast molybdenum target images, the WSFCM_V algorithm is compared with currently popular semi-supervised and unsupervised clustering algorithms. WSFCM_V has the best evaluation index values. Experimental results demonstrate that compared with the existing clustering algorithms, WSFCM_V has a higher segmentation accuracy than other clustering algorithms, both for larger lesion regions like tumor areas and for smaller lesion areas like calcification point areas.</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":"39-57"},"PeriodicalIF":4.8,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9849849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Computer-Aided Diagnosis of Complications After Liver Transplantation Based on Transfer Learning.","authors":"Ying Zhang, Chenyuan Shangguan, Xuena Zhang, Jialin Ma, Jiyuan He, Meng Jia, Na Chen","doi":"10.1007/s12539-023-00588-6","DOIUrl":"10.1007/s12539-023-00588-6","url":null,"abstract":"<p><p>Liver transplantation is one of the most effective treatments for acute liver failure, cirrhosis, and even liver cancer. The prediction of postoperative complications is of great significance for liver transplantation. However, the existing prediction methods based on traditional machine learning are often unavailable or unreliable due to the insufficient amount of real liver transplantation data. Therefore, we propose a new framework to increase the accuracy of computer-aided diagnosis of complications after liver transplantation with transfer learning, which can handle small-scale but high-dimensional data problems. Furthermore, since data samples are often high dimensional in the real world, capturing key features that influence postoperative complications can help make the correct diagnosis for patients. So, we also introduce the SHapley Additive exPlanation (SHAP) method into our framework for exploring the key features of postoperative complications. We used data obtained from 425 patients with 456 features in our experiments. Experimental results show that our approach outperforms all compared baseline methods in predicting postoperative complications. In our work, the average precision, the mean recall, and the mean F1 score reach 91.22%, 91.70%, and 91.18%, respectively.</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":"123-140"},"PeriodicalIF":4.8,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50157822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Deep Neural Network for Predicting Synergistic Drug Combinations on Cancer.","authors":"Shiyu Yan, Ding Zheng","doi":"10.1007/s12539-023-00596-6","DOIUrl":"10.1007/s12539-023-00596-6","url":null,"abstract":"<p><p>The exploration of drug combinations presents an opportunity to amplify therapeutic effectiveness while alleviating undesirable side effects. Nevertheless, the extensive array of potential combinations poses challenges in terms of cost and time constraints for experimental screening. Thus, it is crucial to narrow down the search space. Deep learning approaches have gained widespread popularity in predicting synergistic drug combinations tailored for specific cell lines in vitro settings. In the present study, we introduce a novel method termed GTextSyn, which utilizes the integration of gene expression data and chemical structure information for the prediction of synergistic effects in drug combinations. GTextSyn employs a sentence classification model within the domain of Natural Language Processing (NLP), wherein drugs and cell lines are regarded as entities possessing biochemical relevance. Meanwhile, combinations of drug pairs and cell lines are construed as sentences with biochemical relational significance. To assess the efficacy of GTextSyn, we conduct a comparative analysis with alternative deep learning approaches using a standard benchmark dataset. The results from a five-fold cross-validation demonstrate a 49.5% reduction in Mean Square Error (MSE) achieved by GTextSyn, surpassing the performance of the next best method in the regression task. Furthermore, we conduct a comprehensive literature survey on the predicted novel drug combinations and find substantial support from prior experimental studies for many of the combinations identified by GTextSyn.</p>","PeriodicalId":13670,"journal":{"name":"Interdisciplinary Sciences: Computational Life Sciences","volume":" ","pages":"218-230"},"PeriodicalIF":4.8,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139110865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}