{"title":"HPTRMF:基于协作矩阵因式分解的 LncRNA-疾病关联预测方法,使用高阶扰动和灵活的三因子正则化。","authors":"Guobo Xie, Dayin Li, Zhiyi Lin, Guosheng Gu, Weijun Li, Ruibin Chen, Zhenguo Liu","doi":"10.1021/acs.jcim.4c01070","DOIUrl":null,"url":null,"abstract":"<p><p>Existing matrix factorization methods face challenges, including the cold start problem and global nonlinear data loss during similarity learning, particularly in predicting associations between long noncoding RNAs (LncRNAs) and diseases. To overcome these issues, we introduce HPTRMF, a matrix factorization approach incorporating high-order perturbation and flexible trifactor regularization. HPTRMF constructs a high-order correlation matrix utilizing the known association matrix, leveraging high-order perturbation to effectively address the cold start problem caused by data sparsity. Additionally, HPTRMF incorporates a flexible trifactor regularization term to capture similarity information on LncRNAs and diseases, enabling the effective handling of global nonlinear data loss by capturing such data in the similarity matrix. Experimental results demonstrate the superiority of HPTRMF over nine state-of-the-art algorithms in Leave-One-Out Cross-Validation (LOOCV) and Five-Fold Cross-Validation (5-Fold CV) on three data sets.HPTRMF and data sets are available in https://github.com/Llvvvv/HPTRMF.</p>","PeriodicalId":44,"journal":{"name":"Journal of Chemical Information and Modeling ","volume":" ","pages":"9594-9608"},"PeriodicalIF":5.6000,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"HPTRMF: Collaborative Matrix Factorization-Based Prediction Method for LncRNA-Disease Associations Using High-Order Perturbation and Flexible Trifactor Regularization.\",\"authors\":\"Guobo Xie, Dayin Li, Zhiyi Lin, Guosheng Gu, Weijun Li, Ruibin Chen, Zhenguo Liu\",\"doi\":\"10.1021/acs.jcim.4c01070\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Existing matrix factorization methods face challenges, including the cold start problem and global nonlinear data loss during similarity learning, particularly in predicting associations between long noncoding RNAs (LncRNAs) and diseases. To overcome these issues, we introduce HPTRMF, a matrix factorization approach incorporating high-order perturbation and flexible trifactor regularization. HPTRMF constructs a high-order correlation matrix utilizing the known association matrix, leveraging high-order perturbation to effectively address the cold start problem caused by data sparsity. Additionally, HPTRMF incorporates a flexible trifactor regularization term to capture similarity information on LncRNAs and diseases, enabling the effective handling of global nonlinear data loss by capturing such data in the similarity matrix. Experimental results demonstrate the superiority of HPTRMF over nine state-of-the-art algorithms in Leave-One-Out Cross-Validation (LOOCV) and Five-Fold Cross-Validation (5-Fold CV) on three data sets.HPTRMF and data sets are available in https://github.com/Llvvvv/HPTRMF.</p>\",\"PeriodicalId\":44,\"journal\":{\"name\":\"Journal of Chemical Information and Modeling \",\"volume\":\" \",\"pages\":\"9594-9608\"},\"PeriodicalIF\":5.6000,\"publicationDate\":\"2024-12-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemical Information and Modeling \",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.jcim.4c01070\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/7/26 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemical Information and Modeling ","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.jcim.4c01070","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/26 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
HPTRMF: Collaborative Matrix Factorization-Based Prediction Method for LncRNA-Disease Associations Using High-Order Perturbation and Flexible Trifactor Regularization.
Existing matrix factorization methods face challenges, including the cold start problem and global nonlinear data loss during similarity learning, particularly in predicting associations between long noncoding RNAs (LncRNAs) and diseases. To overcome these issues, we introduce HPTRMF, a matrix factorization approach incorporating high-order perturbation and flexible trifactor regularization. HPTRMF constructs a high-order correlation matrix utilizing the known association matrix, leveraging high-order perturbation to effectively address the cold start problem caused by data sparsity. Additionally, HPTRMF incorporates a flexible trifactor regularization term to capture similarity information on LncRNAs and diseases, enabling the effective handling of global nonlinear data loss by capturing such data in the similarity matrix. Experimental results demonstrate the superiority of HPTRMF over nine state-of-the-art algorithms in Leave-One-Out Cross-Validation (LOOCV) and Five-Fold Cross-Validation (5-Fold CV) on three data sets.HPTRMF and data sets are available in https://github.com/Llvvvv/HPTRMF.
期刊介绍:
The Journal of Chemical Information and Modeling publishes papers reporting new methodology and/or important applications in the fields of chemical informatics and molecular modeling. Specific topics include the representation and computer-based searching of chemical databases, molecular modeling, computer-aided molecular design of new materials, catalysts, or ligands, development of new computational methods or efficient algorithms for chemical software, and biopharmaceutical chemistry including analyses of biological activity and other issues related to drug discovery.
Astute chemists, computer scientists, and information specialists look to this monthly’s insightful research studies, programming innovations, and software reviews to keep current with advances in this integral, multidisciplinary field.
As a subscriber you’ll stay abreast of database search systems, use of graph theory in chemical problems, substructure search systems, pattern recognition and clustering, analysis of chemical and physical data, molecular modeling, graphics and natural language interfaces, bibliometric and citation analysis, and synthesis design and reactions databases.