Xinxing Yang, Faming Zhao, Tao Ren, Canping Chen, Katelyn T Byrne, Alexey V Danilov, Rosalie C Sears, Peter S Nelson, Lisa M Coussens, Gordon B Mills, Zheng Xia
{"title":"OmicsTweezer: A distribution-independent cell deconvolution model for multi-omics Data.","authors":"Xinxing Yang, Faming Zhao, Tao Ren, Canping Chen, Katelyn T Byrne, Alexey V Danilov, Rosalie C Sears, Peter S Nelson, Lisa M Coussens, Gordon B Mills, Zheng Xia","doi":"10.1016/j.xgen.2025.100950","DOIUrl":null,"url":null,"abstract":"<p><p>Cell deconvolution estimates cell type proportions from bulk omics data, enabling insights into tissue microenvironments and disease. However, practical applications are often hindered by batch effects between bulk data and referenced single-cell data, a challenge that is frequently overlooked. To address this discrepancy, we developed OmicsTweezer, a distribution-independent cell deconvolution model. By integrating optimal transport with deep learning, OmicsTweezer aligns simulated and real data in a shared latent space, effectively mitigating data shifts and inter-omics distribution differences. OmicsTweezer is versatile, capable of deconvolving bulk RNA-seq, bulk proteomics, and spatial transcriptomics. Extensive evaluations on simulated and real-world datasets demonstrate its robustness and accuracy. Furthermore, applications in prostate and colon cancer showcase OmicsTweezer's ability to identify biologically meaningful cell types. As a unified deconvolution framework for multi-omics data, OmicsTweezer offers an efficient and powerful tool for studying disease microenvironments.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100950"},"PeriodicalIF":11.1000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cell genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.xgen.2025.100950","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Cell deconvolution estimates cell type proportions from bulk omics data, enabling insights into tissue microenvironments and disease. However, practical applications are often hindered by batch effects between bulk data and referenced single-cell data, a challenge that is frequently overlooked. To address this discrepancy, we developed OmicsTweezer, a distribution-independent cell deconvolution model. By integrating optimal transport with deep learning, OmicsTweezer aligns simulated and real data in a shared latent space, effectively mitigating data shifts and inter-omics distribution differences. OmicsTweezer is versatile, capable of deconvolving bulk RNA-seq, bulk proteomics, and spatial transcriptomics. Extensive evaluations on simulated and real-world datasets demonstrate its robustness and accuracy. Furthermore, applications in prostate and colon cancer showcase OmicsTweezer's ability to identify biologically meaningful cell types. As a unified deconvolution framework for multi-omics data, OmicsTweezer offers an efficient and powerful tool for studying disease microenvironments.