Xinxing Yang, Faming Zhao, Tao Ren, Canping Chen, Katelyn T Byrne, Alexey V Danilov, Rosalie C Sears, Peter S Nelson, Lisa M Coussens, Gordon B Mills, Zheng Xia
{"title":"OmicsTweezer:多组学数据的分布无关细胞反卷积模型。","authors":"Xinxing Yang, Faming Zhao, Tao Ren, Canping Chen, Katelyn T Byrne, Alexey V Danilov, Rosalie C Sears, Peter S Nelson, Lisa M Coussens, Gordon B Mills, Zheng Xia","doi":"10.1016/j.xgen.2025.100950","DOIUrl":null,"url":null,"abstract":"<p><p>Cell deconvolution estimates cell type proportions from bulk omics data, enabling insights into tissue microenvironments and disease. However, practical applications are often hindered by batch effects between bulk data and referenced single-cell data, a challenge that is frequently overlooked. To address this discrepancy, we developed OmicsTweezer, a distribution-independent cell deconvolution model. By integrating optimal transport with deep learning, OmicsTweezer aligns simulated and real data in a shared latent space, effectively mitigating data shifts and inter-omics distribution differences. OmicsTweezer is versatile, capable of deconvolving bulk RNA-seq, bulk proteomics, and spatial transcriptomics. Extensive evaluations on simulated and real-world datasets demonstrate its robustness and accuracy. Furthermore, applications in prostate and colon cancer showcase OmicsTweezer's ability to identify biologically meaningful cell types. As a unified deconvolution framework for multi-omics data, OmicsTweezer offers an efficient and powerful tool for studying disease microenvironments.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100950"},"PeriodicalIF":11.1000,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"OmicsTweezer: A distribution-independent cell deconvolution model for multi-omics Data.\",\"authors\":\"Xinxing Yang, Faming Zhao, Tao Ren, Canping Chen, Katelyn T Byrne, Alexey V Danilov, Rosalie C Sears, Peter S Nelson, Lisa M Coussens, Gordon B Mills, Zheng Xia\",\"doi\":\"10.1016/j.xgen.2025.100950\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Cell deconvolution estimates cell type proportions from bulk omics data, enabling insights into tissue microenvironments and disease. However, practical applications are often hindered by batch effects between bulk data and referenced single-cell data, a challenge that is frequently overlooked. To address this discrepancy, we developed OmicsTweezer, a distribution-independent cell deconvolution model. By integrating optimal transport with deep learning, OmicsTweezer aligns simulated and real data in a shared latent space, effectively mitigating data shifts and inter-omics distribution differences. OmicsTweezer is versatile, capable of deconvolving bulk RNA-seq, bulk proteomics, and spatial transcriptomics. Extensive evaluations on simulated and real-world datasets demonstrate its robustness and accuracy. Furthermore, applications in prostate and colon cancer showcase OmicsTweezer's ability to identify biologically meaningful cell types. As a unified deconvolution framework for multi-omics data, OmicsTweezer offers an efficient and powerful tool for studying disease microenvironments.</p>\",\"PeriodicalId\":72539,\"journal\":{\"name\":\"Cell genomics\",\"volume\":\" \",\"pages\":\"100950\"},\"PeriodicalIF\":11.1000,\"publicationDate\":\"2025-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cell genomics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1016/j.xgen.2025.100950\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/7/16 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CELL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cell genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.xgen.2025.100950","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/16 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
OmicsTweezer: A distribution-independent cell deconvolution model for multi-omics Data.
Cell deconvolution estimates cell type proportions from bulk omics data, enabling insights into tissue microenvironments and disease. However, practical applications are often hindered by batch effects between bulk data and referenced single-cell data, a challenge that is frequently overlooked. To address this discrepancy, we developed OmicsTweezer, a distribution-independent cell deconvolution model. By integrating optimal transport with deep learning, OmicsTweezer aligns simulated and real data in a shared latent space, effectively mitigating data shifts and inter-omics distribution differences. OmicsTweezer is versatile, capable of deconvolving bulk RNA-seq, bulk proteomics, and spatial transcriptomics. Extensive evaluations on simulated and real-world datasets demonstrate its robustness and accuracy. Furthermore, applications in prostate and colon cancer showcase OmicsTweezer's ability to identify biologically meaningful cell types. As a unified deconvolution framework for multi-omics data, OmicsTweezer offers an efficient and powerful tool for studying disease microenvironments.