确保多模态单单元数据的对角集成,避免歧义映射。

Han Zhou, Kai Cao, Yang Young Lu
{"title":"确保多模态单单元数据的对角集成,避免歧义映射。","authors":"Han Zhou, Kai Cao, Yang Young Lu","doi":"10.1093/bioinformatics/btaf345","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Recent advances in single-cell multimodal omics technologies enable the exploration of cellular systems at unprecedented resolution, leading to the rapid generation of multimodal datasets that require sophisticated integration methods. Diagonal integration has emerged as a flexible solution for integrating heterogeneous single-cell data without relying on shared cells or features. However, the absence of anchoring elements introduces the risk of artificial integrations, where cells across modalities are incorrectly aligned due to ambiguous mapping.</p><p><strong>Results: </strong>To address this challenge, we propose SONATA, a novel diagnostic method designed to detect potential artificial integrations resulting from ambiguous mappings in diagonal data integration. SONATA identifies ambiguous alignments by quantifying cell-cell ambiguity within the data manifold, ensuring that biologically meaningful integrations are distinguished from spurious ones. It is worth noting that SONATA is not designed to replace any existing pipelines for diagonal data integration; instead, SONATA works simply as an add-on to an existing pipeline for achieving more reliable integration. Through a comprehensive evaluation on both simulated and real multimodal single-cell datasets, we observe that artificial integrations in diagonal data integration are widespread yet surprisingly overlooked, occurring across all mainstream diagonal integration methods. We demonstrate SONATA's ability to safeguard against misleading integrations and provide actionable insights into potential integration failures across mainstream methods. Our approach offers a robust framework for ensuring the reliability and interpretability of multimodal single-cell data integration.</p><p><strong>Availability and implementation: </strong>The source code is available at (https://github.com/batmen-lab/SONATA).</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Securing diagonal integration of multimodal single-cell data against ambiguous mapping.\",\"authors\":\"Han Zhou, Kai Cao, Yang Young Lu\",\"doi\":\"10.1093/bioinformatics/btaf345\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Motivation: </strong>Recent advances in single-cell multimodal omics technologies enable the exploration of cellular systems at unprecedented resolution, leading to the rapid generation of multimodal datasets that require sophisticated integration methods. Diagonal integration has emerged as a flexible solution for integrating heterogeneous single-cell data without relying on shared cells or features. However, the absence of anchoring elements introduces the risk of artificial integrations, where cells across modalities are incorrectly aligned due to ambiguous mapping.</p><p><strong>Results: </strong>To address this challenge, we propose SONATA, a novel diagnostic method designed to detect potential artificial integrations resulting from ambiguous mappings in diagonal data integration. SONATA identifies ambiguous alignments by quantifying cell-cell ambiguity within the data manifold, ensuring that biologically meaningful integrations are distinguished from spurious ones. It is worth noting that SONATA is not designed to replace any existing pipelines for diagonal data integration; instead, SONATA works simply as an add-on to an existing pipeline for achieving more reliable integration. Through a comprehensive evaluation on both simulated and real multimodal single-cell datasets, we observe that artificial integrations in diagonal data integration are widespread yet surprisingly overlooked, occurring across all mainstream diagonal integration methods. We demonstrate SONATA's ability to safeguard against misleading integrations and provide actionable insights into potential integration failures across mainstream methods. Our approach offers a robust framework for ensuring the reliability and interpretability of multimodal single-cell data integration.</p><p><strong>Availability and implementation: </strong>The source code is available at (https://github.com/batmen-lab/SONATA).</p><p><strong>Supplementary information: </strong>Supplementary data are available at Bioinformatics.</p>\",\"PeriodicalId\":93899,\"journal\":{\"name\":\"Bioinformatics (Oxford, England)\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-06-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinformatics (Oxford, England)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/bioinformatics/btaf345\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf345","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

动机:单细胞多模态组学技术的最新进展能够以前所未有的分辨率探索细胞系统,从而导致需要复杂集成方法的多模态数据集的快速生成。对角集成已经成为一种灵活的解决方案,可以在不依赖于共享单元或特征的情况下集成异构单细胞数据。然而,锚定元素的缺失带来了人工集成的风险,其中不同模式的细胞由于不明确的映射而不正确对齐。结果:为了解决这一挑战,我们提出了SONATA,这是一种新的诊断方法,旨在检测对角数据集成中由模糊映射导致的潜在人为集成。SONATA通过量化数据歧义中的细胞-细胞歧义来识别歧义排列,确保将生物学上有意义的整合与虚假的整合区分开来。值得注意的是,SONATA不是为了取代任何现有的对角数据集成管道而设计的;相反,SONATA只是作为现有管道的附加组件,以实现更可靠的集成。通过对模拟和真实多模态单细胞数据集的综合评估,我们观察到对角数据集成中的人工集成普遍存在,但令人惊讶的是,所有主流对角集成方法都存在人工集成。我们展示了SONATA防止误导集成的能力,并对跨主流方法的潜在集成失败提供了可操作的见解。我们的方法为确保多模态单细胞数据集成的可靠性和可解释性提供了一个强大的框架。可用性和实现:源代码可在(https://github.com/batmen-lab/SONATA).Supplementary information)上获得;补充数据可在Bioinformatics上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Securing diagonal integration of multimodal single-cell data against ambiguous mapping.

Motivation: Recent advances in single-cell multimodal omics technologies enable the exploration of cellular systems at unprecedented resolution, leading to the rapid generation of multimodal datasets that require sophisticated integration methods. Diagonal integration has emerged as a flexible solution for integrating heterogeneous single-cell data without relying on shared cells or features. However, the absence of anchoring elements introduces the risk of artificial integrations, where cells across modalities are incorrectly aligned due to ambiguous mapping.

Results: To address this challenge, we propose SONATA, a novel diagnostic method designed to detect potential artificial integrations resulting from ambiguous mappings in diagonal data integration. SONATA identifies ambiguous alignments by quantifying cell-cell ambiguity within the data manifold, ensuring that biologically meaningful integrations are distinguished from spurious ones. It is worth noting that SONATA is not designed to replace any existing pipelines for diagonal data integration; instead, SONATA works simply as an add-on to an existing pipeline for achieving more reliable integration. Through a comprehensive evaluation on both simulated and real multimodal single-cell datasets, we observe that artificial integrations in diagonal data integration are widespread yet surprisingly overlooked, occurring across all mainstream diagonal integration methods. We demonstrate SONATA's ability to safeguard against misleading integrations and provide actionable insights into potential integration failures across mainstream methods. Our approach offers a robust framework for ensuring the reliability and interpretability of multimodal single-cell data integration.

Availability and implementation: The source code is available at (https://github.com/batmen-lab/SONATA).

Supplementary information: Supplementary data are available at Bioinformatics.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信