基于稀疏和密集残差低秩建模的人眼组织成像质谱高级降维方法。

IF 6.7 1区 化学 Q1 CHEMISTRY, ANALYTICAL
Roger A R Moens,Lukasz G Migas,David M G Anderson,Jeffrey D Messinger,Olga S Ovchinnikova,Richard M Caprioli,Christine A Curcio,Kevin L Schey,Jeffrey M Spraggins,Raf Van de Plas
{"title":"基于稀疏和密集残差低秩建模的人眼组织成像质谱高级降维方法。","authors":"Roger A R Moens,Lukasz G Migas,David M G Anderson,Jeffrey D Messinger,Olga S Ovchinnikova,Richard M Caprioli,Christine A Curcio,Kevin L Schey,Jeffrey M Spraggins,Raf Van de Plas","doi":"10.1021/acs.analchem.4c06368","DOIUrl":null,"url":null,"abstract":"Imaging mass spectrometry (IMS) yields high-dimensional and large data sets commonly exceeding 100,000 pixels, each reporting a mass spectrum of 200,000 intensity values or more. Reducing the dimensionality and size of IMS data is often necessary to enable downstream analysis, and matrix-factorization-based approaches are often used for this purpose. However, the model underlying most of these techniques, decomposing measurements into the sum of a low-rank term (presumed signal) and a small entry-wise residual term (presumed noise), is often not optimal for IMS. For example, while spatially or spectrally sparse signals are common in IMS data, they can heavily distort the low-rank approximation. Therefore, we propose capturing the IMS data structure using low-rank models that, in addition to a dense residual, allow for sparse variation to be captured separately. We implement two such methods, principal component pursuit (PCP) and stable principal component pursuit (SPCP), apply them to IMS data, and compare them to a classical factorization method, principal component analysis (PCA). We investigate their dimensionality and noise reduction performance on MALDI Q-TOF IMS measurements of human cornea and retina tissue since the human eye is a complex organ with lots of small, tightly packed tissue substructures that are spatially sparse. Our results suggest that if parameters are set adequately, PCP and SPCP enable stronger dimensionality reduction and higher compression of IMS data compared to PCA, while concurrently reducing signal overestimation.","PeriodicalId":27,"journal":{"name":"Analytical Chemistry","volume":"119 1","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Advanced Dimensionality Reduction for Imaging Mass Spectrometry of Human Eye Tissue through Low-Rank Modeling with Sparse and Dense Residuals.\",\"authors\":\"Roger A R Moens,Lukasz G Migas,David M G Anderson,Jeffrey D Messinger,Olga S Ovchinnikova,Richard M Caprioli,Christine A Curcio,Kevin L Schey,Jeffrey M Spraggins,Raf Van de Plas\",\"doi\":\"10.1021/acs.analchem.4c06368\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Imaging mass spectrometry (IMS) yields high-dimensional and large data sets commonly exceeding 100,000 pixels, each reporting a mass spectrum of 200,000 intensity values or more. Reducing the dimensionality and size of IMS data is often necessary to enable downstream analysis, and matrix-factorization-based approaches are often used for this purpose. However, the model underlying most of these techniques, decomposing measurements into the sum of a low-rank term (presumed signal) and a small entry-wise residual term (presumed noise), is often not optimal for IMS. For example, while spatially or spectrally sparse signals are common in IMS data, they can heavily distort the low-rank approximation. Therefore, we propose capturing the IMS data structure using low-rank models that, in addition to a dense residual, allow for sparse variation to be captured separately. We implement two such methods, principal component pursuit (PCP) and stable principal component pursuit (SPCP), apply them to IMS data, and compare them to a classical factorization method, principal component analysis (PCA). We investigate their dimensionality and noise reduction performance on MALDI Q-TOF IMS measurements of human cornea and retina tissue since the human eye is a complex organ with lots of small, tightly packed tissue substructures that are spatially sparse. Our results suggest that if parameters are set adequately, PCP and SPCP enable stronger dimensionality reduction and higher compression of IMS data compared to PCA, while concurrently reducing signal overestimation.\",\"PeriodicalId\":27,\"journal\":{\"name\":\"Analytical Chemistry\",\"volume\":\"119 1\",\"pages\":\"\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2025-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Analytical Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.analchem.4c06368\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.analchem.4c06368","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

摘要

成像质谱(IMS)产生的高维和大型数据集通常超过100,000像素,每个数据集报告的质谱强度值为200,000或更多。减少IMS数据的维数和大小通常是支持下游分析的必要条件,基于矩阵分解的方法通常用于此目的。然而,大多数这些技术的基础模型,将测量分解为低秩项(假定的信号)和小的入口残余项(假定的噪声)的总和,通常不是IMS的最佳选择。例如,虽然空间或频谱稀疏信号在IMS数据中很常见,但它们会严重扭曲低秩近似。因此,我们建议使用低秩模型捕获IMS数据结构,除了密集残差之外,还允许单独捕获稀疏变化。我们实现了两种这样的方法:主成分追踪(PCP)和稳定主成分追踪(SPCP),并将它们应用于IMS数据,并与经典的因式分解方法主成分分析(PCA)进行了比较。由于人眼是一个复杂的器官,具有许多空间稀疏的小而紧密排列的组织亚结构,因此我们研究了它们在人类角膜和视网膜组织的MALDI Q-TOF IMS测量中的维数和降噪性能。我们的研究结果表明,如果参数设置适当,与PCA相比,PCP和SPCP能够更强的降维和更高的压缩IMS数据,同时减少信号高估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Advanced Dimensionality Reduction for Imaging Mass Spectrometry of Human Eye Tissue through Low-Rank Modeling with Sparse and Dense Residuals.
Imaging mass spectrometry (IMS) yields high-dimensional and large data sets commonly exceeding 100,000 pixels, each reporting a mass spectrum of 200,000 intensity values or more. Reducing the dimensionality and size of IMS data is often necessary to enable downstream analysis, and matrix-factorization-based approaches are often used for this purpose. However, the model underlying most of these techniques, decomposing measurements into the sum of a low-rank term (presumed signal) and a small entry-wise residual term (presumed noise), is often not optimal for IMS. For example, while spatially or spectrally sparse signals are common in IMS data, they can heavily distort the low-rank approximation. Therefore, we propose capturing the IMS data structure using low-rank models that, in addition to a dense residual, allow for sparse variation to be captured separately. We implement two such methods, principal component pursuit (PCP) and stable principal component pursuit (SPCP), apply them to IMS data, and compare them to a classical factorization method, principal component analysis (PCA). We investigate their dimensionality and noise reduction performance on MALDI Q-TOF IMS measurements of human cornea and retina tissue since the human eye is a complex organ with lots of small, tightly packed tissue substructures that are spatially sparse. Our results suggest that if parameters are set adequately, PCP and SPCP enable stronger dimensionality reduction and higher compression of IMS data compared to PCA, while concurrently reducing signal overestimation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Analytical Chemistry
Analytical Chemistry 化学-分析化学
CiteScore
12.10
自引率
12.20%
发文量
1949
审稿时长
1.4 months
期刊介绍: Analytical Chemistry, a peer-reviewed research journal, focuses on disseminating new and original knowledge across all branches of analytical chemistry. Fundamental articles may explore general principles of chemical measurement science and need not directly address existing or potential analytical methodology. They can be entirely theoretical or report experimental results. Contributions may cover various phases of analytical operations, including sampling, bioanalysis, electrochemistry, mass spectrometry, microscale and nanoscale systems, environmental analysis, separations, spectroscopy, chemical reactions and selectivity, instrumentation, imaging, surface analysis, and data processing. Papers discussing known analytical methods should present a significant, original application of the method, a notable improvement, or results on an important analyte.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信