Roger A R Moens,Lukasz G Migas,David M G Anderson,Jeffrey D Messinger,Olga S Ovchinnikova,Richard M Caprioli,Christine A Curcio,Kevin L Schey,Jeffrey M Spraggins,Raf Van de Plas
{"title":"Advanced Dimensionality Reduction for Imaging Mass Spectrometry of Human Eye Tissue through Low-Rank Modeling with Sparse and Dense Residuals.","authors":"Roger A R Moens,Lukasz G Migas,David M G Anderson,Jeffrey D Messinger,Olga S Ovchinnikova,Richard M Caprioli,Christine A Curcio,Kevin L Schey,Jeffrey M Spraggins,Raf Van de Plas","doi":"10.1021/acs.analchem.4c06368","DOIUrl":null,"url":null,"abstract":"Imaging mass spectrometry (IMS) yields high-dimensional and large data sets commonly exceeding 100,000 pixels, each reporting a mass spectrum of 200,000 intensity values or more. Reducing the dimensionality and size of IMS data is often necessary to enable downstream analysis, and matrix-factorization-based approaches are often used for this purpose. However, the model underlying most of these techniques, decomposing measurements into the sum of a low-rank term (presumed signal) and a small entry-wise residual term (presumed noise), is often not optimal for IMS. For example, while spatially or spectrally sparse signals are common in IMS data, they can heavily distort the low-rank approximation. Therefore, we propose capturing the IMS data structure using low-rank models that, in addition to a dense residual, allow for sparse variation to be captured separately. We implement two such methods, principal component pursuit (PCP) and stable principal component pursuit (SPCP), apply them to IMS data, and compare them to a classical factorization method, principal component analysis (PCA). We investigate their dimensionality and noise reduction performance on MALDI Q-TOF IMS measurements of human cornea and retina tissue since the human eye is a complex organ with lots of small, tightly packed tissue substructures that are spatially sparse. Our results suggest that if parameters are set adequately, PCP and SPCP enable stronger dimensionality reduction and higher compression of IMS data compared to PCA, while concurrently reducing signal overestimation.","PeriodicalId":27,"journal":{"name":"Analytical Chemistry","volume":"119 1","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.analchem.4c06368","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Imaging mass spectrometry (IMS) yields high-dimensional and large data sets commonly exceeding 100,000 pixels, each reporting a mass spectrum of 200,000 intensity values or more. Reducing the dimensionality and size of IMS data is often necessary to enable downstream analysis, and matrix-factorization-based approaches are often used for this purpose. However, the model underlying most of these techniques, decomposing measurements into the sum of a low-rank term (presumed signal) and a small entry-wise residual term (presumed noise), is often not optimal for IMS. For example, while spatially or spectrally sparse signals are common in IMS data, they can heavily distort the low-rank approximation. Therefore, we propose capturing the IMS data structure using low-rank models that, in addition to a dense residual, allow for sparse variation to be captured separately. We implement two such methods, principal component pursuit (PCP) and stable principal component pursuit (SPCP), apply them to IMS data, and compare them to a classical factorization method, principal component analysis (PCA). We investigate their dimensionality and noise reduction performance on MALDI Q-TOF IMS measurements of human cornea and retina tissue since the human eye is a complex organ with lots of small, tightly packed tissue substructures that are spatially sparse. Our results suggest that if parameters are set adequately, PCP and SPCP enable stronger dimensionality reduction and higher compression of IMS data compared to PCA, while concurrently reducing signal overestimation.
期刊介绍:
Analytical Chemistry, a peer-reviewed research journal, focuses on disseminating new and original knowledge across all branches of analytical chemistry. Fundamental articles may explore general principles of chemical measurement science and need not directly address existing or potential analytical methodology. They can be entirely theoretical or report experimental results. Contributions may cover various phases of analytical operations, including sampling, bioanalysis, electrochemistry, mass spectrometry, microscale and nanoscale systems, environmental analysis, separations, spectroscopy, chemical reactions and selectivity, instrumentation, imaging, surface analysis, and data processing. Papers discussing known analytical methods should present a significant, original application of the method, a notable improvement, or results on an important analyte.