用于光谱分析的多变量播种算法,一种增强分析性能的数据增强方法

IF 4.3 2区 化学 Q1 SPECTROSCOPY
M.E. Keating, H.J. Byrne
{"title":"用于光谱分析的多变量播种算法,一种增强分析性能的数据增强方法","authors":"M.E. Keating,&nbsp;H.J. Byrne","doi":"10.1016/j.saa.2025.126369","DOIUrl":null,"url":null,"abstract":"<div><div>Seeding spectral datasets by augmenting the data matrix with either the full spectrum or selected spectral features in order to bias multivariate analysis towards the solution of interest is explored. It is demonstrated that such seeding can have a profound effect on the endpoint of the analysis. Using Raman spectroscopic data of human lung adenocarcinoma cells (A549) in vitro, systematic perturbations to the spectra are introduced to simulate dose dependent exposure to a drug (cisplatin), and/or cellular response, representing reduced viability. Taking Principal Components Analysis (PCA) as the first example, seeding with the known spectral profiles of the drug exposure is demonstrated to greatly increase the ability of the algorithm to differentiate two distinct data subsets, representing control and exposed. The improved differentiation is quantified by further Linear Discriminant Analysis of the PCA data. Other examples of where seeding may be applied include, simulated datasets consisting of simultaneous changes in the spectral markers of exposure dose and cellular response, which are used for Multivariate Curve Resolution – Alternating Least Squares analysis (MCR-ALS). In the example presented, adding pure components to the dataset improves the ability of the algorithm to both model the systematic variation of concentration dependent data and extract the component spectra more accurately than the unseeded dataset. The seeded approach thus provides improved performance for differential analysis of datasets, as well as spectral unmixing analyses, to monitor, for example, the kinetic evolution of a reaction mixture, or metabolic pathway.</div></div>","PeriodicalId":433,"journal":{"name":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","volume":"340 ","pages":"Article 126369"},"PeriodicalIF":4.3000,"publicationDate":"2025-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Seeding multivariate algorithms for spectral analysis, a data augmentation approach to enhance analytical performance\",\"authors\":\"M.E. Keating,&nbsp;H.J. Byrne\",\"doi\":\"10.1016/j.saa.2025.126369\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Seeding spectral datasets by augmenting the data matrix with either the full spectrum or selected spectral features in order to bias multivariate analysis towards the solution of interest is explored. It is demonstrated that such seeding can have a profound effect on the endpoint of the analysis. Using Raman spectroscopic data of human lung adenocarcinoma cells (A549) in vitro, systematic perturbations to the spectra are introduced to simulate dose dependent exposure to a drug (cisplatin), and/or cellular response, representing reduced viability. Taking Principal Components Analysis (PCA) as the first example, seeding with the known spectral profiles of the drug exposure is demonstrated to greatly increase the ability of the algorithm to differentiate two distinct data subsets, representing control and exposed. The improved differentiation is quantified by further Linear Discriminant Analysis of the PCA data. Other examples of where seeding may be applied include, simulated datasets consisting of simultaneous changes in the spectral markers of exposure dose and cellular response, which are used for Multivariate Curve Resolution – Alternating Least Squares analysis (MCR-ALS). In the example presented, adding pure components to the dataset improves the ability of the algorithm to both model the systematic variation of concentration dependent data and extract the component spectra more accurately than the unseeded dataset. The seeded approach thus provides improved performance for differential analysis of datasets, as well as spectral unmixing analyses, to monitor, for example, the kinetic evolution of a reaction mixture, or metabolic pathway.</div></div>\",\"PeriodicalId\":433,\"journal\":{\"name\":\"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy\",\"volume\":\"340 \",\"pages\":\"Article 126369\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-05-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1386142525006754\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SPECTROSCOPY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386142525006754","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
引用次数: 0

摘要

通过用全光谱或选定的光谱特征增加数据矩阵来播种光谱数据集,以便将多变量分析偏向于感兴趣的解。结果表明,这样的播种可以对分析的终点产生深远的影响。利用体外人肺腺癌细胞(A549)的拉曼光谱数据,引入光谱的系统扰动来模拟药物(顺铂)剂量依赖性暴露和/或细胞反应,代表降低的活力。以主成分分析(PCA)为例,利用已知的药物暴露谱图进行播种,大大提高了算法区分两个不同的数据子集(代表对照和暴露)的能力。通过进一步对主成分分析数据进行线性判别分析,对改进的微分进行量化。其他可以应用播种的例子包括,由暴露剂量和细胞反应的光谱标记同时变化组成的模拟数据集,用于多变量曲线分辨率-交替最小二乘分析(MCR-ALS)。在示例中,将纯成分添加到数据集中可以提高算法对浓度相关数据的系统变化建模的能力,并且比未播种的数据集更准确地提取成分光谱。因此,种子方法为数据集的差异分析以及光谱分解分析提供了改进的性能,以监测例如反应混合物的动力学演变或代谢途径。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Seeding multivariate algorithms for spectral analysis, a data augmentation approach to enhance analytical performance
Seeding spectral datasets by augmenting the data matrix with either the full spectrum or selected spectral features in order to bias multivariate analysis towards the solution of interest is explored. It is demonstrated that such seeding can have a profound effect on the endpoint of the analysis. Using Raman spectroscopic data of human lung adenocarcinoma cells (A549) in vitro, systematic perturbations to the spectra are introduced to simulate dose dependent exposure to a drug (cisplatin), and/or cellular response, representing reduced viability. Taking Principal Components Analysis (PCA) as the first example, seeding with the known spectral profiles of the drug exposure is demonstrated to greatly increase the ability of the algorithm to differentiate two distinct data subsets, representing control and exposed. The improved differentiation is quantified by further Linear Discriminant Analysis of the PCA data. Other examples of where seeding may be applied include, simulated datasets consisting of simultaneous changes in the spectral markers of exposure dose and cellular response, which are used for Multivariate Curve Resolution – Alternating Least Squares analysis (MCR-ALS). In the example presented, adding pure components to the dataset improves the ability of the algorithm to both model the systematic variation of concentration dependent data and extract the component spectra more accurately than the unseeded dataset. The seeded approach thus provides improved performance for differential analysis of datasets, as well as spectral unmixing analyses, to monitor, for example, the kinetic evolution of a reaction mixture, or metabolic pathway.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
8.40
自引率
11.40%
发文量
1364
审稿时长
40 days
期刊介绍: Spectrochimica Acta, Part A: Molecular and Biomolecular Spectroscopy (SAA) is an interdisciplinary journal which spans from basic to applied aspects of optical spectroscopy in chemistry, medicine, biology, and materials science. The journal publishes original scientific papers that feature high-quality spectroscopic data and analysis. From the broad range of optical spectroscopies, the emphasis is on electronic, vibrational or rotational spectra of molecules, rather than on spectroscopy based on magnetic moments. Criteria for publication in SAA are novelty, uniqueness, and outstanding quality. Routine applications of spectroscopic techniques and computational methods are not appropriate. Topics of particular interest of Spectrochimica Acta Part A include, but are not limited to: Spectroscopy and dynamics of bioanalytical, biomedical, environmental, and atmospheric sciences, Novel experimental techniques or instrumentation for molecular spectroscopy, Novel theoretical and computational methods, Novel applications in photochemistry and photobiology, Novel interpretational approaches as well as advances in data analysis based on electronic or vibrational spectroscopy.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信