Quantifying the impact of sample, instrument, and data processing on biological signatures in modern and fossil tissues detected with Raman spectroscopy

IF 2.4 3区 化学 Q2 SPECTROSCOPY
Jasmina Wiemann, Philipp R. Heck
{"title":"Quantifying the impact of sample, instrument, and data processing on biological signatures in modern and fossil tissues detected with Raman spectroscopy","authors":"Jasmina Wiemann,&nbsp;Philipp R. Heck","doi":"10.1002/jrs.6669","DOIUrl":null,"url":null,"abstract":"<p>Raman spectroscopy is a popular tool for characterizing complex biological materials and their geological remains. Ordination methods, such as principal component analysis (PCA), use spectral variance to create a compositional space, the ChemoSpace, grouping samples based on spectroscopic manifestations reflecting different biological properties or geological processes. PCA allows to reduce the dimensionality of complex spectroscopic data and facilitates the extraction of informative features into formats suitable for downstream statistical analyses, thus representing a first step in the development of diagnostic biosignatures from complex modern and fossil tissues. For such samples, however, there is presently no systematic and accessible survey of the impact of sample, instrument, and spectral processing on the occupation of the ChemoSpace. Here, the influence of sample count, unwanted signals and different signal-to-noise ratios, spectrometer decalibration, baseline subtraction, and spectral normalization on ChemoSpace grouping is investigated and exemplified using synthetic spectra. Increase in sample size improves the dissociation of groups in the ChemoSpace, and our sample yields a representative and mostly stable pattern in occupation with less than 10 samples per group. The impact of systemic interference of different amplitude and frequency, periodical or random features that can be introduced by instrument or sample, on compositional biological signatures is reduced by PCA and allows to extract biological information even when spectra of differing signal-to-noise ratios are compared. Routine offsets (\n<span></span><math>\n <mo>±</mo></math>1 cm<sup>−1</sup>) in spectrometer calibration contribute in our sample to less than 0.1% of the total spectral variance captured in the ChemoSpace and do not obscure biological information. Standard adaptive baselining, together with normalization, increases spectral comparability and facilitates the extraction of informative features. The ChemoSpace approach to biosignatures represents a powerful tool for exploring, denoising, and integrating molecular information from modern and ancient organismal samples.</p>","PeriodicalId":16926,"journal":{"name":"Journal of Raman Spectroscopy","volume":null,"pages":null},"PeriodicalIF":2.4000,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/jrs.6669","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Raman Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/jrs.6669","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
引用次数: 0

Abstract

Raman spectroscopy is a popular tool for characterizing complex biological materials and their geological remains. Ordination methods, such as principal component analysis (PCA), use spectral variance to create a compositional space, the ChemoSpace, grouping samples based on spectroscopic manifestations reflecting different biological properties or geological processes. PCA allows to reduce the dimensionality of complex spectroscopic data and facilitates the extraction of informative features into formats suitable for downstream statistical analyses, thus representing a first step in the development of diagnostic biosignatures from complex modern and fossil tissues. For such samples, however, there is presently no systematic and accessible survey of the impact of sample, instrument, and spectral processing on the occupation of the ChemoSpace. Here, the influence of sample count, unwanted signals and different signal-to-noise ratios, spectrometer decalibration, baseline subtraction, and spectral normalization on ChemoSpace grouping is investigated and exemplified using synthetic spectra. Increase in sample size improves the dissociation of groups in the ChemoSpace, and our sample yields a representative and mostly stable pattern in occupation with less than 10 samples per group. The impact of systemic interference of different amplitude and frequency, periodical or random features that can be introduced by instrument or sample, on compositional biological signatures is reduced by PCA and allows to extract biological information even when spectra of differing signal-to-noise ratios are compared. Routine offsets ( ±1 cm−1) in spectrometer calibration contribute in our sample to less than 0.1% of the total spectral variance captured in the ChemoSpace and do not obscure biological information. Standard adaptive baselining, together with normalization, increases spectral comparability and facilitates the extraction of informative features. The ChemoSpace approach to biosignatures represents a powerful tool for exploring, denoising, and integrating molecular information from modern and ancient organismal samples.

Abstract Image

Abstract Image

量化样品、仪器和数据处理对拉曼光谱检测到的现代和化石组织中生物特征的影响
拉曼光谱是表征复杂生物材料及其地质遗迹的常用工具。主成分分析(PCA)等排序方法利用光谱方差创建一个成分空间,即化学空间(ChemoSpace),根据反映不同生物特性或地质过程的光谱表现对样本进行分组。PCA 可以降低复杂光谱数据的维度,便于将信息特征提取为适合下游统计分析的格式,从而为开发复杂的现代组织和化石组织的诊断生物特征迈出了第一步。然而,对于此类样本,目前还没有关于样本、仪器和光谱处理对化学空间占用的影响的系统性调查。在此,我们利用合成光谱研究并举例说明了样本数量、无用信号和不同信噪比、光谱仪去校准、基线减法和光谱归一化对 ChemoSpace 分组的影响。样本量的增加改善了 ChemoSpace 中的分组解离,我们的样本在每组样本少于 10 个的情况下产生了具有代表性且基本稳定的分组模式。不同振幅和频率的系统干扰、仪器或样本可能引入的周期或随机特征对生物组成特征的影响通过 PCA 得到了降低,即使在比较不同信噪比的光谱时也能提取生物信息。在我们的样本中,光谱仪校准中的常规偏移(±$$\pm $$1 cm-1)只占化学空间捕获的总光谱差异的不到 0.1%,不会掩盖生物信息。标准自适应基线法与归一化相结合,提高了光谱的可比性,有利于提取信息特征。化学空间生物特征方法是探索、去噪和整合现代与古代生物样本分子信息的有力工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.40
自引率
8.00%
发文量
185
审稿时长
3.0 months
期刊介绍: The Journal of Raman Spectroscopy is an international journal dedicated to the publication of original research at the cutting edge of all areas of science and technology related to Raman spectroscopy. The journal seeks to be the central forum for documenting the evolution of the broadly-defined field of Raman spectroscopy that includes an increasing number of rapidly developing techniques and an ever-widening array of interdisciplinary applications. Such topics include time-resolved, coherent and non-linear Raman spectroscopies, nanostructure-based surface-enhanced and tip-enhanced Raman spectroscopies of molecules, resonance Raman to investigate the structure-function relationships and dynamics of biological molecules, linear and nonlinear Raman imaging and microscopy, biomedical applications of Raman, theoretical formalism and advances in quantum computational methodology of all forms of Raman scattering, Raman spectroscopy in archaeology and art, advances in remote Raman sensing and industrial applications, and Raman optical activity of all classes of chiral molecules.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信