Deciphering the differences in aroma components of tobacco from different origins based on HS-GC-IMS and multivariate statistical analysis†

IF 2.6 3区 化学 Q2 CHEMISTRY, ANALYTICAL
Suxuan Li, Ningyang Mao, Cong Chen, Hui Zhao, Xiaoyu Chen, Liusheng Wang, Fuyun Cui, Wenning Feng and Zhiyong Wu
{"title":"Deciphering the differences in aroma components of tobacco from different origins based on HS-GC-IMS and multivariate statistical analysis†","authors":"Suxuan Li, Ningyang Mao, Cong Chen, Hui Zhao, Xiaoyu Chen, Liusheng Wang, Fuyun Cui, Wenning Feng and Zhiyong Wu","doi":"10.1039/D5AY00531K","DOIUrl":null,"url":null,"abstract":"<p >This study employed headspace gas chromatography-ion mobility spectrometry (HS-GC-IMS) technology combined with multivariate statistical analysis methods to analyze the flavor compounds in flue-cured tobacco from five different regions in China: Henan, Hunan, Yunnan, Chongqing, and Fujian. A total of 98 volatile aroma compounds were identified through HS-GC-IMS analysis, including esters, ketones, aldehydes, acids, alcohols, heterocyclic compounds, sulfur-containing compounds, other types of compounds, and 8 uncharacterized compounds. Principal Component Analysis (PCA) and Orthogonal Partial Least Squares Discriminant Analysis (OPLS-DA) were utilized to conduct dimensionality reduction and distinguish the samples, effectively recognizing differences in volatile compounds among tobacco leaves from various origins. A Random Forest (RF) classification model was constructed, and its reliability was validated through ROC (Receiver Operating Characteristic) analysis, achieving an AUC (Area Under the Curve) value of 0.980, which demonstrates exceptional predictive performance. PCA revealed distinct separations of tobacco leaf samples from different regions on the PCA score plot, and OPLS-DA analysis further validated these differences and confirmed the model's validity through permutation testing. Twenty key aroma compounds with VIP &gt; 1.0 were screened by integrating OPLS-DA with the Random Forest classification model. These compounds showed significant differences in content among different samples, suggesting their potential as chemical markers for distinguishing the origin of flue-cured tobacco. This study not only provides a new method for identifying volatile compounds in tobacco but also offers novel insights into the geographical identification of flue-cured tobacco.</p>","PeriodicalId":64,"journal":{"name":"Analytical Methods","volume":" 27","pages":" 5736-5748"},"PeriodicalIF":2.6000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical Methods","FirstCategoryId":"92","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/ay/d5ay00531k","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

This study employed headspace gas chromatography-ion mobility spectrometry (HS-GC-IMS) technology combined with multivariate statistical analysis methods to analyze the flavor compounds in flue-cured tobacco from five different regions in China: Henan, Hunan, Yunnan, Chongqing, and Fujian. A total of 98 volatile aroma compounds were identified through HS-GC-IMS analysis, including esters, ketones, aldehydes, acids, alcohols, heterocyclic compounds, sulfur-containing compounds, other types of compounds, and 8 uncharacterized compounds. Principal Component Analysis (PCA) and Orthogonal Partial Least Squares Discriminant Analysis (OPLS-DA) were utilized to conduct dimensionality reduction and distinguish the samples, effectively recognizing differences in volatile compounds among tobacco leaves from various origins. A Random Forest (RF) classification model was constructed, and its reliability was validated through ROC (Receiver Operating Characteristic) analysis, achieving an AUC (Area Under the Curve) value of 0.980, which demonstrates exceptional predictive performance. PCA revealed distinct separations of tobacco leaf samples from different regions on the PCA score plot, and OPLS-DA analysis further validated these differences and confirmed the model's validity through permutation testing. Twenty key aroma compounds with VIP > 1.0 were screened by integrating OPLS-DA with the Random Forest classification model. These compounds showed significant differences in content among different samples, suggesting their potential as chemical markers for distinguishing the origin of flue-cured tobacco. This study not only provides a new method for identifying volatile compounds in tobacco but also offers novel insights into the geographical identification of flue-cured tobacco.

Abstract Image

基于HS-GC-IMS和多元统计分析的不同产地烟草香气成分差异分析
本研究采用顶空气相色谱-离子迁移谱(HS-GC-IMS)技术结合多元统计分析方法,对中国河南、湖南、云南、重庆、福建5个地区的烤烟进行风味成分分析。通过HS-GC-IMS分析共鉴定出98种挥发性香气化合物,包括酯类、酮类、醛类、酸类、醇类、杂环类、含硫类及其他类型化合物,以及8种未鉴定的化合物。利用主成分分析(PCA)和正交偏最小二乘判别分析(OPLS-DA)进行降维和区分,有效识别不同产地烟叶挥发物的差异。构建随机森林(Random Forest, RF)分类模型,通过ROC (Receiver Operating Characteristic)分析验证其可靠性,曲线下面积(AUC)值为0.980,具有较好的预测性能。PCA分析显示,不同地区烟叶样本在PCA评分图上存在明显的分离,OPLS-DA分析进一步验证了这些差异,并通过置换检验证实了模型的有效性。将OPLS-DA与随机森林分类模型相结合,筛选出VIP > 1.0的20个关键香气化合物。这些化合物在不同样品中的含量存在显著差异,表明它们有可能作为区分烤烟产地的化学标记物。本研究不仅为鉴定烟草中挥发性化合物提供了一种新的方法,而且为烤烟的地理鉴定提供了新的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Analytical Methods
Analytical Methods CHEMISTRY, ANALYTICAL-FOOD SCIENCE & TECHNOLOGY
CiteScore
5.10
自引率
3.20%
发文量
569
审稿时长
1.8 months
期刊介绍: Early applied demonstrations of new analytical methods with clear societal impact
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信