Guidelines to build PLS-DA chemometric classification models using a GC-IMS method: Dry-cured ham as a case of study

IF 4.1 Q1 CHEMISTRY, ANALYTICAL
Andrés Martín-Gómez , Pablo Rodríguez-Hernández , María José Cardador , Belén Vega-Márquez , Vicente Rodríguez-Estévez , Lourdes Arce
{"title":"Guidelines to build PLS-DA chemometric classification models using a GC-IMS method: Dry-cured ham as a case of study","authors":"Andrés Martín-Gómez ,&nbsp;Pablo Rodríguez-Hernández ,&nbsp;María José Cardador ,&nbsp;Belén Vega-Márquez ,&nbsp;Vicente Rodríguez-Estévez ,&nbsp;Lourdes Arce","doi":"10.1016/j.talo.2022.100175","DOIUrl":null,"url":null,"abstract":"<div><p>The number of representative samples to build a calibration model plays a major role in the success of chemometric models for class discrimination; therefore, knowing which samples should be used for the calibration of prediction models is essential. The aim of this work is to design a basic guideline for the training of partial least squares discriminant analysis (PLS-DA) models to classify complex samples analysed by Gas Chromatography (GC) coupled to Ion Mobility Spectrometry (IMS) using dry-cured Iberian ham as an example. The effect of the number, proportion and class of samples for training and validation and the use of two data types (spectral fingerprint or pre-selected markers) has been assessed by analysing with GC-IMS nearly 1000 dry-cured Iberian ham samples obtained from 7 different curing plants. Subsequently, these were classified with PLS-DA according to the pig's feeding regime (acorn-fed vs. feed-fed) and it has been demonstrated that 450 out of 997 samples are enough for model training to achieve a maximum average prediction accuracy rate. Furthermore, the use of pre-selected GC-IMS markers provides slightly better prediction results than the use of the complete spectral fingerprint. In summary, these results represent a tentative guide for the classification of samples in an industrial setting using GC-IMS and PLS-DA. This methodology would allow authorities and producers to ensure the quality of the agri-food products put on the market as is proven in this study.</p></div>","PeriodicalId":436,"journal":{"name":"Talanta Open","volume":"7 ","pages":"Article 100175"},"PeriodicalIF":4.1000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Talanta Open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666831922000923","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

The number of representative samples to build a calibration model plays a major role in the success of chemometric models for class discrimination; therefore, knowing which samples should be used for the calibration of prediction models is essential. The aim of this work is to design a basic guideline for the training of partial least squares discriminant analysis (PLS-DA) models to classify complex samples analysed by Gas Chromatography (GC) coupled to Ion Mobility Spectrometry (IMS) using dry-cured Iberian ham as an example. The effect of the number, proportion and class of samples for training and validation and the use of two data types (spectral fingerprint or pre-selected markers) has been assessed by analysing with GC-IMS nearly 1000 dry-cured Iberian ham samples obtained from 7 different curing plants. Subsequently, these were classified with PLS-DA according to the pig's feeding regime (acorn-fed vs. feed-fed) and it has been demonstrated that 450 out of 997 samples are enough for model training to achieve a maximum average prediction accuracy rate. Furthermore, the use of pre-selected GC-IMS markers provides slightly better prediction results than the use of the complete spectral fingerprint. In summary, these results represent a tentative guide for the classification of samples in an industrial setting using GC-IMS and PLS-DA. This methodology would allow authorities and producers to ensure the quality of the agri-food products put on the market as is proven in this study.

Abstract Image

用GC-IMS方法建立PLS-DA化学计量分类模型的指南——以干腌火腿为例
建立校正模型的代表性样本数量对化学计量模型的成功与否起着重要的作用;因此,知道哪些样本应该用于预测模型的校准是至关重要的。以干腌伊比利亚火腿为例,设计了一种训练偏最小二乘判别分析(PLS-DA)模型的基本准则,以对气相色谱(GC) -离子迁移率谱(IMS)分析的复杂样品进行分类。通过使用GC-IMS分析从7个不同的腌制工厂获得的近1000个干腌伊比利亚火腿样品,评估了用于训练和验证的样品数量、比例和类别以及两种数据类型(光谱指纹或预选标记)的使用的影响。随后,根据猪的喂养方式(橡子喂养与饲料喂养),使用PLS-DA对这些样本进行分类,并证明997个样本中有450个样本足以用于模型训练,以达到最大的平均预测准确率。此外,使用预先选择的GC-IMS标记比使用完整的光谱指纹提供略好的预测结果。总之,这些结果为在工业环境中使用GC-IMS和PLS-DA进行样品分类提供了初步指导。正如本研究所证明的那样,这种方法将使当局和生产者能够确保投放市场的农产品的质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Talanta Open
Talanta Open Chemistry-Analytical Chemistry
CiteScore
5.20
自引率
0.00%
发文量
86
审稿时长
49 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信