Prediction of soluble solids content using near-infrared spectra and optical properties of intact apple and pulp applying PLSR and CNN.

Shuochong Zeng, Zongyi Zhang, Xiaodong Cheng, Xiao Cai, Mengke Cao, Wenchuan Guo
{"title":"Prediction of soluble solids content using near-infrared spectra and optical properties of intact apple and pulp applying PLSR and CNN.","authors":"Shuochong Zeng,&nbsp;Zongyi Zhang,&nbsp;Xiaodong Cheng,&nbsp;Xiao Cai,&nbsp;Mengke Cao,&nbsp;Wenchuan Guo","doi":"10.1016/j.saa.2023.123402","DOIUrl":null,"url":null,"abstract":"<p><p>Soluble solids content (SSC) is one of the most important internal quality attributes of fruit and could be predicted using near-infrared (NIR) spectra and optical properties. Partial least squares regression (PLSR) is a conventional regression method in SSC prediction. In recent years, deep learning methods represented by convolutional neural network (CNN) was suggested to be implied in spectral analysis. However, researchers are inevitably facing problems with regard to the selection of spectral pretreatment methods and the evaluation of the performance of the chosen regression. This study employed PLSR and CNN regression to predict SSC of apple based on the collected diffuse reflectance spectra of intact apple, total reflectance and total transmittance spectra of apple pulp, and the calculated optical property spectra, i.e., absorption coefficient and reduced scattering coefficient spectra of apple pulp. Five different spectral pretreatment methods were exerted on these spectra. Results showed that at a given regression (PLSR or CNN), the built models based on the diffuse reflectance spectra of intact apple had the best SSC prediction, and the built models based on pulp's reduced scattering coefficient spectra had the poorest prediction performance. The best prediction performance was achieved by PLSR models using Savitzky-Golay with multiple scattering correction (R<sub>p</sub> = 0.96, RMSEP = 0.54 %) and CNN regressions using Savitzky-Golay with standard normal variational transformation (R<sub>p</sub> = 0.95, RMSEP = 0.59 %), respectively. Additionally, when the unknown original spectra were used for modeling, CNN had a better performance compared to PLSR, indicating the outstanding preponderance of CNN in spectral analysis. This study provides an effective reference for the selection of chemometric method based on NIR spectra.</p>","PeriodicalId":94213,"journal":{"name":"Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy","volume":"304 ","pages":"123402"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.saa.2023.123402","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/9/12 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Soluble solids content (SSC) is one of the most important internal quality attributes of fruit and could be predicted using near-infrared (NIR) spectra and optical properties. Partial least squares regression (PLSR) is a conventional regression method in SSC prediction. In recent years, deep learning methods represented by convolutional neural network (CNN) was suggested to be implied in spectral analysis. However, researchers are inevitably facing problems with regard to the selection of spectral pretreatment methods and the evaluation of the performance of the chosen regression. This study employed PLSR and CNN regression to predict SSC of apple based on the collected diffuse reflectance spectra of intact apple, total reflectance and total transmittance spectra of apple pulp, and the calculated optical property spectra, i.e., absorption coefficient and reduced scattering coefficient spectra of apple pulp. Five different spectral pretreatment methods were exerted on these spectra. Results showed that at a given regression (PLSR or CNN), the built models based on the diffuse reflectance spectra of intact apple had the best SSC prediction, and the built models based on pulp's reduced scattering coefficient spectra had the poorest prediction performance. The best prediction performance was achieved by PLSR models using Savitzky-Golay with multiple scattering correction (Rp = 0.96, RMSEP = 0.54 %) and CNN regressions using Savitzky-Golay with standard normal variational transformation (Rp = 0.95, RMSEP = 0.59 %), respectively. Additionally, when the unknown original spectra were used for modeling, CNN had a better performance compared to PLSR, indicating the outstanding preponderance of CNN in spectral analysis. This study provides an effective reference for the selection of chemometric method based on NIR spectra.

利用PLSR和CNN的近红外光谱和完整苹果和果肉的光学特性预测可溶性固形物含量。
可溶性固形物含量(SSC)是水果最重要的内在品质指标之一,可以利用近红外光谱和光学特性进行预测。偏最小二乘回归(PLSR)是SSC预测中的一种传统回归方法。近年来,以卷积神经网络(CNN)为代表的深度学习方法被建议隐含在频谱分析中。然而,研究人员不可避免地面临着光谱预处理方法的选择和所选回归性能的评估问题。本研究基于收集的完整苹果的漫反射光谱、苹果果肉的全反射光谱和全透射光谱,以及计算出的苹果果肉的光学性质光谱,即吸收系数和减少散射系数光谱,采用PLSR和CNN回归来预测苹果的SSC。对这些光谱采用了五种不同的光谱预处理方法。结果表明,在给定的回归(PLSR或CNN)下,基于完整苹果漫反射光谱建立的模型具有最好的SSC预测性能,而基于果肉散射系数谱的模型具有最差的预测性能。使用具有多重散射校正的Savitzky Golay的PLSR模型(Rp=0.96,RMSEP=0.54%)和使用具有标准正态变分变换的Savitz-Golay的CNN回归(Rp=0.95,RMSEP=0.59%)分别获得了最佳的预测性能。此外,当使用未知的原始光谱进行建模时,与PLSR相比,CNN具有更好的性能,表明CNN在光谱分析中的突出优势。该研究为基于近红外光谱的化学计量方法的选择提供了有效的参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信