{"title":"Prediction of soluble solids content using near-infrared spectra and optical properties of intact apple and pulp applying PLSR and CNN.","authors":"Shuochong Zeng, Zongyi Zhang, Xiaodong Cheng, Xiao Cai, Mengke Cao, Wenchuan Guo","doi":"10.1016/j.saa.2023.123402","DOIUrl":null,"url":null,"abstract":"<p><p>Soluble solids content (SSC) is one of the most important internal quality attributes of fruit and could be predicted using near-infrared (NIR) spectra and optical properties. Partial least squares regression (PLSR) is a conventional regression method in SSC prediction. In recent years, deep learning methods represented by convolutional neural network (CNN) was suggested to be implied in spectral analysis. However, researchers are inevitably facing problems with regard to the selection of spectral pretreatment methods and the evaluation of the performance of the chosen regression. This study employed PLSR and CNN regression to predict SSC of apple based on the collected diffuse reflectance spectra of intact apple, total reflectance and total transmittance spectra of apple pulp, and the calculated optical property spectra, i.e., absorption coefficient and reduced scattering coefficient spectra of apple pulp. Five different spectral pretreatment methods were exerted on these spectra. Results showed that at a given regression (PLSR or CNN), the built models based on the diffuse reflectance spectra of intact apple had the best SSC prediction, and the built models based on pulp's reduced scattering coefficient spectra had the poorest prediction performance. The best prediction performance was achieved by PLSR models using Savitzky-Golay with multiple scattering correction (R<sub>p</sub> = 0.96, RMSEP = 0.54 %) and CNN regressions using Savitzky-Golay with standard normal variational transformation (R<sub>p</sub> = 0.95, RMSEP = 0.59 %), respectively. Additionally, when the unknown original spectra were used for modeling, CNN had a better performance compared to PLSR, indicating the outstanding preponderance of CNN in spectral analysis. This study provides an effective reference for the selection of chemometric method based on NIR spectra.</p>","PeriodicalId":94213,"journal":{"name":"Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy","volume":"304 ","pages":"123402"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.saa.2023.123402","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/9/12 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Soluble solids content (SSC) is one of the most important internal quality attributes of fruit and could be predicted using near-infrared (NIR) spectra and optical properties. Partial least squares regression (PLSR) is a conventional regression method in SSC prediction. In recent years, deep learning methods represented by convolutional neural network (CNN) was suggested to be implied in spectral analysis. However, researchers are inevitably facing problems with regard to the selection of spectral pretreatment methods and the evaluation of the performance of the chosen regression. This study employed PLSR and CNN regression to predict SSC of apple based on the collected diffuse reflectance spectra of intact apple, total reflectance and total transmittance spectra of apple pulp, and the calculated optical property spectra, i.e., absorption coefficient and reduced scattering coefficient spectra of apple pulp. Five different spectral pretreatment methods were exerted on these spectra. Results showed that at a given regression (PLSR or CNN), the built models based on the diffuse reflectance spectra of intact apple had the best SSC prediction, and the built models based on pulp's reduced scattering coefficient spectra had the poorest prediction performance. The best prediction performance was achieved by PLSR models using Savitzky-Golay with multiple scattering correction (Rp = 0.96, RMSEP = 0.54 %) and CNN regressions using Savitzky-Golay with standard normal variational transformation (Rp = 0.95, RMSEP = 0.59 %), respectively. Additionally, when the unknown original spectra were used for modeling, CNN had a better performance compared to PLSR, indicating the outstanding preponderance of CNN in spectral analysis. This study provides an effective reference for the selection of chemometric method based on NIR spectra.