Effects of Pre-Processing and Principal Components for Artificial Neural Network in Non-Destructive Internal Quality Prediction of Mango across Different Harvest Periods
{"title":"Effects of Pre-Processing and Principal Components for Artificial Neural Network in Non-Destructive Internal Quality Prediction of Mango across Different Harvest Periods","authors":"Yit Peng Tan, K. Chia","doi":"10.1109/ICCSCE58721.2023.10237167","DOIUrl":null,"url":null,"abstract":"Dry Matter content (DMC) is one of the important components can be used to determine the quality of mango. However, DMC measurement is destructive and tedious. Although near infrared spectroscopy (NIRS) is a promising fast and non-destructive analytical technique, the accuracy of NIRS is susceptible with the use of pre-processing methods throughout the NIRS modeling process across different harvest periods. Thus, this research aims to investigate the effects of pre-processing methods on two popular different machine learning models (i.e. artificial neural network (ANN) and principal components ANN (PCs-ANN) in non-destructive internal quality prediction using NIRS. Two different pre-processing methods i.e. Standard Normalized Vector (SNV) and second order Savitzky-Golay (SG) would be applied to pre-process the near infrared spectra prior to the machine learning process. Result indicates that both ANN and PCs-ANN with second order SG outperformed that with SNV and that without pre-processing method. Additionally, based on root mean squared errors of validation (RMSEV), ANN outperformed PCs-ANN; while PCs-ANN outperformed ANN when root mean squared errors of prediction (RMSEP) was referred to. This indicates that ANN tends to be over-fitted while the involvement of principal component improves the robustness of ANN when it is applied to new samples that harvested from different periods.","PeriodicalId":287947,"journal":{"name":"2023 IEEE 13th International Conference on Control System, Computing and Engineering (ICCSCE)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 13th International Conference on Control System, Computing and Engineering (ICCSCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSCE58721.2023.10237167","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Dry Matter content (DMC) is one of the important components can be used to determine the quality of mango. However, DMC measurement is destructive and tedious. Although near infrared spectroscopy (NIRS) is a promising fast and non-destructive analytical technique, the accuracy of NIRS is susceptible with the use of pre-processing methods throughout the NIRS modeling process across different harvest periods. Thus, this research aims to investigate the effects of pre-processing methods on two popular different machine learning models (i.e. artificial neural network (ANN) and principal components ANN (PCs-ANN) in non-destructive internal quality prediction using NIRS. Two different pre-processing methods i.e. Standard Normalized Vector (SNV) and second order Savitzky-Golay (SG) would be applied to pre-process the near infrared spectra prior to the machine learning process. Result indicates that both ANN and PCs-ANN with second order SG outperformed that with SNV and that without pre-processing method. Additionally, based on root mean squared errors of validation (RMSEV), ANN outperformed PCs-ANN; while PCs-ANN outperformed ANN when root mean squared errors of prediction (RMSEP) was referred to. This indicates that ANN tends to be over-fitted while the involvement of principal component improves the robustness of ANN when it is applied to new samples that harvested from different periods.