{"title":"Detection of amylose content in rice samples with spectral augmentation and advanced machine learning","authors":"Kamini G. Panchbhai , Madhusudan G. Lanjewar","doi":"10.1016/j.jfca.2025.107455","DOIUrl":null,"url":null,"abstract":"<div><div>Amylose content (AC) in rice samples constitutes one of the many essential indicators for determining its eating taste and quality. This paper describes an integrated strategy using spectroscopy (12000–4000cm<sup>−1</sup>), spectral pre-processing, data/spectral augmentation, wavelengths/features selection, and sophisticated machine learning (ML) approaches to identify AC in rice samples reliably. Two scenarios were tested: one with the original dataset and another with SMOTE-augmented spectral dataset. Furthermore, the performance of the models was also evaluated on segmented wavelengths range 9000–4000 cm<sup>−1</sup>. The most informative wavelengths were chosen and used in different ML models, including stacking regressors and classifiers for forecasting the AC. The regression analysis yielded the best results with a coefficient of determination (R²) of 0.968, Root Mean Squared Error (RMSE) of 0.306, and Ratio of performance of deviation (RPD) of 5.564. The classification performance achieved an F1 score of 99.0 %, with a 5-fold average F1 score of 91.2 %. These findings show that the suggested technique can identify AC in rice in a non-destructive and dependable manner, significantly improving model performance over previous works.</div></div>","PeriodicalId":15867,"journal":{"name":"Journal of Food Composition and Analysis","volume":"142 ","pages":"Article 107455"},"PeriodicalIF":4.0000,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Composition and Analysis","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0889157525002704","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
Amylose content (AC) in rice samples constitutes one of the many essential indicators for determining its eating taste and quality. This paper describes an integrated strategy using spectroscopy (12000–4000cm−1), spectral pre-processing, data/spectral augmentation, wavelengths/features selection, and sophisticated machine learning (ML) approaches to identify AC in rice samples reliably. Two scenarios were tested: one with the original dataset and another with SMOTE-augmented spectral dataset. Furthermore, the performance of the models was also evaluated on segmented wavelengths range 9000–4000 cm−1. The most informative wavelengths were chosen and used in different ML models, including stacking regressors and classifiers for forecasting the AC. The regression analysis yielded the best results with a coefficient of determination (R²) of 0.968, Root Mean Squared Error (RMSE) of 0.306, and Ratio of performance of deviation (RPD) of 5.564. The classification performance achieved an F1 score of 99.0 %, with a 5-fold average F1 score of 91.2 %. These findings show that the suggested technique can identify AC in rice in a non-destructive and dependable manner, significantly improving model performance over previous works.
期刊介绍:
The Journal of Food Composition and Analysis publishes manuscripts on scientific aspects of data on the chemical composition of human foods, with particular emphasis on actual data on composition of foods; analytical methods; studies on the manipulation, storage, distribution and use of food composition data; and studies on the statistics, use and distribution of such data and data systems. The Journal''s basis is nutrient composition, with increasing emphasis on bioactive non-nutrient and anti-nutrient components. Papers must provide sufficient description of the food samples, analytical methods, quality control procedures and statistical treatments of the data to permit the end users of the food composition data to evaluate the appropriateness of such data in their projects.
The Journal does not publish papers on: microbiological compounds; sensory quality; aromatics/volatiles in food and wine; essential oils; organoleptic characteristics of food; physical properties; or clinical papers and pharmacology-related papers.