Data fusion of visible near-infrared and mid-infrared spectroscopy combined with feature selection and machine learning for rapid discrimination of fusarium head blight infection in wheat kernel and flour
Muhammad Baraa Almoujahed, Rebecca L. Whetton, Abdul M. Mouazen
{"title":"Data fusion of visible near-infrared and mid-infrared spectroscopy combined with feature selection and machine learning for rapid discrimination of fusarium head blight infection in wheat kernel and flour","authors":"Muhammad Baraa Almoujahed, Rebecca L. Whetton, Abdul M. Mouazen","doi":"10.1016/j.infrared.2025.106072","DOIUrl":null,"url":null,"abstract":"<div><div>Fusarium head blight (FHB) is a significant crop fungal disease that downgrades the yield quality and affects food safety. There is a necessity for the development of fast and cost-effective detection approaches of FHB to meet the needs of the food industry, as the traditional methods are slow, costly, difficult, and expose chemicals to the environment. Visible near-infrared (vis-NIR) and mid-infrared (MIR) spectroscopy have been used as promising tools for the detection of FHB contamination and mycotoxins in cereal crops and foods. This study explores the potential of data fusion approaches of the vis-NIR (400–1650 nm) and MIR (4000 – 650 cm<sup>−1</sup>) spectra for FHB detection in wheat kernels and flour of eight varieties of wheat. Spectra concatenation and feature selection methods were utilized as input data for two different machine learning models, namely, random forest (RF), and decision tree (DT). For the selection of the most informative wavebands from both sensors, genetic algorithm (GA), recursive feature elimination (RFE), and principal component analysis (PCA), were employed. Results showed that spectral concatenation data fusion has resulted in very high test accuracy for FHB detection in both kernel and flour, with all models reaching 100% classification accuracy, except the RF-kernel model, which achieved 96.6%. Among the three feature selection algorithms, GA was the best method for the selection of the most informative bands related to FHB, resulting in a correct classification accuracy of 100 %, for both RF and DT modelling tools. For the RFE feature selection method, a lower classification accuracy of 96.6 % was obtained with both RF and DT models in kernels. However, PCA resulted in the lowest accuracies, dropping down by 10.3 % to 17.3 %, compared to that of GA and RFE, respectively. Overall, the proposed data fusion methods allow the non-destructive, rapid, and accurate detection of FHB infection in wheat flour and kernels. This is particularly useful for the flour as it is not possible to visually estimate the infected from healthy samples.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"151 ","pages":"Article 106072"},"PeriodicalIF":3.4000,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infrared Physics & Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350449525003652","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
引用次数: 0
Abstract
Fusarium head blight (FHB) is a significant crop fungal disease that downgrades the yield quality and affects food safety. There is a necessity for the development of fast and cost-effective detection approaches of FHB to meet the needs of the food industry, as the traditional methods are slow, costly, difficult, and expose chemicals to the environment. Visible near-infrared (vis-NIR) and mid-infrared (MIR) spectroscopy have been used as promising tools for the detection of FHB contamination and mycotoxins in cereal crops and foods. This study explores the potential of data fusion approaches of the vis-NIR (400–1650 nm) and MIR (4000 – 650 cm−1) spectra for FHB detection in wheat kernels and flour of eight varieties of wheat. Spectra concatenation and feature selection methods were utilized as input data for two different machine learning models, namely, random forest (RF), and decision tree (DT). For the selection of the most informative wavebands from both sensors, genetic algorithm (GA), recursive feature elimination (RFE), and principal component analysis (PCA), were employed. Results showed that spectral concatenation data fusion has resulted in very high test accuracy for FHB detection in both kernel and flour, with all models reaching 100% classification accuracy, except the RF-kernel model, which achieved 96.6%. Among the three feature selection algorithms, GA was the best method for the selection of the most informative bands related to FHB, resulting in a correct classification accuracy of 100 %, for both RF and DT modelling tools. For the RFE feature selection method, a lower classification accuracy of 96.6 % was obtained with both RF and DT models in kernels. However, PCA resulted in the lowest accuracies, dropping down by 10.3 % to 17.3 %, compared to that of GA and RFE, respectively. Overall, the proposed data fusion methods allow the non-destructive, rapid, and accurate detection of FHB infection in wheat flour and kernels. This is particularly useful for the flour as it is not possible to visually estimate the infected from healthy samples.
期刊介绍:
The Journal covers the entire field of infrared physics and technology: theory, experiment, application, devices and instrumentation. Infrared'' is defined as covering the near, mid and far infrared (terahertz) regions from 0.75um (750nm) to 1mm (300GHz.) Submissions in the 300GHz to 100GHz region may be accepted at the editors discretion if their content is relevant to shorter wavelengths. Submissions must be primarily concerned with and directly relevant to this spectral region.
Its core topics can be summarized as the generation, propagation and detection, of infrared radiation; the associated optics, materials and devices; and its use in all fields of science, industry, engineering and medicine.
Infrared techniques occur in many different fields, notably spectroscopy and interferometry; material characterization and processing; atmospheric physics, astronomy and space research. Scientific aspects include lasers, quantum optics, quantum electronics, image processing and semiconductor physics. Some important applications are medical diagnostics and treatment, industrial inspection and environmental monitoring.