Lactose prediction in dry milk with hyperspectral imaging: A data analysis competition at the “International Workshop on Spectroscopy and Chemometrics 2024”
Maria Frizzarin , Vicky Caponigro , Katarina Domijan , Arnaud Molle , Timilehin Aderinola , Thach Le Nguyen , Davide Serramazza , Georgiana Ifrim , Agnieszka Konkolewska
{"title":"Lactose prediction in dry milk with hyperspectral imaging: A data analysis competition at the “International Workshop on Spectroscopy and Chemometrics 2024”","authors":"Maria Frizzarin , Vicky Caponigro , Katarina Domijan , Arnaud Molle , Timilehin Aderinola , Thach Le Nguyen , Davide Serramazza , Georgiana Ifrim , Agnieszka Konkolewska","doi":"10.1016/j.chemolab.2024.105279","DOIUrl":null,"url":null,"abstract":"<div><div>In April 2024, the Vistamilk SFI Research Centre organized the fourth edition of the “International Workshop on Spectroscopy and Chemometrics — Spectroscopy meets modern Statistics”. Within this event, a data challenge was organized among workshop participants, focusing on hyperspectral imaging (HSI) of milk samples.</div><div>Milk is a complex emulsion comprising of fats, water, proteins, and carbohydrates. Due to the widespread prevalence of lactose intolerance, precise lactose quantification in milk samples became necessary for the dairy industry.</div><div>The dataset provided to the participants contained spectral data extracted from HSI, without the spatial information, obtained from 72 samples with reference laboratory values for lactose concentration [mg/mL]. The winning strategy was built using ROCKET, a convolutional-based method that was originally designed for time series classification, which achieved a Pearson correlation of 0.86 and RMSE of 9.8 on the test set. The present paper describes the approaches and statistical methods adopted by all the participants to analyse the data and develop the lactose prediction models.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"258 ","pages":"Article 105279"},"PeriodicalIF":3.7000,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemometrics and Intelligent Laboratory Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169743924002193","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In April 2024, the Vistamilk SFI Research Centre organized the fourth edition of the “International Workshop on Spectroscopy and Chemometrics — Spectroscopy meets modern Statistics”. Within this event, a data challenge was organized among workshop participants, focusing on hyperspectral imaging (HSI) of milk samples.
Milk is a complex emulsion comprising of fats, water, proteins, and carbohydrates. Due to the widespread prevalence of lactose intolerance, precise lactose quantification in milk samples became necessary for the dairy industry.
The dataset provided to the participants contained spectral data extracted from HSI, without the spatial information, obtained from 72 samples with reference laboratory values for lactose concentration [mg/mL]. The winning strategy was built using ROCKET, a convolutional-based method that was originally designed for time series classification, which achieved a Pearson correlation of 0.86 and RMSE of 9.8 on the test set. The present paper describes the approaches and statistical methods adopted by all the participants to analyse the data and develop the lactose prediction models.
期刊介绍:
Chemometrics and Intelligent Laboratory Systems publishes original research papers, short communications, reviews, tutorials and Original Software Publications reporting on development of novel statistical, mathematical, or computer techniques in Chemistry and related disciplines.
Chemometrics is the chemical discipline that uses mathematical and statistical methods to design or select optimal procedures and experiments, and to provide maximum chemical information by analysing chemical data.
The journal deals with the following topics:
1) Development of new statistical, mathematical and chemometrical methods for Chemistry and related fields (Environmental Chemistry, Biochemistry, Toxicology, System Biology, -Omics, etc.)
2) Novel applications of chemometrics to all branches of Chemistry and related fields (typical domains of interest are: process data analysis, experimental design, data mining, signal processing, supervised modelling, decision making, robust statistics, mixture analysis, multivariate calibration etc.) Routine applications of established chemometrical techniques will not be considered.
3) Development of new software that provides novel tools or truly advances the use of chemometrical methods.
4) Well characterized data sets to test performance for the new methods and software.
The journal complies with International Committee of Medical Journal Editors'' Uniform requirements for manuscripts.