数据集和基于机器学习的计算机辅助工具,用于模拟干燥羊皮纸和绿咖啡豆的工作吸附等温线

IF 1.4 Q3 MULTIDISCIPLINARY SCIENCES
Gentil A. Collazos-Escobar , Andrés F. Bahamón-Monje , Nelson Gutiérrez-Guzmán
{"title":"数据集和基于机器学习的计算机辅助工具,用于模拟干燥羊皮纸和绿咖啡豆的工作吸附等温线","authors":"Gentil A. Collazos-Escobar ,&nbsp;Andrés F. Bahamón-Monje ,&nbsp;Nelson Gutiérrez-Guzmán","doi":"10.1016/j.dib.2025.111738","DOIUrl":null,"url":null,"abstract":"<div><div>This work presents a comprehensive dataset on working sorption isotherms and mid-infrared spectra for parchment husk, parchment coffee, and green coffee beans (<em>Coffea arabica</em> L.). The working sorption isotherms were experimentally determined using the Dynamic Dewpoint Isotherm (DDI) method, covering typical storage conditions of dried coffee beans in warehouses. These conditions account for a water activity range (a<sub>w</sub>) from 0.1 to 0.9 and temperatures of 25°C, 35°C, and 45°C. Furthermore, the mid-infrared spectra of parchment husk and green coffee were obtained using Attenuated Total Reflectance-Fourier Transform Infrared (ATR-FTIR) spectroscopy as a complementary tool to analyze the role of parchment covering in the water sorption behavior of parchment coffee beans. The dataset also provides computer-aided tools for the mathematical modeling of working sorption isotherms and infrared data of coffee. These tools were developed using MATLAB® R2023a (The MathWorks Inc., Natick, MA, USA) and offer users the ability to model sorption isotherms and analyze infrared spectral data using advanced machine learning techniques. Thereby, the MATLAB scripts implement an automated routine for the calibration and optimization of the Support Vector Machine (SVM) and Random Forest (RF) techniques, enabling the modeling of working sorption isotherms for each coffee type (considering only a<sub>w</sub> and temperature) and in a multivariate approach (incorporating a<sub>w</sub>, temperature, and coffee type) to predict the equilibrium moisture content (X<sub>e</sub>). Additionally, the MATLAB script for Principal Component Analysis (PCA) enables users to perform advanced chemometric modeling of the coffee spectra. This script provides a latent-variable-based tool for analyzing spectral patterns associated with different coffee types, allowing for robust model-based differentiation of coffee samples using their infrared properties. These models are particularly valuable as digital representations of the coffee storage process and can be used to optimize storage conditions, understand hygroscopic behavior, and ensure moisture-based quality monitoring in parchment and green coffee beans. The experimental dataset, including working sorption isotherms and mid-infrared spectra, is organized into Excel sheets according to experimental conditions and replicates. The MATLAB scripts come with ready-to-use computational instructions for calibrating predictive models, ensuring precise fitting of isotherms and spectral properties. This dataset represents a valuable asset for researchers, coffee producers, and industry stakeholders, providing practical tools for storage optimization, shelf-life determination, and in-depth analysis of water sorption behavior across different coffee processing stages.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"61 ","pages":"Article 111738"},"PeriodicalIF":1.4000,"publicationDate":"2025-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dataset and machine learning-based computer-aided tools for modeling working sorption isotherms in dried parchment and green coffee beans\",\"authors\":\"Gentil A. Collazos-Escobar ,&nbsp;Andrés F. Bahamón-Monje ,&nbsp;Nelson Gutiérrez-Guzmán\",\"doi\":\"10.1016/j.dib.2025.111738\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This work presents a comprehensive dataset on working sorption isotherms and mid-infrared spectra for parchment husk, parchment coffee, and green coffee beans (<em>Coffea arabica</em> L.). The working sorption isotherms were experimentally determined using the Dynamic Dewpoint Isotherm (DDI) method, covering typical storage conditions of dried coffee beans in warehouses. These conditions account for a water activity range (a<sub>w</sub>) from 0.1 to 0.9 and temperatures of 25°C, 35°C, and 45°C. Furthermore, the mid-infrared spectra of parchment husk and green coffee were obtained using Attenuated Total Reflectance-Fourier Transform Infrared (ATR-FTIR) spectroscopy as a complementary tool to analyze the role of parchment covering in the water sorption behavior of parchment coffee beans. The dataset also provides computer-aided tools for the mathematical modeling of working sorption isotherms and infrared data of coffee. These tools were developed using MATLAB® R2023a (The MathWorks Inc., Natick, MA, USA) and offer users the ability to model sorption isotherms and analyze infrared spectral data using advanced machine learning techniques. Thereby, the MATLAB scripts implement an automated routine for the calibration and optimization of the Support Vector Machine (SVM) and Random Forest (RF) techniques, enabling the modeling of working sorption isotherms for each coffee type (considering only a<sub>w</sub> and temperature) and in a multivariate approach (incorporating a<sub>w</sub>, temperature, and coffee type) to predict the equilibrium moisture content (X<sub>e</sub>). Additionally, the MATLAB script for Principal Component Analysis (PCA) enables users to perform advanced chemometric modeling of the coffee spectra. This script provides a latent-variable-based tool for analyzing spectral patterns associated with different coffee types, allowing for robust model-based differentiation of coffee samples using their infrared properties. These models are particularly valuable as digital representations of the coffee storage process and can be used to optimize storage conditions, understand hygroscopic behavior, and ensure moisture-based quality monitoring in parchment and green coffee beans. The experimental dataset, including working sorption isotherms and mid-infrared spectra, is organized into Excel sheets according to experimental conditions and replicates. The MATLAB scripts come with ready-to-use computational instructions for calibrating predictive models, ensuring precise fitting of isotherms and spectral properties. This dataset represents a valuable asset for researchers, coffee producers, and industry stakeholders, providing practical tools for storage optimization, shelf-life determination, and in-depth analysis of water sorption behavior across different coffee processing stages.</div></div>\",\"PeriodicalId\":10973,\"journal\":{\"name\":\"Data in Brief\",\"volume\":\"61 \",\"pages\":\"Article 111738\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data in Brief\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352340925004652\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340925004652","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

本文介绍了羊皮纸外壳、羊皮纸咖啡和绿咖啡豆(Coffea arabica L.)的工作吸收等温线和中红外光谱的综合数据集。采用动态露点等温线(DDI)方法测定了干燥咖啡豆在仓库中的典型储存条件下的工作吸附等温线。这些条件解释了水活度范围(aw)从0.1到0.9,温度为25°C, 35°C和45°C。此外,利用衰减全反射-傅里叶变换红外(ATR-FTIR)光谱作为补充工具,获得了羊皮纸外壳和生咖啡的中红外光谱,分析了羊皮纸覆盖物对羊皮纸咖啡豆吸水行为的影响。该数据集还为咖啡的工作吸附等温线和红外数据的数学建模提供了计算机辅助工具。这些工具是使用MATLAB®R2023a (The MathWorks Inc., Natick, MA, USA)开发的,并使用户能够使用先进的机器学习技术模拟吸收等温线并分析红外光谱数据。因此,MATLAB脚本实现了用于校准和优化支持向量机(SVM)和随机森林(RF)技术的自动化例程,从而能够对每种咖啡类型(仅考虑aw和温度)的工作吸附等温线进行建模,并以多元方法(结合aw,温度和咖啡类型)来预测平衡水分含量(Xe)。此外,主成分分析(PCA)的MATLAB脚本使用户能够执行咖啡光谱的高级化学计量建模。这个脚本提供了一个基于潜在变量的工具,用于分析与不同咖啡类型相关的光谱模式,允许使用咖啡样品的红外特性进行基于模型的鲁棒区分。这些模型作为咖啡储存过程的数字表示特别有价值,可用于优化储存条件,了解吸湿性行为,并确保羊皮纸和绿咖啡豆的水分质量监测。实验数据集,包括工作吸收等温线和中红外光谱,根据实验条件和重复,组织成Excel表格。MATLAB脚本附带了随时可用的计算指令,用于校准预测模型,确保等温线和光谱特性的精确拟合。该数据集为研究人员、咖啡生产商和行业利益相关者提供了宝贵的资产,为存储优化、保质期确定和深入分析不同咖啡加工阶段的吸水行为提供了实用工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Dataset and machine learning-based computer-aided tools for modeling working sorption isotherms in dried parchment and green coffee beans
This work presents a comprehensive dataset on working sorption isotherms and mid-infrared spectra for parchment husk, parchment coffee, and green coffee beans (Coffea arabica L.). The working sorption isotherms were experimentally determined using the Dynamic Dewpoint Isotherm (DDI) method, covering typical storage conditions of dried coffee beans in warehouses. These conditions account for a water activity range (aw) from 0.1 to 0.9 and temperatures of 25°C, 35°C, and 45°C. Furthermore, the mid-infrared spectra of parchment husk and green coffee were obtained using Attenuated Total Reflectance-Fourier Transform Infrared (ATR-FTIR) spectroscopy as a complementary tool to analyze the role of parchment covering in the water sorption behavior of parchment coffee beans. The dataset also provides computer-aided tools for the mathematical modeling of working sorption isotherms and infrared data of coffee. These tools were developed using MATLAB® R2023a (The MathWorks Inc., Natick, MA, USA) and offer users the ability to model sorption isotherms and analyze infrared spectral data using advanced machine learning techniques. Thereby, the MATLAB scripts implement an automated routine for the calibration and optimization of the Support Vector Machine (SVM) and Random Forest (RF) techniques, enabling the modeling of working sorption isotherms for each coffee type (considering only aw and temperature) and in a multivariate approach (incorporating aw, temperature, and coffee type) to predict the equilibrium moisture content (Xe). Additionally, the MATLAB script for Principal Component Analysis (PCA) enables users to perform advanced chemometric modeling of the coffee spectra. This script provides a latent-variable-based tool for analyzing spectral patterns associated with different coffee types, allowing for robust model-based differentiation of coffee samples using their infrared properties. These models are particularly valuable as digital representations of the coffee storage process and can be used to optimize storage conditions, understand hygroscopic behavior, and ensure moisture-based quality monitoring in parchment and green coffee beans. The experimental dataset, including working sorption isotherms and mid-infrared spectra, is organized into Excel sheets according to experimental conditions and replicates. The MATLAB scripts come with ready-to-use computational instructions for calibrating predictive models, ensuring precise fitting of isotherms and spectral properties. This dataset represents a valuable asset for researchers, coffee producers, and industry stakeholders, providing practical tools for storage optimization, shelf-life determination, and in-depth analysis of water sorption behavior across different coffee processing stages.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Data in Brief
Data in Brief MULTIDISCIPLINARY SCIENCES-
CiteScore
3.10
自引率
0.00%
发文量
996
审稿时长
70 days
期刊介绍: Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信