Carlos Cambra, Félix Movilla, Félix de Miguel, Daniel Urda, Nuria Velasco, Álvaro Herrero
{"title":"现实世界的工业物联网数据集,用于金属加工液的预测性维护","authors":"Carlos Cambra, Félix Movilla, Félix de Miguel, Daniel Urda, Nuria Velasco, Álvaro Herrero","doi":"10.1016/j.dib.2025.112020","DOIUrl":null,"url":null,"abstract":"<div><div>This article presents a multivariate time series dataset detailing the physicochemical degradation of an industrial metalworking fluid (MWF). The data were collected continuously over several months from a test tank under typical operational conditions at an industrial facility in Spain. Four critical variables were monitored using industrial-grade sensors: pH, temperature, concentration, and conductivity. The dataset is provided in five CSV files. The primary file, measures.csv, contains the preprocessed time series at a uniform 5-minute frequency, with authentic missing data gaps intentionally preserved to reflect real-world sensor and connectivity issues. The four additional files serve as a comprehensive benchmark for data imputation algorithms. Each of these benchmark files corresponds to a single variable and includes the original data alongside imputed values generated by five distinct methods: K-Nearest Neighbours (KNN), a hybrid model (HybridKCL), an LSTM-based Variational Autoencoder (LSTM-VAE), and both pre-trained and fine-tuned versions of the MOMENT foundation model. This resource enables researchers and practitioners to develop, validate, and compare predictive maintenance models, anomaly detection systems, and advanced imputation techniques. Furthermore, it serves as a valuable educational tool for addressing common challenges in industrial IoT data, fostering advancements in sustainable and efficient manufacturing.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"62 ","pages":"Article 112020"},"PeriodicalIF":1.4000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A real-world iiot dataset for predictive maintenance of metalworking fluids\",\"authors\":\"Carlos Cambra, Félix Movilla, Félix de Miguel, Daniel Urda, Nuria Velasco, Álvaro Herrero\",\"doi\":\"10.1016/j.dib.2025.112020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This article presents a multivariate time series dataset detailing the physicochemical degradation of an industrial metalworking fluid (MWF). The data were collected continuously over several months from a test tank under typical operational conditions at an industrial facility in Spain. Four critical variables were monitored using industrial-grade sensors: pH, temperature, concentration, and conductivity. The dataset is provided in five CSV files. The primary file, measures.csv, contains the preprocessed time series at a uniform 5-minute frequency, with authentic missing data gaps intentionally preserved to reflect real-world sensor and connectivity issues. The four additional files serve as a comprehensive benchmark for data imputation algorithms. Each of these benchmark files corresponds to a single variable and includes the original data alongside imputed values generated by five distinct methods: K-Nearest Neighbours (KNN), a hybrid model (HybridKCL), an LSTM-based Variational Autoencoder (LSTM-VAE), and both pre-trained and fine-tuned versions of the MOMENT foundation model. This resource enables researchers and practitioners to develop, validate, and compare predictive maintenance models, anomaly detection systems, and advanced imputation techniques. Furthermore, it serves as a valuable educational tool for addressing common challenges in industrial IoT data, fostering advancements in sustainable and efficient manufacturing.</div></div>\",\"PeriodicalId\":10973,\"journal\":{\"name\":\"Data in Brief\",\"volume\":\"62 \",\"pages\":\"Article 112020\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Data in Brief\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352340925007425\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340925007425","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
A real-world iiot dataset for predictive maintenance of metalworking fluids
This article presents a multivariate time series dataset detailing the physicochemical degradation of an industrial metalworking fluid (MWF). The data were collected continuously over several months from a test tank under typical operational conditions at an industrial facility in Spain. Four critical variables were monitored using industrial-grade sensors: pH, temperature, concentration, and conductivity. The dataset is provided in five CSV files. The primary file, measures.csv, contains the preprocessed time series at a uniform 5-minute frequency, with authentic missing data gaps intentionally preserved to reflect real-world sensor and connectivity issues. The four additional files serve as a comprehensive benchmark for data imputation algorithms. Each of these benchmark files corresponds to a single variable and includes the original data alongside imputed values generated by five distinct methods: K-Nearest Neighbours (KNN), a hybrid model (HybridKCL), an LSTM-based Variational Autoencoder (LSTM-VAE), and both pre-trained and fine-tuned versions of the MOMENT foundation model. This resource enables researchers and practitioners to develop, validate, and compare predictive maintenance models, anomaly detection systems, and advanced imputation techniques. Furthermore, it serves as a valuable educational tool for addressing common challenges in industrial IoT data, fostering advancements in sustainable and efficient manufacturing.
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.