75 MW电弧镍铁炉结构健康监测系统的数据清洗方法

Jaiber Camacho‐Olarte, Julian Esteban Barrera Torres, Daniel Alfonso Garavito Jimenez, Jersson X. Leon Medina, R. C. G. Vargas, D. Cardenas, Camilo Gutierrez-Osorio, B. Rueda, Whilmar Vargas, Diego Alexander Tibaduiza Burgos, C. Bonilla, Jorge Ivan Sofrony Esmeral, Felipe Restrepo-Calle
{"title":"75 MW电弧镍铁炉结构健康监测系统的数据清洗方法","authors":"Jaiber Camacho‐Olarte, Julian Esteban Barrera Torres, Daniel Alfonso Garavito Jimenez, Jersson X. Leon Medina, R. C. G. Vargas, D. Cardenas, Camilo Gutierrez-Osorio, B. Rueda, Whilmar Vargas, Diego Alexander Tibaduiza Burgos, C. Bonilla, Jorge Ivan Sofrony Esmeral, Felipe Restrepo-Calle","doi":"10.3390/ecsa-7-08245","DOIUrl":null,"url":null,"abstract":"Within a model of scientific and technical cooperation between the smelting company Cerro Matoso S.A. (CMSA) and the Universidad Nacional de Colombia (UNAL), a project was developed in order to take advantage of the data that were obtained from a sensor network in a ferronickel electric arc furnace at CMSA to improve the structural health monitoring process. Through this sensor network, online data are obtained on the temperature measurement along the refractory lining of the electric furnace, as well as heat fluxes and chemical characterization of the minerals on each stage of the process. These data are stored in a local database, which stores several years of historical data with valuable information for control and analysis purposes. These data reflect the behavior of the industrial process and can be used in the development of machine learning models to predict some of the electric arc furnace operation parameters, and thus improve the decision-making process. Currently, most of the data are analyzed by the experts of the structural control department, but, due to the large amount of data, the development of analytical tools is necessary to support their work. This paper proposes a data cleaning approach for improving data quality by creating a set of rules and filters based on both expert judgment and best practices in data quality. A statistical analysis was also carried out in order to detect variables with anomalies and outliers, which do not reflect real operation parameters and belong to anomalous data that should not be considered for modelling. With the proposed process, the quality of the data was improved and abnormal data were eliminated in order to consolidate a clean data set for later use in the development of machine learning models. This work contributes on understanding data cleansing rules that must be considered in order to reflect the real behavior of the electric furnace operation for further analysis and modeling tasks.","PeriodicalId":270652,"journal":{"name":"Proceedings of 7th International Electronic Conference on Sensors and Applications","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A Data Cleaning Approach for a Structural Health Monitoring System in a 75 MW Electric Arc Ferronickel Furnace\",\"authors\":\"Jaiber Camacho‐Olarte, Julian Esteban Barrera Torres, Daniel Alfonso Garavito Jimenez, Jersson X. Leon Medina, R. C. G. Vargas, D. Cardenas, Camilo Gutierrez-Osorio, B. Rueda, Whilmar Vargas, Diego Alexander Tibaduiza Burgos, C. Bonilla, Jorge Ivan Sofrony Esmeral, Felipe Restrepo-Calle\",\"doi\":\"10.3390/ecsa-7-08245\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Within a model of scientific and technical cooperation between the smelting company Cerro Matoso S.A. (CMSA) and the Universidad Nacional de Colombia (UNAL), a project was developed in order to take advantage of the data that were obtained from a sensor network in a ferronickel electric arc furnace at CMSA to improve the structural health monitoring process. Through this sensor network, online data are obtained on the temperature measurement along the refractory lining of the electric furnace, as well as heat fluxes and chemical characterization of the minerals on each stage of the process. These data are stored in a local database, which stores several years of historical data with valuable information for control and analysis purposes. These data reflect the behavior of the industrial process and can be used in the development of machine learning models to predict some of the electric arc furnace operation parameters, and thus improve the decision-making process. Currently, most of the data are analyzed by the experts of the structural control department, but, due to the large amount of data, the development of analytical tools is necessary to support their work. This paper proposes a data cleaning approach for improving data quality by creating a set of rules and filters based on both expert judgment and best practices in data quality. A statistical analysis was also carried out in order to detect variables with anomalies and outliers, which do not reflect real operation parameters and belong to anomalous data that should not be considered for modelling. With the proposed process, the quality of the data was improved and abnormal data were eliminated in order to consolidate a clean data set for later use in the development of machine learning models. This work contributes on understanding data cleansing rules that must be considered in order to reflect the real behavior of the electric furnace operation for further analysis and modeling tasks.\",\"PeriodicalId\":270652,\"journal\":{\"name\":\"Proceedings of 7th International Electronic Conference on Sensors and Applications\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of 7th International Electronic Conference on Sensors and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/ecsa-7-08245\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 7th International Electronic Conference on Sensors and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/ecsa-7-08245","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

在Cerro Matoso S.A.冶炼公司(CMSA)和哥伦比亚国立大学(UNAL)之间的科学和技术合作模式下,制定了一个项目,以便利用从CMSA镍铁电弧炉的传感器网络获得的数据,改进结构健康监测过程。通过该传感器网络,可以在线获得电炉沿耐火衬里的温度测量数据,以及过程各阶段矿物的热流密度和化学特性。这些数据存储在本地数据库中,该数据库存储了几年的历史数据,其中包含用于控制和分析目的的有价值的信息。这些数据反映了工业过程的行为,可以用于开发机器学习模型来预测电弧炉的一些操作参数,从而改进决策过程。目前,大部分数据由结构控制部门的专家进行分析,但由于数据量大,需要开发分析工具来支持他们的工作。本文提出了一种数据清理方法,通过创建一组基于专家判断和数据质量最佳实践的规则和过滤器来提高数据质量。还进行了统计分析,以检测具有异常和离群值的变量,这些变量不能反映实际操作参数,属于不应考虑建模的异常数据。通过提出的过程,提高了数据的质量,消除了异常数据,以便巩固一个干净的数据集,以便以后在机器学习模型的开发中使用。这项工作有助于理解必须考虑的数据清理规则,以便反映电炉操作的真实行为,以进行进一步的分析和建模任务。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Data Cleaning Approach for a Structural Health Monitoring System in a 75 MW Electric Arc Ferronickel Furnace
Within a model of scientific and technical cooperation between the smelting company Cerro Matoso S.A. (CMSA) and the Universidad Nacional de Colombia (UNAL), a project was developed in order to take advantage of the data that were obtained from a sensor network in a ferronickel electric arc furnace at CMSA to improve the structural health monitoring process. Through this sensor network, online data are obtained on the temperature measurement along the refractory lining of the electric furnace, as well as heat fluxes and chemical characterization of the minerals on each stage of the process. These data are stored in a local database, which stores several years of historical data with valuable information for control and analysis purposes. These data reflect the behavior of the industrial process and can be used in the development of machine learning models to predict some of the electric arc furnace operation parameters, and thus improve the decision-making process. Currently, most of the data are analyzed by the experts of the structural control department, but, due to the large amount of data, the development of analytical tools is necessary to support their work. This paper proposes a data cleaning approach for improving data quality by creating a set of rules and filters based on both expert judgment and best practices in data quality. A statistical analysis was also carried out in order to detect variables with anomalies and outliers, which do not reflect real operation parameters and belong to anomalous data that should not be considered for modelling. With the proposed process, the quality of the data was improved and abnormal data were eliminated in order to consolidate a clean data set for later use in the development of machine learning models. This work contributes on understanding data cleansing rules that must be considered in order to reflect the real behavior of the electric furnace operation for further analysis and modeling tasks.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信