{"title":"Physics-Informed Imputation for Data Cleaning and Pre-Processing in Robust Smart Manufacturing Systems","authors":"Dieter Joenssen , Abhinav Singh Hada , Juergen Lenz","doi":"10.1016/j.procs.2024.01.037","DOIUrl":null,"url":null,"abstract":"<div><p>Missing Data is of concern in smart manufacturing. There are various reasons why data may be missing. Buffering issues, sensor failure, or protocol issues can cause missing entries or skipped captures. This missing data leads to distorted datasets, failure to train models, and either to disregard, deletion of records or typically to vast manual re-work efforts. This paper shows types of missing data and various approaches to detect them. Replacement or fill-in approaches are presented and their limitations are highlighted. A more precise imputation method beyond the state of the art is explained in detail. The implementation of this physics-informed imputation method was performed and experiments were carried out. The results of the experiments are presented in this paper and the results discussed.</p></div>","PeriodicalId":20465,"journal":{"name":"Procedia Computer Science","volume":"232 ","pages":"Pages 377-387"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1877050924000371/pdf?md5=7b4d894dbbcfda5c894d976ee01b12e3&pid=1-s2.0-S1877050924000371-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Procedia Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1877050924000371","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Missing Data is of concern in smart manufacturing. There are various reasons why data may be missing. Buffering issues, sensor failure, or protocol issues can cause missing entries or skipped captures. This missing data leads to distorted datasets, failure to train models, and either to disregard, deletion of records or typically to vast manual re-work efforts. This paper shows types of missing data and various approaches to detect them. Replacement or fill-in approaches are presented and their limitations are highlighted. A more precise imputation method beyond the state of the art is explained in detail. The implementation of this physics-informed imputation method was performed and experiments were carried out. The results of the experiments are presented in this paper and the results discussed.