先进制造中机器学习的归一化与降维

Volume 2: 42nd Computers and Information in Engineering Conference (CIE) Pub Date : 2022-08-14 DOI:10.1115/detc2022-89921

Jida Huang, Tsz-Ho Kwok

{"title":"先进制造中机器学习的归一化与降维","authors":"Jida Huang, Tsz-Ho Kwok","doi":"10.1115/detc2022-89921","DOIUrl":null,"url":null,"abstract":"\n With the advances in sensing and communication techniques, data collection has become much easier in manufacturing processes. Machine learning (ML) is a vital tool for manufacturing data analytics to leverage the underlying informatics carried by data. However, the varieties of data formats, dimensionality, and manufacturing types hugely hinder the learning efficiency of ML methods. Data preparation is critical for exploiting the potential of ML in manufacturing problems. This paper investigates how data preparation affects the ML efficacy in manufacturing data. Specifically, we study the influences of data normalization and dimension reduction on the ML performance for various types of manufacturing problems. We conduct comparison studies of data with/without pre-processing on different manufacturing processes, such as casting, milling, and additive manufacturing. Experimental results reveal that different pre-processing methods have a distinct effect on learning efficiency. Normalization is helpful for both numerical and image data, while dimension reduction — this paper uses principal component analysis (PCA) — is not useful for low-dimensional numerical manufacturing data. Combining both normalization and PCA can significantly enhance the learning efficiency of high-dimensional data. After that, we summarize several practical guidelines for manufacturing data preparation for ML, which provide a valuable basis for future manufacturing data analysis with ML approaches.","PeriodicalId":382970,"journal":{"name":"Volume 2: 42nd Computers and Information in Engineering Conference (CIE)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Normalization and Dimension Reduction for Machine Learning in Advanced Manufacturing\",\"authors\":\"Jida Huang, Tsz-Ho Kwok\",\"doi\":\"10.1115/detc2022-89921\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n With the advances in sensing and communication techniques, data collection has become much easier in manufacturing processes. Machine learning (ML) is a vital tool for manufacturing data analytics to leverage the underlying informatics carried by data. However, the varieties of data formats, dimensionality, and manufacturing types hugely hinder the learning efficiency of ML methods. Data preparation is critical for exploiting the potential of ML in manufacturing problems. This paper investigates how data preparation affects the ML efficacy in manufacturing data. Specifically, we study the influences of data normalization and dimension reduction on the ML performance for various types of manufacturing problems. We conduct comparison studies of data with/without pre-processing on different manufacturing processes, such as casting, milling, and additive manufacturing. Experimental results reveal that different pre-processing methods have a distinct effect on learning efficiency. Normalization is helpful for both numerical and image data, while dimension reduction — this paper uses principal component analysis (PCA) — is not useful for low-dimensional numerical manufacturing data. Combining both normalization and PCA can significantly enhance the learning efficiency of high-dimensional data. After that, we summarize several practical guidelines for manufacturing data preparation for ML, which provide a valuable basis for future manufacturing data analysis with ML approaches.\",\"PeriodicalId\":382970,\"journal\":{\"name\":\"Volume 2: 42nd Computers and Information in Engineering Conference (CIE)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Volume 2: 42nd Computers and Information in Engineering Conference (CIE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1115/detc2022-89921\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Volume 2: 42nd Computers and Information in Engineering Conference (CIE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1115/detc2022-89921","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

随着传感和通信技术的进步，数据收集在制造过程中变得更加容易。机器学习(ML)是制造数据分析的重要工具，可以利用数据所携带的底层信息学。然而，数据格式、维度和制造类型的多样性极大地阻碍了机器学习方法的学习效率。数据准备对于挖掘机器学习在制造问题中的潜力至关重要。本文研究了数据准备如何影响机器学习在制造数据中的有效性。具体来说，我们研究了数据归一化和降维对各种类型制造问题的机器学习性能的影响。我们对不同制造工艺(如铸造、铣削和增材制造)经过/未经过预处理的数据进行比较研究。实验结果表明，不同的预处理方法对学习效率有不同的影响。归一化对数值和图像数据都有帮助，而降维——本文使用主成分分析(PCA)——对低维数值制造数据没有用处。将归一化与主成分分析相结合，可以显著提高高维数据的学习效率。在此之后，我们总结了机器学习制造数据准备的几个实用指南，为未来使用机器学习方法分析制造数据提供了有价值的基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Normalization and Dimension Reduction for Machine Learning in Advanced Manufacturing

With the advances in sensing and communication techniques, data collection has become much easier in manufacturing processes. Machine learning (ML) is a vital tool for manufacturing data analytics to leverage the underlying informatics carried by data. However, the varieties of data formats, dimensionality, and manufacturing types hugely hinder the learning efficiency of ML methods. Data preparation is critical for exploiting the potential of ML in manufacturing problems. This paper investigates how data preparation affects the ML efficacy in manufacturing data. Specifically, we study the influences of data normalization and dimension reduction on the ML performance for various types of manufacturing problems. We conduct comparison studies of data with/without pre-processing on different manufacturing processes, such as casting, milling, and additive manufacturing. Experimental results reveal that different pre-processing methods have a distinct effect on learning efficiency. Normalization is helpful for both numerical and image data, while dimension reduction — this paper uses principal component analysis (PCA) — is not useful for low-dimensional numerical manufacturing data. Combining both normalization and PCA can significantly enhance the learning efficiency of high-dimensional data. After that, we summarize several practical guidelines for manufacturing data preparation for ML, which provide a valuable basis for future manufacturing data analysis with ML approaches.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Volume 2: 42nd Computers and Information in Engineering Conference (CIE)

自引率

0.00%

发文量