Maciej Badora, Marzia Sepe, M. Bielecki, A. Graziano, T. Szolc
{"title":"基于机器学习算法的小数据状态下疲劳裂纹长度预测","authors":"Maciej Badora, Marzia Sepe, M. Bielecki, A. Graziano, T. Szolc","doi":"10.17531/EIN.2021.3.19","DOIUrl":null,"url":null,"abstract":"In this paper several statistical learning algorithms are used to predict the maximal length of fatigue cracks based on a sample composed of 31 observations. The small-data regime is still a problem for many professionals, especially in the areas where failures occur rarely. The analyzed object is a high-pressure Nozzle of a heavy-duty gas turbine. Operating parameters of the engines are used for the regression analysis. The following algorithms are used in this work: multiple linear and polynomial regression, random forest, kernel-based methods, AdaBoost and extreme gradient boosting and artificial neural networks. A substantial part of the paper provides advice on the effective selection of features. The paper explains how to process the dataset in order to reduce uncertainty; thus, simplifying the analysis of the results. The proposed loss and cost functions are custom and promote solutions accurately predicting the longest cracks. The obtained results confirm that some of the algorithms can accurately predict maximal lengths of the fatigue cracks, even if the sample is small.","PeriodicalId":50549,"journal":{"name":"Eksploatacja I Niezawodnosc-Maintenance and Reliability","volume":"43 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2021-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Predicting length of fatigue cracks by means of machine learning algorithms in the small-data regime\",\"authors\":\"Maciej Badora, Marzia Sepe, M. Bielecki, A. Graziano, T. Szolc\",\"doi\":\"10.17531/EIN.2021.3.19\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper several statistical learning algorithms are used to predict the maximal length of fatigue cracks based on a sample composed of 31 observations. The small-data regime is still a problem for many professionals, especially in the areas where failures occur rarely. The analyzed object is a high-pressure Nozzle of a heavy-duty gas turbine. Operating parameters of the engines are used for the regression analysis. The following algorithms are used in this work: multiple linear and polynomial regression, random forest, kernel-based methods, AdaBoost and extreme gradient boosting and artificial neural networks. A substantial part of the paper provides advice on the effective selection of features. The paper explains how to process the dataset in order to reduce uncertainty; thus, simplifying the analysis of the results. The proposed loss and cost functions are custom and promote solutions accurately predicting the longest cracks. The obtained results confirm that some of the algorithms can accurately predict maximal lengths of the fatigue cracks, even if the sample is small.\",\"PeriodicalId\":50549,\"journal\":{\"name\":\"Eksploatacja I Niezawodnosc-Maintenance and Reliability\",\"volume\":\"43 1\",\"pages\":\"\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2021-07-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Eksploatacja I Niezawodnosc-Maintenance and Reliability\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.17531/EIN.2021.3.19\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Eksploatacja I Niezawodnosc-Maintenance and Reliability","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.17531/EIN.2021.3.19","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
Predicting length of fatigue cracks by means of machine learning algorithms in the small-data regime
In this paper several statistical learning algorithms are used to predict the maximal length of fatigue cracks based on a sample composed of 31 observations. The small-data regime is still a problem for many professionals, especially in the areas where failures occur rarely. The analyzed object is a high-pressure Nozzle of a heavy-duty gas turbine. Operating parameters of the engines are used for the regression analysis. The following algorithms are used in this work: multiple linear and polynomial regression, random forest, kernel-based methods, AdaBoost and extreme gradient boosting and artificial neural networks. A substantial part of the paper provides advice on the effective selection of features. The paper explains how to process the dataset in order to reduce uncertainty; thus, simplifying the analysis of the results. The proposed loss and cost functions are custom and promote solutions accurately predicting the longest cracks. The obtained results confirm that some of the algorithms can accurately predict maximal lengths of the fatigue cracks, even if the sample is small.
期刊介绍:
The quarterly Eksploatacja i Niezawodność – Maintenance and Reliability publishes articles containing original results of experimental research on the durabilty and reliability of technical objects. We also accept papers presenting theoretical analyses supported by physical interpretation of causes or ones that have been verified empirically. Eksploatacja i Niezawodność – Maintenance and Reliability also publishes articles on innovative modeling approaches and research methods regarding the durability and reliability of objects.