Optimizing machine learning yield functions using query-by-committee for support vector classification with a dynamic stopping criterion

IF 3.7 2区工程技术 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

Computational Mechanics Pub Date : 2024-02-12 DOI:10.1007/s00466-023-02440-6

Ronak Shoghi, Lukas Morand, Dirk Helm, Alexander Hartmaier

{"title":"Optimizing machine learning yield functions using query-by-committee for support vector classification with a dynamic stopping criterion","authors":"Ronak Shoghi, Lukas Morand, Dirk Helm, Alexander Hartmaier","doi":"10.1007/s00466-023-02440-6","DOIUrl":null,"url":null,"abstract":"<p>In the field of materials engineering, the accurate prediction of material behavior under various loading conditions is crucial. Machine Learning (ML) methods have emerged as promising tools for generating constitutive models straight from data, capable of describing complex material behavior in a more flexible way than classical constitutive models. Yield functions, which serve as foundation of constitutive models for plasticity, can be properly described in a data-oriented manner using ML methods. However, the quality of these descriptions heavily relies on the availability of sufficient high-quality and representative training data that needs to be generated by fundamental numerical simulations, experiments, or a combination of both. The present paper addresses the issue of data selection, by introducing an active learning approach for Support Vector Classification (SVC) and its application in training an ML yield function with suitable data. In this regard, the Query-By-Committee (QBC) algorithm was employed, guiding the selection of new training data points in regions of the feature space where a committee of models shows significant disagreement. This approach resulted in a marked reduction in the variance of model predictions throughout the active learning process. It was also shown that the rate of decrease in the variance went along with an increase in the quality of the trained model, quantified by the Matthews Correlation Coefficient (MCC). This demonstrated the effectiveness of the approach and offered us the possibility to define a dynamic stopping criterion based on the variance in the committee results.</p>","PeriodicalId":55248,"journal":{"name":"Computational Mechanics","volume":"12 1","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Mechanics","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s00466-023-02440-6","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

In the field of materials engineering, the accurate prediction of material behavior under various loading conditions is crucial. Machine Learning (ML) methods have emerged as promising tools for generating constitutive models straight from data, capable of describing complex material behavior in a more flexible way than classical constitutive models. Yield functions, which serve as foundation of constitutive models for plasticity, can be properly described in a data-oriented manner using ML methods. However, the quality of these descriptions heavily relies on the availability of sufficient high-quality and representative training data that needs to be generated by fundamental numerical simulations, experiments, or a combination of both. The present paper addresses the issue of data selection, by introducing an active learning approach for Support Vector Classification (SVC) and its application in training an ML yield function with suitable data. In this regard, the Query-By-Committee (QBC) algorithm was employed, guiding the selection of new training data points in regions of the feature space where a committee of models shows significant disagreement. This approach resulted in a marked reduction in the variance of model predictions throughout the active learning process. It was also shown that the rate of decrease in the variance went along with an increase in the quality of the trained model, quantified by the Matthews Correlation Coefficient (MCC). This demonstrated the effectiveness of the approach and offered us the possibility to define a dynamic stopping criterion based on the variance in the committee results.

Abstract Image

查看原文本刊更多论文

利用查询委员会优化机器学习收益函数，实现支持向量分类的动态停止标准

在材料工程领域，准确预测各种加载条件下的材料行为至关重要。机器学习（ML）方法已成为从数据中直接生成构成模型的有效工具，能够以比经典构成模型更灵活的方式描述复杂的材料行为。屈服函数是塑性构造模型的基础，可以使用 ML 方法以数据为导向的方式对其进行正确描述。然而，这些描述的质量在很大程度上取决于是否有足够的高质量、有代表性的训练数据，这些数据需要通过基本的数值模拟、实验或两者的结合来生成。本文针对数据选择问题，介绍了支持向量分类（SVC）的主动学习方法，并将其应用于使用合适数据训练 ML 收益函数。在这方面，采用了 "委员会查询"（QBC）算法，指导在模型委员会显示出明显分歧的特征空间区域选择新的训练数据点。这种方法显著降低了整个主动学习过程中模型预测的方差。研究还表明，方差降低的同时，训练模型的质量也在提高，这可以通过马修斯相关系数（MCC）来量化。这证明了该方法的有效性，并为我们提供了根据委员会结果的方差定义动态停止标准的可能性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computational Mechanics 物理-力学

CiteScore

7.80

自引率

12.20%

发文量

122

审稿时长

3.4 months

期刊介绍： The journal reports original research of scholarly value in computational engineering and sciences. It focuses on areas that involve and enrich the application of mechanics, mathematics and numerical methods. It covers new methods and computationally-challenging technologies. Areas covered include method development in solid, fluid mechanics and materials simulations with application to biomechanics and mechanics in medicine, multiphysics, fracture mechanics, multiscale mechanics, particle and meshfree methods. Additionally, manuscripts including simulation and method development of synthesis of material systems are encouraged. Manuscripts reporting results obtained with established methods, unless they involve challenging computations, and manuscripts that report computations using commercial software packages are not encouraged.