Tackling data scarcity in machine learning-based CFRP drilling performance prediction through a broad learning system with virtual sample generation (BLS-VSG)

IF 14.2 1区 材料科学 Q1 ENGINEERING, MULTIDISCIPLINARY
Jia Ge , Zequan Yao , Ming Wu , José Humberto S. Almeida Jr , Yan Jin , Dan Sun
{"title":"Tackling data scarcity in machine learning-based CFRP drilling performance prediction through a broad learning system with virtual sample generation (BLS-VSG)","authors":"Jia Ge ,&nbsp;Zequan Yao ,&nbsp;Ming Wu ,&nbsp;José Humberto S. Almeida Jr ,&nbsp;Yan Jin ,&nbsp;Dan Sun","doi":"10.1016/j.compositesb.2025.112701","DOIUrl":null,"url":null,"abstract":"<div><div>Machine learning (ML)-based data-driven method has emerged as a powerful tool for predicting the manufacturing performance of carbon fibre reinforced plastic (CFRP), particularly in CFRP machining, where physics-based models are computationally expensive. However, the effectiveness of ML models are often constrained by limited datasets, due to the high cost and time required for experimental data acquisition. To address this, this paper presents the first study to apply virtual sample generation (VSG) techniques to enlarge the training dataset and mitigate data scarcity in the prediction of CFRP drilling performance. A novel hybrid ML framework integrating Broad Learning System (BLS) and VSG (BLS-VSG) is proposed to combine the capability of BLS in small dataset prediction with the enlarged dataset generated by VSG. The model has been employed to predict the drilling thrust force and delamination damage under various drilling conditions (spindle speed, feed rate, point angle). Three different VSG methods (SMOTE, MD-MTD and CVT) and the number of virtual samples were evaluated in detail. Results show that VSG can effectively enlarge the training dataset and improve the prediction performance of the ML model. Specifically, VSG reduced the mean square error (MSE) and mean absolute percentage error (MAPE) for thrust force prediction by 39.0 % and 12.9 %, respectively, compared to the benchmark without VSG. For delamination factor F<sub>da</sub> prediction, MSE and MAPE were reduced by 22.6 % and 16.5 %, respectively. The proposed BLS-VSG model outperforms other conventional ML models (BPNN, ELM, SVR and RT) for both scenarios (with/without VSG), providing a robust and data-efficient solution for CFRP drilling performance prediction.</div></div>","PeriodicalId":10660,"journal":{"name":"Composites Part B: Engineering","volume":"305 ","pages":"Article 112701"},"PeriodicalIF":14.2000,"publicationDate":"2025-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Composites Part B: Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S135983682500602X","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Machine learning (ML)-based data-driven method has emerged as a powerful tool for predicting the manufacturing performance of carbon fibre reinforced plastic (CFRP), particularly in CFRP machining, where physics-based models are computationally expensive. However, the effectiveness of ML models are often constrained by limited datasets, due to the high cost and time required for experimental data acquisition. To address this, this paper presents the first study to apply virtual sample generation (VSG) techniques to enlarge the training dataset and mitigate data scarcity in the prediction of CFRP drilling performance. A novel hybrid ML framework integrating Broad Learning System (BLS) and VSG (BLS-VSG) is proposed to combine the capability of BLS in small dataset prediction with the enlarged dataset generated by VSG. The model has been employed to predict the drilling thrust force and delamination damage under various drilling conditions (spindle speed, feed rate, point angle). Three different VSG methods (SMOTE, MD-MTD and CVT) and the number of virtual samples were evaluated in detail. Results show that VSG can effectively enlarge the training dataset and improve the prediction performance of the ML model. Specifically, VSG reduced the mean square error (MSE) and mean absolute percentage error (MAPE) for thrust force prediction by 39.0 % and 12.9 %, respectively, compared to the benchmark without VSG. For delamination factor Fda prediction, MSE and MAPE were reduced by 22.6 % and 16.5 %, respectively. The proposed BLS-VSG model outperforms other conventional ML models (BPNN, ELM, SVR and RT) for both scenarios (with/without VSG), providing a robust and data-efficient solution for CFRP drilling performance prediction.
基于虚拟样本生成(BLS-VSG)的广义学习系统解决基于机器学习的CFRP钻井性能预测中的数据稀缺性问题
基于机器学习(ML)的数据驱动方法已经成为预测碳纤维增强塑料(CFRP)制造性能的强大工具,特别是在CFRP加工中,基于物理模型的计算成本很高。然而,由于实验数据采集所需的高成本和时间,ML模型的有效性经常受到有限数据集的限制。为了解决这个问题,本文提出了第一个应用虚拟样本生成(VSG)技术来扩大训练数据集并缓解CFRP钻井性能预测中的数据稀缺性的研究。为了将广义学习系统(BLS)的小数据集预测能力与VSG生成的大数据集预测能力相结合,提出了一种将广义学习系统(BLS)与VSG相结合的混合机器学习框架(BLS-VSG)。利用该模型对不同钻孔条件(主轴转速、进给速率、进给角)下的钻孔推力和分层损伤进行了预测。对SMOTE、MD-MTD和CVT三种不同的VSG方法和虚拟样本数量进行了详细的评价。结果表明,VSG可以有效地扩大训练数据集,提高机器学习模型的预测性能。具体而言,与没有VSG的基准相比,VSG将推力预测的均方误差(MSE)和平均绝对百分比误差(MAPE)分别降低了39.0%和12.9%。对于分层因子Fda的预测,MSE和MAPE分别降低22.6%和16.5%。提出的BLS-VSG模型在两种情况下(有/没有VSG)都优于其他传统的ML模型(BPNN、ELM、SVR和RT),为CFRP钻井性能预测提供了一个强大且数据高效的解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Composites Part B: Engineering
Composites Part B: Engineering 工程技术-材料科学:复合
CiteScore
24.40
自引率
11.50%
发文量
784
审稿时长
21 days
期刊介绍: Composites Part B: Engineering is a journal that publishes impactful research of high quality on composite materials. This research is supported by fundamental mechanics and materials science and engineering approaches. The targeted research can cover a wide range of length scales, ranging from nano to micro and meso, and even to the full product and structure level. The journal specifically focuses on engineering applications that involve high performance composites. These applications can range from low volume and high cost to high volume and low cost composite development. The main goal of the journal is to provide a platform for the prompt publication of original and high quality research. The emphasis is on design, development, modeling, validation, and manufacturing of engineering details and concepts. The journal welcomes both basic research papers and proposals for review articles. Authors are encouraged to address challenges across various application areas. These areas include, but are not limited to, aerospace, automotive, and other surface transportation. The journal also covers energy-related applications, with a focus on renewable energy. Other application areas include infrastructure, off-shore and maritime projects, health care technology, and recreational products.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信