Combining Active Learning and Self-Paced Learning for Cost-Effective Process Design Intents Extraction of Process Data

IF 4.8 2区 工程技术 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Huang Rui, Zhu Shuyi, Huang Bo
{"title":"Combining Active Learning and Self-Paced Learning for Cost-Effective Process Design Intents Extraction of Process Data","authors":"Huang Rui, Zhu Shuyi, Huang Bo","doi":"10.1093/jcde/qwae027","DOIUrl":null,"url":null,"abstract":"\n With the widespread use of computer-aided technologies like CAD/CAM/CAPP in the product manufacturing process, a large amount of process data is constantly generated, and data-driven process planning has shown promising potentials for effectively reusing the process knowledge. However, a lot of labeled data are needed to train a deep learning model for effectively extracting the embedded knowledge and experiences within these process data, and the labeling of process data is quite expensive and time-consuming. This paper proposes a cost-effective process design intents extraction approach for process data by combining active learning (AL) and self-paced learning (SPL). First, the process design intents inference model based on Bi-LSTM is generated by using a few pre-labeled samples. Then, the prediction uncertainty of each unlabeled sample is calculated by using a Bayesian neural network, which can assist in the identification of high confidence samples in SPL and low confidence samples in AL. Finally, the low confidence samples with manual-labels and the high confidence samples with pseudo-labels are incorporated into the training data for retraining the process design intents inference model iteratively until the model attains optimal performance. The experiments demonstrate that our approach can substantially decrease the number of labeled samples required for model training, and the design intents in the process data could be inferred effectively with dynamically undated training data.","PeriodicalId":48611,"journal":{"name":"Journal of Computational Design and Engineering","volume":null,"pages":null},"PeriodicalIF":4.8000,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Design and Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1093/jcde/qwae027","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

With the widespread use of computer-aided technologies like CAD/CAM/CAPP in the product manufacturing process, a large amount of process data is constantly generated, and data-driven process planning has shown promising potentials for effectively reusing the process knowledge. However, a lot of labeled data are needed to train a deep learning model for effectively extracting the embedded knowledge and experiences within these process data, and the labeling of process data is quite expensive and time-consuming. This paper proposes a cost-effective process design intents extraction approach for process data by combining active learning (AL) and self-paced learning (SPL). First, the process design intents inference model based on Bi-LSTM is generated by using a few pre-labeled samples. Then, the prediction uncertainty of each unlabeled sample is calculated by using a Bayesian neural network, which can assist in the identification of high confidence samples in SPL and low confidence samples in AL. Finally, the low confidence samples with manual-labels and the high confidence samples with pseudo-labels are incorporated into the training data for retraining the process design intents inference model iteratively until the model attains optimal performance. The experiments demonstrate that our approach can substantially decrease the number of labeled samples required for model training, and the design intents in the process data could be inferred effectively with dynamically undated training data.
将主动学习与自学相结合,实现经济高效的流程设计意图 流程数据的提取
随着 CAD/CAM/CAPP 等计算机辅助技术在产品制造过程中的广泛应用,大量过程数据不断产生,数据驱动的过程规划已显示出有效重用过程知识的巨大潜力。然而,要训练深度学习模型以有效提取这些工艺数据中蕴含的知识和经验,需要大量的标注数据,而工艺数据的标注相当昂贵且耗时。本文结合主动学习(AL)和自定步调学习(SPL),提出了一种经济高效的流程数据流程设计意图提取方法。首先,使用少量预标记样本生成基于 Bi-LSTM 的流程设计意图推理模型。然后,利用贝叶斯神经网络计算每个未标记样本的预测不确定性,从而帮助 SPL 识别高置信度样本,AL 识别低置信度样本。最后,将带有人工标签的低置信度样本和带有伪标签的高置信度样本纳入训练数据,反复训练流程设计意图推理模型,直至模型达到最佳性能。实验证明,我们的方法可以大大减少模型训练所需的标注样本数量,并且可以利用动态无日期训练数据有效推断出流程数据中的设计意图。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Computational Design and Engineering
Journal of Computational Design and Engineering Computer Science-Human-Computer Interaction
CiteScore
7.70
自引率
20.40%
发文量
125
期刊介绍: Journal of Computational Design and Engineering is an international journal that aims to provide academia and industry with a venue for rapid publication of research papers reporting innovative computational methods and applications to achieve a major breakthrough, practical improvements, and bold new research directions within a wide range of design and engineering: • Theory and its progress in computational advancement for design and engineering • Development of computational framework to support large scale design and engineering • Interaction issues among human, designed artifacts, and systems • Knowledge-intensive technologies for intelligent and sustainable systems • Emerging technology and convergence of technology fields presented with convincing design examples • Educational issues for academia, practitioners, and future generation • Proposal on new research directions as well as survey and retrospectives on mature field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信