Wavelet Packet Transform and Deep Learning-based Fusion of Audio-Visual Signals: A Novel Approach for Enhancing Laser Cleaning Effect Evaluation

IF 5.3 3区 工程技术 Q1 ENGINEERING, MANUFACTURING
Haipeng Huang, Liang Li, Shiwei Liu, Bentian Hao, Dejun Ye
{"title":"Wavelet Packet Transform and Deep Learning-based Fusion of Audio-Visual Signals: A Novel Approach for Enhancing Laser Cleaning Effect Evaluation","authors":"Haipeng Huang, Liang Li, Shiwei Liu, Bentian Hao, Dejun Ye","doi":"10.1007/s40684-023-00589-2","DOIUrl":null,"url":null,"abstract":"<p>In response to the insufficient detection capability of laser paint stripping effects for a single modality and the high operational and cost requirements of existing multi-monitoring technologies, a method is proposed to integrate visual and auditory signals for evaluating laser paint stripping effects. Utilizing wavelet packet transformation for a more detailed understanding of the variations in paint-stripping sound signals, more representative energy features are extracted. The EfficientNetv2 network, optimized with an attention mechanism, further enhances the focus on crucial features. The image feature vectors are concatenated with the energy features extracted from the sound signals, forming a new and more informative feature vector for paint stripping effect discrimination. Experimental results demonstrate that the multi-feature fusion detection algorithm significantly improves the accuracy of paint stripping effect discrimination, reaching 98.7%. The 98.9% F1-Score and the smoothly converging loss curve also indicate the algorithm's effective control over category imbalance and training stability. This research is of paramount importance for improving the evaluation of laser cleaning technology effects and provides insights into multi-modal feature fusion for other relevant fields of study.</p>","PeriodicalId":14238,"journal":{"name":"International Journal of Precision Engineering and Manufacturing-Green Technology","volume":"174 1","pages":""},"PeriodicalIF":5.3000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Precision Engineering and Manufacturing-Green Technology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s40684-023-00589-2","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MANUFACTURING","Score":null,"Total":0}
引用次数: 0

Abstract

In response to the insufficient detection capability of laser paint stripping effects for a single modality and the high operational and cost requirements of existing multi-monitoring technologies, a method is proposed to integrate visual and auditory signals for evaluating laser paint stripping effects. Utilizing wavelet packet transformation for a more detailed understanding of the variations in paint-stripping sound signals, more representative energy features are extracted. The EfficientNetv2 network, optimized with an attention mechanism, further enhances the focus on crucial features. The image feature vectors are concatenated with the energy features extracted from the sound signals, forming a new and more informative feature vector for paint stripping effect discrimination. Experimental results demonstrate that the multi-feature fusion detection algorithm significantly improves the accuracy of paint stripping effect discrimination, reaching 98.7%. The 98.9% F1-Score and the smoothly converging loss curve also indicate the algorithm's effective control over category imbalance and training stability. This research is of paramount importance for improving the evaluation of laser cleaning technology effects and provides insights into multi-modal feature fusion for other relevant fields of study.

Abstract Image

基于小波包变换和深度学习的视听信号融合:增强激光清洁效果评估的新方法
针对单一模式对激光脱漆效果的检测能力不足,以及现有多重监测技术对操作和成本的高要求,提出了一种整合视觉和听觉信号以评估激光脱漆效果的方法。利用小波包变换来更详细地了解脱漆声音信号的变化,从而提取出更具代表性的能量特征。通过注意力机制优化的 EfficientNetv2 网络进一步加强了对关键特征的关注。图像特征向量与从声音信号中提取的能量特征相串联,形成一个新的、信息量更大的特征向量,用于脱漆效果判别。实验结果表明,多特征融合检测算法显著提高了脱漆效果判别的准确率,达到 98.7%。98.9% 的 F1 分数和平稳收敛的损失曲线也表明该算法能有效控制类别不平衡和训练稳定性。这项研究对改进激光清洁技术效果评估具有重要意义,并为其他相关研究领域的多模态特征融合提供了启示。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
10.30
自引率
9.50%
发文量
65
审稿时长
5.3 months
期刊介绍: Green Technology aspects of precision engineering and manufacturing are becoming ever more important in current and future technologies. New knowledge in this field will aid in the advancement of various technologies that are needed to gain industrial competitiveness. To this end IJPEM - Green Technology aims to disseminate relevant developments and applied research works of high quality to the international community through efficient and rapid publication. IJPEM - Green Technology covers novel research contributions in all aspects of "Green" precision engineering and manufacturing.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信