Sen Mu , Guoqiang Fu , Yue Zheng , Xi Wang , Caijiang Lu , Jianzhong Fu
{"title":"复杂变速条件下主轴热误差联合特征分析的卷积神经网络-注意-门递归单元-注意混合框架","authors":"Sen Mu , Guoqiang Fu , Yue Zheng , Xi Wang , Caijiang Lu , Jianzhong Fu","doi":"10.1016/j.engappai.2025.111033","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning-based spindle thermal error modeling and compensation methods can effectively enhance manufacturing precision. The complex variable working conditions and thermal hysteresis effects in machine tool machining bring significant challenges for high-precision thermal error modeling. To address this issue, a hybrid structure network based on the feature extraction capability of Convolutional Neural Network (CNN) and the thermal hysteresis effect resolution capability of deep Gate Recurrent Unit (GRU) is established. A dual-layer attention mechanism is introduced to enhance spatial features and temporal features for improving the model's robustness and accuracy. First, complex variable working conditions result in complexity and nonlinearity of data. CNN is employed to extract spatial features due to its powerful feature extraction capability. A self-attention mechanism is introduced after CNN block to further filter important features. Due to the influence of thermal hysteresis effects, a deep GRU block is established to extract temporal features. A channel attention mechanism is introduced as the final layer of the network to achieve feature selection across different temperature channels. Second, features extracted by two attention mechanism layers are visualized and analyzed using t-distributed stochastic neighbor embedding (t-SNE) algorithm to explain the effectiveness of the dual-layer attention mechanism structure. The probability density distribution of the predicted results is calculated by kernel density estimation. Model performance is analyzed from the perspective of data distribution. Finally, the proposed model is compared with advanced methods under complex working conditions on two machine tools. The effectiveness of the proposed model is further validated through actual cutting compensation.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"156 ","pages":"Article 111033"},"PeriodicalIF":8.0000,"publicationDate":"2025-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Convolutional neural network-attention-gate recurrent unit-attention hybrid framework for spindle thermal error modeling with joint feature analysis under complex variable speed conditions\",\"authors\":\"Sen Mu , Guoqiang Fu , Yue Zheng , Xi Wang , Caijiang Lu , Jianzhong Fu\",\"doi\":\"10.1016/j.engappai.2025.111033\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Deep learning-based spindle thermal error modeling and compensation methods can effectively enhance manufacturing precision. The complex variable working conditions and thermal hysteresis effects in machine tool machining bring significant challenges for high-precision thermal error modeling. To address this issue, a hybrid structure network based on the feature extraction capability of Convolutional Neural Network (CNN) and the thermal hysteresis effect resolution capability of deep Gate Recurrent Unit (GRU) is established. A dual-layer attention mechanism is introduced to enhance spatial features and temporal features for improving the model's robustness and accuracy. First, complex variable working conditions result in complexity and nonlinearity of data. CNN is employed to extract spatial features due to its powerful feature extraction capability. A self-attention mechanism is introduced after CNN block to further filter important features. Due to the influence of thermal hysteresis effects, a deep GRU block is established to extract temporal features. A channel attention mechanism is introduced as the final layer of the network to achieve feature selection across different temperature channels. Second, features extracted by two attention mechanism layers are visualized and analyzed using t-distributed stochastic neighbor embedding (t-SNE) algorithm to explain the effectiveness of the dual-layer attention mechanism structure. The probability density distribution of the predicted results is calculated by kernel density estimation. Model performance is analyzed from the perspective of data distribution. Finally, the proposed model is compared with advanced methods under complex working conditions on two machine tools. The effectiveness of the proposed model is further validated through actual cutting compensation.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"156 \",\"pages\":\"Article 111033\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-06-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625010334\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625010334","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Convolutional neural network-attention-gate recurrent unit-attention hybrid framework for spindle thermal error modeling with joint feature analysis under complex variable speed conditions
Deep learning-based spindle thermal error modeling and compensation methods can effectively enhance manufacturing precision. The complex variable working conditions and thermal hysteresis effects in machine tool machining bring significant challenges for high-precision thermal error modeling. To address this issue, a hybrid structure network based on the feature extraction capability of Convolutional Neural Network (CNN) and the thermal hysteresis effect resolution capability of deep Gate Recurrent Unit (GRU) is established. A dual-layer attention mechanism is introduced to enhance spatial features and temporal features for improving the model's robustness and accuracy. First, complex variable working conditions result in complexity and nonlinearity of data. CNN is employed to extract spatial features due to its powerful feature extraction capability. A self-attention mechanism is introduced after CNN block to further filter important features. Due to the influence of thermal hysteresis effects, a deep GRU block is established to extract temporal features. A channel attention mechanism is introduced as the final layer of the network to achieve feature selection across different temperature channels. Second, features extracted by two attention mechanism layers are visualized and analyzed using t-distributed stochastic neighbor embedding (t-SNE) algorithm to explain the effectiveness of the dual-layer attention mechanism structure. The probability density distribution of the predicted results is calculated by kernel density estimation. Model performance is analyzed from the perspective of data distribution. Finally, the proposed model is compared with advanced methods under complex working conditions on two machine tools. The effectiveness of the proposed model is further validated through actual cutting compensation.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.