{"title":"Personalized hierarchical heterogeneous federated learning for thermal comfort prediction in smart buildings","authors":"","doi":"10.1016/j.engappai.2024.109464","DOIUrl":"10.1016/j.engappai.2024.109464","url":null,"abstract":"<div><div>Federated Learning (FL) is gaining significant traction due to its ability to provide security and privacy. In the FL paradigm, the global model is learned at the cloud through the consolidation of local model parameters instead of collecting local training data at the central node. This approach mitigates privacy leakage caused by the collection of sensitive information. However, it poses challenges to the convergence of the global model due to system and statistical heterogeneity. In this study, we propose a two-fold Personalized Hierarchical Heterogeneous FL (PHHFL) approach. It leverages a hierarchical structure to handle statistical heterogeneity and a normal distribution-based client selection to control model divergence in the FL environment. PHHFL aims to use a maximum number of local features of each client and assign each client a specific level in the hierarchy. Furthermore, to address model divergence caused by the nodes’ statistical heterogeneity, we propose a novel client selection strategy based on the performance distribution of the nodes. Experiments are conducted on thermal comfort datasets and a synthetic dataset with 12 and 10 clients, respectively. The results show that the proposed PHHFL outperforms in terms of accuracy, F1 score, and class-wise precision on both thermal comfort and synthetic datasets. 
The source code of the PHHFL model and datasets is available on GitHub.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142533157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fault diagnosis of driving gear in battery swapping system based on auditory bionics","authors":"","doi":"10.1016/j.engappai.2024.109525","DOIUrl":"10.1016/j.engappai.2024.109525","url":null,"abstract":"<div><div>Rack and pinion drives (RPD) are widely used in battery swapping systems (BSS) for electric heavy trucks (EHT). Due to continuous heavy-load, high-intensity operation, along with electric erosion, the gears in the RPD are often damaged, which causes unexpected consequences such as downtime or safety incidents. The working conditions of the RPD in BSS include uncertain noise and fluctuating, low speeds, which pose steep challenges to accurate fault diagnosis. To address these challenges, we leverage three properties of auditory perception: its resistance to interference, its low-frequency sensitivity, and its saliency mechanism. As the contribution to artificial intelligence, we propose a complete vibration signal processing scheme based on auditory bionics, including mathematical models of the auditory mechanisms. As the engineering application, the proposed scheme is employed for fault diagnosis of the RPD in BSS under its unique working conditions. First, adaptive resampling is used to smooth the speed fluctuation. Then, Gammatone filters are employed to transform vibration signals into cochleograms. Next, based on auditory stream segregation and selective attention mechanisms, effective frequency channels and salient features are extracted from the cochleograms. In addition, binaural features are extracted to improve the diagnosis accuracy. Finally, fault diagnosis is achieved based on (sectional) sparse representation and fusion. 
The effectiveness of the fault diagnosis scheme is demonstrated using a BSS prototype system.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142533291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"RMFDNet: Redundant and Missing Feature Decoupling Network for salient object detection","authors":"","doi":"10.1016/j.engappai.2024.109459","DOIUrl":"10.1016/j.engappai.2024.109459","url":null,"abstract":"<div><div>Recently, many salient object detection methods have utilized edge contours to constrain the solution space. This approach aims to reduce the omission of salient features and minimize the inclusion of non-salient features. To further leverage the potential of edge-related information, this paper proposes a Redundant and Missing Feature Decoupling Network (RMFDNet). RMFDNet primarily consists of a segment decoder, a complement decoder, a removal decoder, and a recurrent repair encoder. The complement and removal decoders are designed to directly predict the missing and redundant features within the segmentation features. These predicted features are then processed by the recurrent repair encoder to refine the segmentation features. Experimental results on multiple Red–Green–Blue (RGB) and Red–Green–Blue-Depth (RGB-D) benchmark datasets, as well as polyp segmentation datasets, demonstrate that RMFDNet significantly outperforms previous state-of-the-art methods across various evaluation metrics. The efficiency, robustness, and generalization capability of RMFDNet are thoroughly analyzed through a carefully designed ablation study. 
The code will be made available upon paper acceptance.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142533292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A novel integrated prediction method using adaptive mode decomposition, attention mechanism and deep learning for coking products prices","authors":"","doi":"10.1016/j.engappai.2024.109504","DOIUrl":"10.1016/j.engappai.2024.109504","url":null,"abstract":"<div><div>Accurate prediction of coking product prices is crucial for enhancing production efficiency, cost optimization, and profit maximization in smart coking facilities. To address the volatility caused by nonlinear factors such as raw material costs, substitutes, macroeconomic indicators, sudden events, policy changes, and market behaviors, we propose a novel integrated prediction method for coking product price prediction. This method combines Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) for signal decomposition, Bidirectional Encoder Representations from Transformers (BERT) for natural language processing, attention mechanisms (AT) to weigh feature importance, and an ensemble of Bidirectional Gated Recurrent Unit, Bidirectional Long Short-Term Memory, and Gated Recurrent Unit, abbreviated BBG, for robust feature extraction. We design a feature selection strategy to avoid data leakage and improve the predictive ability of the model, and describe a method to maintain textual data information integrity when combining data from different sources. 
Experimental results on coke and methanol datasets show that our approach retains multi-source text richness, improves predictive capability, and outperforms other state-of-the-art methods, providing an effective tool for developing smart coke plants.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142533403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Diverse policy generation for the flexible job-shop scheduling problem via deep reinforcement learning with a novel graph representation","authors":"","doi":"10.1016/j.engappai.2024.109488","DOIUrl":"10.1016/j.engappai.2024.109488","url":null,"abstract":"<div><div>In scheduling problems common in the industry and various real-world scenarios, responding in real-time to disruptive events is important. Recent methods propose the use of deep reinforcement learning (DRL) to learn policies capable of generating solutions under this constraint. However, current DRL approaches struggle with large instances, which are common in real-world scenarios. The objective of this paper is to introduce a new DRL method for solving the flexible job-shop scheduling problem, with a focus on these types of instances. The approach is based on applying heterogeneous graph neural networks to a more informative graph representation of the problem. This novel modeling of the problem enhances the policy’s ability to capture state information and improves its decision-making capacity. Additionally, we introduce two novel approaches to enhance the performance of the DRL approach: the first involves generating a diverse set of scheduling policies, while the second combines DRL with dispatching rules (DRs) constraining the action space, with a variable degree of freedom depending on the chosen policy. 
Experimental results on two public benchmarks show that our approach outperforms DRs and achieves superior results compared to three state-of-the-art DRL methods, particularly for large instances.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142533280","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A multi-criteria decision-making method based on discrete Z-numbers and Aczel-Alsina aggregation operators and its application on early diagnosis of depression","authors":"","doi":"10.1016/j.engappai.2024.109484","DOIUrl":"10.1016/j.engappai.2024.109484","url":null,"abstract":"<div><div>In mental health diagnostics, the questionnaire is an effective and cost-effective method. However, the traditional questionnaire test methods for depression and anxiety have great ambiguity. The discrete Z-numbers (DZs) provide solutions for describing and resolving complex fuzzy issues in the intelligent multi-criteria decision-making (MCDM) process. However, the present MCDM techniques are not suited to large-scale datasets due to their extremely high computational cost. Additionally, these techniques are less stable and flexible. To address the above issues, a novel MCDM method is introduced, which is based on the DZs theory and the Aczel-Alsina (AA) aggregation operator (AO) for large-scale datasets. First, centroid points are calculated for DZs, and a series of novel AOs are introduced. Second, a score function with a parameter is introduced to balance the influence between the possibility restriction and the fuzzy restriction of DZs. Third, a new MCDM method under DZs is presented based on the proposed AA AOs and score function. Finally, to support the early diagnosis of depression and anxiety, we apply our method to the real-life online Depression, Anxiety, and Stress Scale (DASS), which can be transformed into DZs by our proposed preprocessing method. 
According to experimental results, our method is applicable to large-scale datasets and has much lower complexity as well as higher flexibility and stability.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142533285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A geometric model with stochastic error for abnormal motion detection of portal crane bucket grab","authors":"","doi":"10.1016/j.engappai.2024.109481","DOIUrl":"10.1016/j.engappai.2024.109481","url":null,"abstract":"<div><div>Abnormal swing angle detection of bucket grabs is crucial for efficient harbor operations. In this study, we develop a practically convenient swing angle detection method for crane operation, requiring only a single standard surveillance camera at the fly-jib head, without the need for sophisticated sensors or markers on the payload. Specifically, our algorithm takes the video images from the camera as input. Next, a fine-tuned ‘the fifth version of the You Only Look Once algorithm’ (YOLOv5) model is used to automatically detect the position of the bucket grab on the image plane. Subsequently, a novel geometric model is constructed, which takes the pixel position of the bucket grab, the steel rope length provided by the Programmable Logic Controller (PLC) system, and the optical lens information of the camera into consideration. The key parameters of this geometric model are statistically estimated by a novel iterative algorithm. Once the key parameters are estimated, the algorithm can automatically detect swing angles from video streams. Being analytically simple, the computation of our algorithm is fast, as it takes about 0.01 s to process one single image generated by the surveillance camera. Therefore, we are able to obtain an accurate and fast estimation of the swing angle of an operating crane in real-time applications. Simulation studies are conducted to validate the model and algorithm. 
Real video examples from Qingdao Seaport under various weather conditions are analyzed to demonstrate its practical performance.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142533284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive multimodal control of trans-media vehicle based on deep reinforcement learning","authors":"","doi":"10.1016/j.engappai.2024.109524","DOIUrl":"10.1016/j.engappai.2024.109524","url":null,"abstract":"<div><div>To solve the problem that the control system is prone to instability due to sudden changes in physical characteristics, strong interference, and nonlinearity during the multimodal movement of a trans-media vehicle, an adaptive control method combining the Deep Deterministic Policy Gradient (DDPG) and a traditional Proportional-Integral-Derivative (PID) controller is proposed in this paper. In this approach, the upper-level DDPG controller continuously monitors the vehicle's state and environmental conditions, dynamically adjusting the PID parameters in real-time. The lower-level PID controller then utilizes these updated parameters to modulate the output thrust of the vehicle's motors, thereby achieving excellent control over the vehicle's entire movement. First, according to the hydrodynamic analysis, the kinematic and dynamic mathematical model of the self-designed trans-media vehicle is constructed. This model includes the multi-stage motion modal process of aerial flight, underwater navigation, and cross-media motion, which is suitable for the simulation and verification of the control method. Then, an adaptive controller called RL-PID combining DDPG and PID is built, so that PID can adjust parameters in real-time according to the changes in the external environment. Finally, after theoretical stability proof, a comparison study is performed across three approaches, namely the novel RL-PID, Fuzzy PID, and PID. 
The experimental results illustrate the superiority of the proposed approach over the competing ones and the generalization of the proposed approach under different interference.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142533401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Weak saliency ensemble network for person Re-identification using infrared light images","authors":"","doi":"10.1016/j.engappai.2024.109517","DOIUrl":"10.1016/j.engappai.2024.109517","url":null,"abstract":"<div><div>In recent years, person re-identification (re-id) has primarily been studied using visible light (VL) images. However, the challenges of employing VL images in nighttime environments have prompted research into using infrared light (IR) images. Yet, the utilization of both VL and IR images in person re-id has resulted in increased computational cost and processing time in multi-modality systems, leading to studies focusing solely on IR images. Nevertheless, IR images, lacking color and texture information, generally yield lower recognition performance in existing person re-id studies. In addition, previous studies have shown that person re-id performance suffers in the presence of complex background noise. To tackle these challenges, this study proposes a new weak saliency ensemble network (WSE-Net) for person re-id using IR images. WSE-Net incorporates a channel reduction of feature (CRF) method to reduce computational cost in the ensemble network, a technique for converting input images into a group of patch images and feeding them into the ensemble model to enhance the reduced feature information, and a grouped convolution ensemble network (GCE-Net) that enables the fusion of features extracted from original and attention-guided ensemble models.</div><div>The performance of person re-id using WSE-Net was evaluated on the Dongguk body-based person recognition database version 1 (DBPerson-Recog-DB1) and the Sun Yat-sen university multiple modality re-identification version 1 (SYSU-MM01). Experimental results demonstrated that on DBPerson-Recog-DB1, WSE-Net achieved 93.65% in rank 1, 95.28% in mean average precision (mAP), and 93.52% in the harmonic mean of precision and recall. 
Additionally, on SYSU-MM01, WSE-Net achieved 86.85% in rank 1, 44.58% in mAP, and 40.06% in the harmonic mean of precision and recall. Furthermore, the accuracy of WSE-Net on both datasets surpassed that of state-of-the-art methods.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142533397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Bernstein-based oppositional-multiple learning and differential enhanced exponential distribution optimizer for real-world optimization problems","authors":"","doi":"10.1016/j.engappai.2024.109370","DOIUrl":"10.1016/j.engappai.2024.109370","url":null,"abstract":"<div><div>Meta-heuristic algorithms play an essential role in solving real-world optimization problems. However, their performance is limited by the complexity and variability of the problems. Hence, various efficient algorithms are being actively explored. The exponential distribution optimizer (EDO), having attracted attention for its efficient search performance, has been extended to several applications. However, it suffers from falling into local optima and weak exploitation. Meanwhile, it cannot be directly applied to solve binary optimization problems. To address these challenges, this paper proposes an enhanced EDO called BOMLDEDO. The Bernstein-assisted oppositional-multiple learning strategy is proposed to avoid falling into local optimality. The Bernstein-based adaptive differential strategy is developed to improve exploitation capability. Moreover, by introducing a transfer function, repair method, and binary-to-real operation, BOMLDEDO is extended to a binary version. The IEEE (Institute of Electrical and Electronics Engineers) CEC (Congress on Evolutionary Computation) test functions and engineering problems are used to evaluate BOMLDEDO's optimization performance for continuous problems. Compared to its competitors, BOMLDEDO ranks first on more than 8 out of 10 IEEE CEC 2020 functions and more than 10 out of 12 IEEE CEC 2022 functions. Meanwhile, it achieves the global optimum in 91% of engineering problems. Furthermore, the 0–1 knapsack problems are applied to verify BOMLDEDO's binary optimization capabilities, and the results show that BOMLDEDO is successfully utilized in 14 knapsack instances. 
The above results demonstrate that incorporating multiple strategies helps improve the performance of BOMLDEDO, making it more reliable and applicable in solving continuous optimization problems and 0–1 knapsack problems.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5,"publicationDate":"2024-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142534333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}