{"title":"Stabilizing and improving federated learning with highly non-iid data and client dropout","authors":"Jian Xu, Meilin Yang, Wenbo Ding, Shao-Lun Huang","doi":"10.1007/s10489-024-05956-3","DOIUrl":"10.1007/s10489-024-05956-3","url":null,"abstract":"<div><p>The label distribution skew has been shown to be a significant obstacle that limits the model performance in federated learning (FL). This challenge could be more serious when the participating clients are in unstable network circumstances and drop out frequently. Previous works have demonstrated that the classifier head is particularly sensitive to the label skew. Therefore, maintaining a balanced classifier head is of significant importance for building a good and unbiased global model. To this end, we propose a simple yet effective framework by introducing a calibrated softmax function with smoothed prior for computing the cross-entropy loss, and a prototype-based feature augmentation scheme to re-balance the local training, which provide a new perspective on tackling the label distribution skew in FL and are lightweight for edge devices and can facilitate the global model aggregation. With extensive experiments on two benchmark classification tasks of Fashion-MNIST and CIFAR-10, our numerical results demonstrate that our proposed method can consistently outperform the baselines, 2 8% of accuracy over FedAvg in the presence of severe label skew and client dropout.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 3","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142875251","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Efficient knowledge distillation using a shift window target-aware transformer","authors":"Jing Feng, Wen Eng Ong","doi":"10.1007/s10489-024-06207-1","DOIUrl":"10.1007/s10489-024-06207-1","url":null,"abstract":"<div><p>Target-aware Transformer (TaT) knowledge distillation effectively extracts information from intermediate layers but faces high computational costs for large feature maps. While the non-overlapping Patch-group distillation in TaT reduces complexity, it loses boundary information, affecting accuracy. We propose an improved Shifted Windows Target-aware Transformer (Swin TaT) knowledge distillation method, utilizing a hierarchical shift window strategy to preserve boundary information and balance computational efficiency. Our multi-scale approach optimizes Patch-group distillation with dynamic adjustment, ensuring effective local and global feature transfer. This flexible and efficient design enhances distillation performance, addressing previous limitations. The proposed Swin TaT method demonstrates exceptional performance across various architectures, with ResNet18 as the student network. It achieves 73.03% Top-1 accuracy on ImageNet1K, surpassing the SOTA by 1.06% while reducing parameters to approximately 46% less, and improves mIoU by 2.13% on COCOStuff10k.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 3","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142875249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An attribute reduction algorithm using relative decision mutual information in fuzzy neighborhood decision system","authors":"Jiucheng Xu, Shan Zhang, Miaoxian Ma, Wulin Niu, Jianghao Duan","doi":"10.1007/s10489-024-06171-w","DOIUrl":"10.1007/s10489-024-06171-w","url":null,"abstract":"<div><p>The fuzzy neighborhood rough set integrates the strengths of fuzzy rough set and neighborhood rough set, serving as a pivotal extension of the rough set theory in attribute reduction. However, this model’s widespread application is hindered by its sensitivity to data distribution and limited efficacy in assessing classification uncertainty for datasets with substantial density variations. To mitigate these challenges, this paper introduces an attribute reduction algorithm based on fuzzy neighborhood relative decision mutual information. Firstly, the classification uncertainty of samples is initially defined in terms of relative distance. Simultaneously, the similarity relationship of fuzzy neighborhoods is reformulated, thereby reducing the risk of sample misclassification through integration with variable-precision fuzzy neighborhood rough approximation. Secondly, the notion of representative sample is introduced, leading to a redefinition of fuzzy membership. Thirdly, fuzzy neighborhood relative mutual information from the information view is constructed and combined with fuzzy neighborhood relative dependency from the algebraic view to propose fuzzy neighborhood relative decision mutual information. Finally, an attribute reduction algorithm is devised based on fuzzy neighborhood relative decision mutual information. This algorithm evaluates the significance of attributes by integrating both informational and algebraic perspectives. Comparative tests on 12 public datasets are conducted to assess existing attribute approximation algorithms. The experimental results show that the proposed algorithm achieved an average classification accuracy of 91.28<span>(%)</span> with the KNN classifier and 89.86<span>(%)</span> with the CART classifier. In both classifiers, the algorithm produced an average reduced subset size of 8.54. While significantly reducing feature redundancy, the algorithm consistently maintains a high level of classification accuracy.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 3","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142875248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Short-term traffic flow prediction based on spatial–temporal attention time gated convolutional network with particle swarm optimization","authors":"Zhongxing Li, Zenan Li, Chaofeng Pan, Jian Wang","doi":"10.1007/s10489-024-06117-2","DOIUrl":"10.1007/s10489-024-06117-2","url":null,"abstract":"<div><p>Recently, the surge in vehicle ownership has led to a corresponding increase in the complexity of traffic data. Consequently, accurate traffic flow prediction has become crucial for effective traffic management. While the advancements in intelligent transportation system (ITS) and internet of things (IoT) technology have facilitated traffic flow prediction, many existing methods overlook the influence of the training process on model accuracy. Traditional approaches often fail to account for this critical aspect. Hence, a new approach to traffic flow prediction is introduced in this paper: a spatial–temporal attention time-gated convolutional network based on particle swarm optimization (PSO-STATG). This method uses the particle swarm algorithm to dynamically optimize the learning rate and epoch parameters throughout the training process. Firstly, spatial–temporal correlations are extracted through spatial map convolution and time-gated convolution, facilitated by an attention mechanism. Subsequently, the learning rate and epoch parameters are dynamically adjusted during the training phase via the particle swarm optimization algorithm. Finally, experiments are conducted with real-world datasets, and the results are compared with those from several existing methods. The experimental results indicate that the accuracy and stability of our proposed model in predicting traffic flow are superior.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 2","pages":""},"PeriodicalIF":3.4,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142875291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Gautam Kishore Shahi, Ali Sercan Basyurt, Stefan Stieglitz, Christoph Neuberger
{"title":"Agenda Formation and Prediction of Voting Tendencies for European Parliament Election using Textual, Social and Network Features","authors":"Gautam Kishore Shahi, Ali Sercan Basyurt, Stefan Stieglitz, Christoph Neuberger","doi":"10.1007/s10796-024-10568-w","DOIUrl":"https://doi.org/10.1007/s10796-024-10568-w","url":null,"abstract":"<p>As per agenda-setting theory, political agenda is concerned with the government’s agenda, including politicians and political parties. Political actors utilize various channels to set their political agenda, including social media platforms such as Twitter (now <i>X</i>). Political agenda-setting can be influenced by anonymous user-generated content following the Bright Internet. This is why speech acts, experts, users with affiliations and parties through annotated Tweets were analyzed in this study. In doing so, the agenda formation during the 2019 European Parliament Election in Germany based on the agenda-setting theory as our theoretical framework, was analyzed. A prediction model was trained to predict users’ voting tendencies based on three feature categories: social, network, and text. By combining features from all categories logistical regression leads to the best predictions matching the election results. The contribution to theory is an approach to identify agenda formation based on our novel variables. For practice, a novel approach is presented to forecast the winner of events.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"92 1","pages":""},"PeriodicalIF":5.9,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142874196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Proceedings of the IEEE: Stay Informed. Become Inspired.","authors":"","doi":"10.1109/JPROC.2024.3506203","DOIUrl":"https://doi.org/10.1109/JPROC.2024.3506203","url":null,"abstract":"","PeriodicalId":20556,"journal":{"name":"Proceedings of the IEEE","volume":"112 10","pages":"C4-C4"},"PeriodicalIF":23.2,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10812074","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142875119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yunfei Li, Lin Jiang, Bin Lei, Bo Tang, Jianyang Zhu
{"title":"Robust Real-Time Localization System via Semantic Dimensional Chains for Degraded Scenarios","authors":"Yunfei Li, Lin Jiang, Bin Lei, Bo Tang, Jianyang Zhu","doi":"10.1109/jiot.2024.3520998","DOIUrl":"https://doi.org/10.1109/jiot.2024.3520998","url":null,"abstract":"","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"25 1","pages":""},"PeriodicalIF":10.6,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142879620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zezhao Meng, Zhi Li, Xiangwang Hou, Minrui Xu, Yi Xia, Zekai Zhang, Shaoyang Song
{"title":"Enhancing Federated Learning Performance on Heterogeneous IoT Devices Using Generative Artificial Intelligence With Resource Scheduling","authors":"Zezhao Meng, Zhi Li, Xiangwang Hou, Minrui Xu, Yi Xia, Zekai Zhang, Shaoyang Song","doi":"10.1109/jiot.2024.3521017","DOIUrl":"https://doi.org/10.1109/jiot.2024.3521017","url":null,"abstract":"","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"28 1","pages":""},"PeriodicalIF":10.6,"publicationDate":"2024-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142879808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}