{"title":"Improved coordinate attention network for classification of dangerous driving behavior","authors":"Wen Ni , Lufeng Bai","doi":"10.1016/j.fraope.2025.100219","DOIUrl":null,"url":null,"abstract":"<div><div>With the rise of traffic accidents caused by unsafe driving behaviors, the accurate classification of these behaviors has become a pressing issue in intelligent transportation systems. Traditional methods such as AlexNet and VGG, while effective for general image recognition tasks, fail to capture the complex and subtle features necessary for recognizing dangerous driving behaviors. To address this, we propose an improved residual network model, SC-ResNet, which integrates a coordinate attention mechanism and SIFT (Scale-Invariant Feature Transform) feature fusion to enhance classification accuracy under varying conditions including rotation, scale, and illumination changes. Furthermore, we introduce a multi-scale feature pyramid network and a novel joint loss function to better handle the multi-class classification imbalance problem. Experimental results show that our model outperforms traditional networks by <span><math><mrow><mn>0</mn><mo>.</mo><mn>6</mn><mtext>%</mtext></mrow></math></span> to <span><math><mrow><mn>4</mn><mo>.</mo><mn>7</mn><mtext>%</mtext></mrow></math></span> in classification accuracy. Future research will focus on improving model generalization and computational efficiency for real-time applications.</div></div>","PeriodicalId":100554,"journal":{"name":"Franklin Open","volume":"10 ","pages":"Article 100219"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Franklin Open","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S277318632500009X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
With the rise of traffic accidents caused by unsafe driving behaviors, the accurate classification of these behaviors has become a pressing issue in intelligent transportation systems. Traditional methods such as AlexNet and VGG, while effective for general image recognition tasks, fail to capture the complex and subtle features necessary for recognizing dangerous driving behaviors. To address this, we propose an improved residual network model, SC-ResNet, which integrates a coordinate attention mechanism and SIFT (Scale-Invariant Feature Transform) feature fusion to enhance classification accuracy under varying conditions including rotation, scale, and illumination changes. Furthermore, we introduce a multi-scale feature pyramid network and a novel joint loss function to better handle the multi-class classification imbalance problem. Experimental results show that our model outperforms traditional networks by to in classification accuracy. Future research will focus on improving model generalization and computational efficiency for real-time applications.