{"title":"SATVIO:基于立体注意的视觉惯性里程计","authors":"Raoof Doorshi;Hajira Saleem;Reza Malekian","doi":"10.1109/JSAS.2025.3601056","DOIUrl":null,"url":null,"abstract":"This study introduces a novel stereo attention-based visual inertial odometry model, namely, SATVIO, aiming to enhance odometry performance by leveraging deep learning techniques for sensor fusion. The research evaluates the SATVIO model against existing visual odometry methods using the KITTI odometry dataset, focusing on translational and rotational accuracy enhancements through innovative attention mechanisms and early fusion strategies. The proposed model integrates convolutional neural networks and long short-term memory networks to process and fuse data from stereo image inputs and inertial measurements effectively. SATVIO model particularly employs triplet attention and early fusion techniques with the aim of addressing the challenges posed by scale ambiguity and environmental changes. The results demonstrates that our proposed model outperforms traditional methods in specific configurations, thus demonstrating competitive or superior performance on key challenging sequences.","PeriodicalId":100622,"journal":{"name":"IEEE Journal of Selected Areas in Sensors","volume":"2 ","pages":"259-265"},"PeriodicalIF":0.0000,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11133748","citationCount":"0","resultStr":"{\"title\":\"SATVIO: Stereo Attention-Based Visual Inertial Odometry\",\"authors\":\"Raoof Doorshi;Hajira Saleem;Reza Malekian\",\"doi\":\"10.1109/JSAS.2025.3601056\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study introduces a novel stereo attention-based visual inertial odometry model, namely, SATVIO, aiming to enhance odometry performance by leveraging deep learning techniques for sensor fusion. The research evaluates the SATVIO model against existing visual odometry methods using the KITTI odometry dataset, focusing on translational and rotational accuracy enhancements through innovative attention mechanisms and early fusion strategies. The proposed model integrates convolutional neural networks and long short-term memory networks to process and fuse data from stereo image inputs and inertial measurements effectively. SATVIO model particularly employs triplet attention and early fusion techniques with the aim of addressing the challenges posed by scale ambiguity and environmental changes. The results demonstrates that our proposed model outperforms traditional methods in specific configurations, thus demonstrating competitive or superior performance on key challenging sequences.\",\"PeriodicalId\":100622,\"journal\":{\"name\":\"IEEE Journal of Selected Areas in Sensors\",\"volume\":\"2 \",\"pages\":\"259-265\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-08-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11133748\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Selected Areas in Sensors\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11133748/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Areas in Sensors","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11133748/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
This study introduces a novel stereo attention-based visual inertial odometry model, namely, SATVIO, aiming to enhance odometry performance by leveraging deep learning techniques for sensor fusion. The research evaluates the SATVIO model against existing visual odometry methods using the KITTI odometry dataset, focusing on translational and rotational accuracy enhancements through innovative attention mechanisms and early fusion strategies. The proposed model integrates convolutional neural networks and long short-term memory networks to process and fuse data from stereo image inputs and inertial measurements effectively. SATVIO model particularly employs triplet attention and early fusion techniques with the aim of addressing the challenges posed by scale ambiguity and environmental changes. The results demonstrates that our proposed model outperforms traditional methods in specific configurations, thus demonstrating competitive or superior performance on key challenging sequences.