Thangarajah Akilan, E. Johnson, Gaurav Taluja, Japneet Sandhu, Ritika Chadha
{"title":"Multimodality Weight and Score Fusion for SLAM","authors":"Thangarajah Akilan, E. Johnson, Gaurav Taluja, Japneet Sandhu, Ritika Chadha","doi":"10.1109/CCECE47787.2020.9255714","DOIUrl":null,"url":null,"abstract":"Simultaneous Localization And Mapping (SLAM) is used to predict the trajectory by the Autonomous Navigation Robots (ANR), for instance Self-Driving Cars (SDC). It computes the trajectory through sensing the surroundings, like a visual perception of the environment. This work focuses on the performance improvements of a SLAM model using multimodal learning: (i), early fusion via layer weight enhancement of feature extractors, and (ii), late fusion via score refinement of the trajectory (pose) regressor. The comparative analysis on Apolloscape dataset shows that the proposed fusion strategies improve localization performance significantly. This work also evaluates applicability of various Deep Convolutional Neural Networks (DCNNs) for SLAM.","PeriodicalId":296506,"journal":{"name":"2020 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCECE47787.2020.9255714","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Simultaneous Localization And Mapping (SLAM) is used to predict the trajectory by the Autonomous Navigation Robots (ANR), for instance Self-Driving Cars (SDC). It computes the trajectory through sensing the surroundings, like a visual perception of the environment. This work focuses on the performance improvements of a SLAM model using multimodal learning: (i), early fusion via layer weight enhancement of feature extractors, and (ii), late fusion via score refinement of the trajectory (pose) regressor. The comparative analysis on Apolloscape dataset shows that the proposed fusion strategies improve localization performance significantly. This work also evaluates applicability of various Deep Convolutional Neural Networks (DCNNs) for SLAM.