Rumana Aktar, H. Aliakbarpour, F. Bunyak, G. Seetharaman, K. Palaniappan
{"title":"Performance Evaluation of Feature Descriptors for Aerial Imagery Mosaicking","authors":"Rumana Aktar, H. Aliakbarpour, F. Bunyak, G. Seetharaman, K. Palaniappan","doi":"10.1109/AIPR.2018.8707402","DOIUrl":null,"url":null,"abstract":"Mosaicking enables efficient summary of geospatial content in an aerial video with applications in surveillance, activity detection, tracking, etc. Scene clutter, presence of distractors, parallax, illumination artifacts i.e. shadows, glare, and other complexities of aerial imaging such as large camera motion makes the registration process challenging. Robust feature detection and description is needed to overcome these challenges before registration. This study investigates the computational complexity versus performance of selected feature detectors such as Structure Tensor with NCC (ST+NCC), SURF, ASIFT within our Video Mosaicking and Summarization (VMZ) framework on VIRAT benchmark aerial video. ST+NCC and SURF is very fast but fails for few complex imagery (with occlusion) from VIRAT. ASIFT is more robust compared to ST+NCC or SURF, though extremely time consuming. We also propose an Adaptive Descriptor (combining ST+NCC and ASIFT) that is 9x faster than ASIFT with comparable robustness.","PeriodicalId":230582,"journal":{"name":"2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Applied Imagery Pattern Recognition Workshop (AIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIPR.2018.8707402","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Mosaicking enables efficient summary of geospatial content in an aerial video with applications in surveillance, activity detection, tracking, etc. Scene clutter, presence of distractors, parallax, illumination artifacts i.e. shadows, glare, and other complexities of aerial imaging such as large camera motion makes the registration process challenging. Robust feature detection and description is needed to overcome these challenges before registration. This study investigates the computational complexity versus performance of selected feature detectors such as Structure Tensor with NCC (ST+NCC), SURF, ASIFT within our Video Mosaicking and Summarization (VMZ) framework on VIRAT benchmark aerial video. ST+NCC and SURF is very fast but fails for few complex imagery (with occlusion) from VIRAT. ASIFT is more robust compared to ST+NCC or SURF, though extremely time consuming. We also propose an Adaptive Descriptor (combining ST+NCC and ASIFT) that is 9x faster than ASIFT with comparable robustness.