{"title":"A keypoint-based object detection method with attention mechanism and feature fusion","authors":"Hui Wang, Tangwen Yang","doi":"10.1109/CAC51589.2020.9326802","DOIUrl":null,"url":null,"abstract":"Recently, there is a new object detection framework that does not require anchor boxes, which refers to the realization of object detection tasks by detecting key points. CenterNet identifies an object with single keypoint, namely the center point of its bounding box. It finds other attributes at the same time through key point estimation, such as the size and the orientation of the object. In this work, a global attention module is introduced to the backbone called Hourglass to enhance feature extraction with the global context information. A multilevel fusion method is also added to the Hourglass to integrate the feature maps of different levels, and further improve the detection capability. Combining the two methods, the new network achieves 46.1% AP with multi-scale testing on MS COCO.","PeriodicalId":430085,"journal":{"name":"2020 Chinese Automation Congress (CAC)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Chinese Automation Congress (CAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CAC51589.2020.9326802","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, there is a new object detection framework that does not require anchor boxes, which refers to the realization of object detection tasks by detecting key points. CenterNet identifies an object with single keypoint, namely the center point of its bounding box. It finds other attributes at the same time through key point estimation, such as the size and the orientation of the object. In this work, a global attention module is introduced to the backbone called Hourglass to enhance feature extraction with the global context information. A multilevel fusion method is also added to the Hourglass to integrate the feature maps of different levels, and further improve the detection capability. Combining the two methods, the new network achieves 46.1% AP with multi-scale testing on MS COCO.