{"title":"基于图像感知哈希编码的无监督硬例提取","authors":"Jie Bai, Lianqing Zheng, Sihan Chen, Libo Huang","doi":"10.1145/3448734.3450910","DOIUrl":null,"url":null,"abstract":"The camera-based object detection algorithm is essential in autonomous driving. If the object is not detected when the vehicle is driving on the highway, it will cause a severe safety hazard. To evaluate and improve the object detection model, we propose a hard case extraction algorithm based on image perceptual hash encoding. We encode the object regions of each frame and then match them in adjacent frames. We optimize the search algorithm to achieve fast matching, improving efficiency by about ten times while ensuring accuracy. Then, we extract hard frames from a large number of unlabeled video frames, and the experimental results show that the accuracy is 92%. It is of great significance to evaluate and improve the object detection model and expand effective data.","PeriodicalId":105999,"journal":{"name":"The 2nd International Conference on Computing and Data Science","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Unsupervised Hard Case Extraction Based on Image Perceptual Hash Encoding\",\"authors\":\"Jie Bai, Lianqing Zheng, Sihan Chen, Libo Huang\",\"doi\":\"10.1145/3448734.3450910\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The camera-based object detection algorithm is essential in autonomous driving. If the object is not detected when the vehicle is driving on the highway, it will cause a severe safety hazard. To evaluate and improve the object detection model, we propose a hard case extraction algorithm based on image perceptual hash encoding. We encode the object regions of each frame and then match them in adjacent frames. We optimize the search algorithm to achieve fast matching, improving efficiency by about ten times while ensuring accuracy. Then, we extract hard frames from a large number of unlabeled video frames, and the experimental results show that the accuracy is 92%. It is of great significance to evaluate and improve the object detection model and expand effective data.\",\"PeriodicalId\":105999,\"journal\":{\"name\":\"The 2nd International Conference on Computing and Data Science\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The 2nd International Conference on Computing and Data Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3448734.3450910\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 2nd International Conference on Computing and Data Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3448734.3450910","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Unsupervised Hard Case Extraction Based on Image Perceptual Hash Encoding
The camera-based object detection algorithm is essential in autonomous driving. If the object is not detected when the vehicle is driving on the highway, it will cause a severe safety hazard. To evaluate and improve the object detection model, we propose a hard case extraction algorithm based on image perceptual hash encoding. We encode the object regions of each frame and then match them in adjacent frames. We optimize the search algorithm to achieve fast matching, improving efficiency by about ten times while ensuring accuracy. Then, we extract hard frames from a large number of unlabeled video frames, and the experimental results show that the accuracy is 92%. It is of great significance to evaluate and improve the object detection model and expand effective data.