Enes Colpan, Abdulmajid A.H.A. Mohammed, Ö. N. Gerek
{"title":"基于深度学习序列决策强化的目标识别","authors":"Enes Colpan, Abdulmajid A.H.A. Mohammed, Ö. N. Gerek","doi":"10.1109/SIU55565.2022.9864744","DOIUrl":null,"url":null,"abstract":"The great success of deep learning methods for object detection rendered such methods the fundamental choice in related applications. Popular choices for multiple object detection in video sequences include convolutional neural networks, such as YOLO, MobileNet-SSD and Faster R-CNN, which typically split image frames to small rectangular regions and attempts to find bounding boxes of sought–after objects. Current research of such methods mostly focus on speeding–up the implementations or improving the network layers’ learning properties. As a new approach, this work appends a simple post processing stage at the end of such networks to reinforce decision robustness using a sequential decision process through sequential video frames. The sequential frames provide a better confidence on the existence of an object, when a probable object was also estimated in the previous frame. Once the confidence level overshoots a predetermined threshold, objects that are difficult to be detected in a single frame get accurately detected.","PeriodicalId":115446,"journal":{"name":"2022 30th Signal Processing and Communications Applications Conference (SIU)","volume":"2021 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Object Recognition with Sequential Decision Reinforcement of Deep Learning\",\"authors\":\"Enes Colpan, Abdulmajid A.H.A. Mohammed, Ö. N. Gerek\",\"doi\":\"10.1109/SIU55565.2022.9864744\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The great success of deep learning methods for object detection rendered such methods the fundamental choice in related applications. Popular choices for multiple object detection in video sequences include convolutional neural networks, such as YOLO, MobileNet-SSD and Faster R-CNN, which typically split image frames to small rectangular regions and attempts to find bounding boxes of sought–after objects. Current research of such methods mostly focus on speeding–up the implementations or improving the network layers’ learning properties. As a new approach, this work appends a simple post processing stage at the end of such networks to reinforce decision robustness using a sequential decision process through sequential video frames. The sequential frames provide a better confidence on the existence of an object, when a probable object was also estimated in the previous frame. Once the confidence level overshoots a predetermined threshold, objects that are difficult to be detected in a single frame get accurately detected.\",\"PeriodicalId\":115446,\"journal\":{\"name\":\"2022 30th Signal Processing and Communications Applications Conference (SIU)\",\"volume\":\"2021 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 30th Signal Processing and Communications Applications Conference (SIU)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SIU55565.2022.9864744\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 30th Signal Processing and Communications Applications Conference (SIU)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SIU55565.2022.9864744","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Object Recognition with Sequential Decision Reinforcement of Deep Learning
The great success of deep learning methods for object detection rendered such methods the fundamental choice in related applications. Popular choices for multiple object detection in video sequences include convolutional neural networks, such as YOLO, MobileNet-SSD and Faster R-CNN, which typically split image frames to small rectangular regions and attempts to find bounding boxes of sought–after objects. Current research of such methods mostly focus on speeding–up the implementations or improving the network layers’ learning properties. As a new approach, this work appends a simple post processing stage at the end of such networks to reinforce decision robustness using a sequential decision process through sequential video frames. The sequential frames provide a better confidence on the existence of an object, when a probable object was also estimated in the previous frame. Once the confidence level overshoots a predetermined threshold, objects that are difficult to be detected in a single frame get accurately detected.