Object Recognition with Sequential Decision Reinforcement of Deep Learning

2022 30th Signal Processing and Communications Applications Conference (SIU). Pub Date: 2022-05-15. DOI: 10.1109/SIU55565.2022.9864744

Enes Colpan, Abdulmajid A.H.A. Mohammed, Ö. N. Gerek


Citations: 0

Abstract

The great success of deep learning methods for object detection has made them the default choice in related applications. Popular choices for multiple object detection in video sequences include convolutional neural networks such as YOLO, MobileNet-SSD and Faster R-CNN, which typically split image frames into small rectangular regions and attempt to find bounding boxes of the sought-after objects. Current research on such methods mostly focuses on speeding up the implementations or improving the learning properties of the network layers. As a new approach, this work appends a simple post-processing stage at the end of such networks to reinforce decision robustness using a sequential decision process over consecutive video frames. Consecutive frames provide higher confidence in the existence of an object when a probable object was also estimated in the previous frame. Once the confidence level exceeds a predetermined threshold, objects that are difficult to detect in a single frame are accurately detected.
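The post-processing idea described in the abstract can be illustrated with a minimal Python sketch. The abstract does not state the actual update rule, so everything here is an assumption made for illustration: candidates are matched across frames by label and bounding-box overlap (IoU), confidence is fused with a noisy-OR rule, and the names `Candidate`, `reinforce`, `accepted`, `iou_min` and `threshold` are hypothetical. The raw detections stand in for the output of any detector such as YOLO.

```python
from dataclasses import dataclass
from typing import List, Tuple

Box = Tuple[float, float, float, float]   # (x1, y1, x2, y2)
Detection = Tuple[str, Box, float]        # (label, box, raw detector score)


@dataclass
class Candidate:
    label: str
    box: Box
    confidence: float  # accumulated over frames, not the raw score


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def reinforce(previous: List[Candidate], detections: List[Detection],
              iou_min: float = 0.5) -> List[Candidate]:
    """Fuse the current frame's detections with the previous frame's candidates.

    ASSUMPTION: noisy-OR fusion, c = 1 - (1 - c_prev) * (1 - score).
    The paper's actual rule is not given in the abstract.
    """
    current = []
    for label, box, score in detections:
        confidence = score  # no history: start from the raw score
        matches = [p for p in previous
                   if p.label == label and iou(p.box, box) >= iou_min]
        if matches:
            # Reinforce using the best-overlapping candidate from the previous frame.
            best = max(matches, key=lambda p: iou(p.box, box))
            confidence = 1.0 - (1.0 - best.confidence) * (1.0 - score)
        current.append(Candidate(label, box, confidence))
    return current


def accepted(candidates: List[Candidate], threshold: float = 0.6) -> List[Candidate]:
    """Candidates whose accumulated confidence exceeds the threshold."""
    return [c for c in candidates if c.confidence > threshold]


if __name__ == "__main__":
    # Two consecutive frames, each with a weak detection of the same object.
    frames = [
        [("car", (10.0, 10.0, 50.0, 50.0), 0.4)],
        [("car", (12.0, 11.0, 52.0, 51.0), 0.4)],
    ]
    state: List[Candidate] = []
    for index, detections in enumerate(frames, start=1):
        state = reinforce(state, detections)
        print(index, [(c.label, round(c.confidence, 2)) for c in accepted(state)])
```

In this sketch, a detection scoring 0.4 stays below the 0.6 threshold in a single frame but passes it once a matching detection appears in the next frame. Candidates that find no match simply restart from their raw score, which is one simple design choice among many.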

