Object Recognition with Sequential Decision Reinforcement of Deep Learning

Enes Colpan, Abdulmajid A.H.A. Mohammed, Ö. N. Gerek
Published in: 2022 30th Signal Processing and Communications Applications Conference (SIU), 2022-05-15
DOI: 10.1109/SIU55565.2022.9864744 (https://doi.org/10.1109/SIU55565.2022.9864744)
Citations: 0

Abstract

The great success of deep learning methods for object detection has made them the fundamental choice in related applications. Popular choices for multiple object detection in video sequences include convolutional neural networks such as YOLO, MobileNet-SSD and Faster R-CNN, which typically split image frames into small rectangular regions and attempt to find bounding boxes of sought-after objects. Current research on such methods mostly focuses on speeding up the implementations or improving the learning properties of the network layers. As a new approach, this work appends a simple post-processing stage to the end of such networks to reinforce decision robustness using a sequential decision process across consecutive video frames. Consecutive frames provide greater confidence in the existence of an object when a probable object was also estimated in the previous frame. Once the confidence level exceeds a predetermined threshold, objects that are difficult to detect in a single frame are accurately detected.
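The abstract does not state the update rule used in the post-processing stage, so the following is only a minimal sketch of the general idea under stated assumptions: a candidate object's per-frame detector score is folded into a running confidence, and the object is declared detected once that confidence exceeds a threshold. The class `SequentialConfidence`, the helper `first_detection_frame`, the additive update with a `carry_over` factor, and the default values are all hypothetical and are not taken from the paper. The sketch also assumes that detections have already been associated with the same candidate across frames.

```python
from dataclasses import dataclass


@dataclass
class SequentialConfidence:
    """Running confidence for one candidate object across video frames.

    The update rule and the default values below are illustrative
    assumptions, not the method or parameters of the paper.
    """

    threshold: float = 0.5   # hypothetical decision threshold
    carry_over: float = 0.5  # hypothetical share of the previous confidence that is kept
    confidence: float = 0.0  # accumulated confidence so far

    def update(self, frame_score: float) -> bool:
        """Fold in the detector score for the current frame.

        Pass 0.0 when the detector found nothing for this candidate.
        Returns True once the accumulated confidence exceeds the threshold.
        """
        # Keep part of the previous confidence, add the new evidence,
        # and clip the result to 1.0.
        self.confidence = min(1.0, self.carry_over * self.confidence + frame_score)
        return self.confidence > self.threshold


def first_detection_frame(scores, threshold=0.5, carry_over=0.5):
    """Return the index of the first frame at which the candidate is accepted,
    or None if the accumulated confidence never exceeds the threshold."""
    tracker = SequentialConfidence(threshold=threshold, carry_over=carry_over)
    for index, score in enumerate(scores):
        if tracker.update(score):
            return index
    return None


if __name__ == "__main__":
    # A weak score of 0.3 never passes a 0.5 threshold on its own,
    # but repeated weak evidence in consecutive frames does.
    print(first_detection_frame([0.3, 0.3, 0.3]))
    # A single weak score followed by empty frames decays away.
    print(first_detection_frame([0.3, 0.0, 0.0]))
```

With these assumed values, an object scoring 0.3 in each of three consecutive frames is accepted at the third frame, while the same score in one isolated frame is not. A real system would tune the threshold and carry-over factor on validation video and would need its own rule for matching boxes between frames.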