Hao Fang , Huanyu Liu , Jiazheng Wen , Zhonglin Yang , Junbao Li , Qi Han
{"title":"Automatic visual enhancement of PTZ camera based on reinforcement learning","authors":"Hao Fang , Huanyu Liu , Jiazheng Wen , Zhonglin Yang , Junbao Li , Qi Han","doi":"10.1016/j.neucom.2025.129531","DOIUrl":null,"url":null,"abstract":"<div><div>Video surveillance systems have become indispensable for enhancing border security and effectively addressing threats such as illegal border crossings. Object detection plays a crucial role in real-time monitoring and event response in these systems. However, challenges arise in border scenarios, where targets exhibit small sizes, low resolutions, and limited extractable features, resulting in lower detection confidence. To overcome these limitations, we propose an advanced pan–tilt–zoom camera control method that does not require intrinsic camera parameters. The objective of this study is to accomplish visual enhancement tasks for low-confidence targets. The proposed method employs deep reinforcement learning techniques, integrating both discrete and continuous action spaces to enhance the generalization capability of agent decisions across diverse scenarios, thereby achieving optimal target monitoring. In addition, the introduction of the cutout feature fusion filter enables the agent to focus equally on each target in multitarget scenarios. In our experiments, we compared the proposed method with other approaches. The results demonstrate the superiority of the proposed method in various scenarios and object detection algorithms.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"626 ","pages":"Article 129531"},"PeriodicalIF":5.5000,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225002036","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Video surveillance systems have become indispensable for enhancing border security and effectively addressing threats such as illegal border crossings. Object detection plays a crucial role in real-time monitoring and event response in these systems. However, challenges arise in border scenarios, where targets exhibit small sizes, low resolutions, and limited extractable features, resulting in lower detection confidence. To overcome these limitations, we propose an advanced pan–tilt–zoom camera control method that does not require intrinsic camera parameters. The objective of this study is to accomplish visual enhancement tasks for low-confidence targets. The proposed method employs deep reinforcement learning techniques, integrating both discrete and continuous action spaces to enhance the generalization capability of agent decisions across diverse scenarios, thereby achieving optimal target monitoring. In addition, the introduction of the cutout feature fusion filter enables the agent to focus equally on each target in multitarget scenarios. In our experiments, we compared the proposed method with other approaches. The results demonstrate the superiority of the proposed method in various scenarios and object detection algorithms.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.