Active Object Detection Using a Novel Network and Partial Prior Information

IF 3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems Pub Date : 2025-07-07 DOI:10.1111/exsy.70095

Jianyu Wang, Feng Zhu, Qun Wang, Pengfei Zhao

{"title":"Active Object Detection Using a Novel Network and Partial Prior Information","authors":"Jianyu Wang, Feng Zhu, Qun Wang, Pengfei Zhao","doi":"10.1111/exsy.70095","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Active object detection (AOD) enables a system to actively adjust camera parameters or plan the next viewpoint to improve detection accuracy when the current visual input is insufficient. However, most existing AOD methods assume that the target object is visible from the initial viewpoint, which is often unrealistic and reduces task efficiency. To address this limitation, we propose a novel AOD framework that leverages partial prior information to enhance detection performance and task efficiency. Specifically, we construct an extensible prior information library that describes large and easily identifiable adjacent objects (Adj-objects) that are spatially related to the target. This allows the system to initiate AOD based on the presence of an Adj-object, even when the target is initially out of view. Our approach incorporates a duelling deep Q-learning network (Duelling-DQN) with a newly designed reward function to effectively utilise prior information. Additionally, we introduce a viewpoint storage scheme to support fast retrieval and transition between viewpoints. We evaluate the proposed method on the Active Vision Dataset (AVD) and compare it with several state-of-the-art (SOTA) approaches. The experimental results show that our method achieves a superior average success rate of 81.3%, demonstrating its effectiveness in overcoming the initial state limitations of traditional AOD tasks.</p>\n </div>","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"42 8","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/exsy.70095","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Active object detection (AOD) enables a system to actively adjust camera parameters or plan the next viewpoint to improve detection accuracy when the current visual input is insufficient. However, most existing AOD methods assume that the target object is visible from the initial viewpoint, which is often unrealistic and reduces task efficiency. To address this limitation, we propose a novel AOD framework that leverages partial prior information to enhance detection performance and task efficiency. Specifically, we construct an extensible prior information library that describes large and easily identifiable adjacent objects (Adj-objects) that are spatially related to the target. This allows the system to initiate AOD based on the presence of an Adj-object, even when the target is initially out of view. Our approach incorporates a duelling deep Q-learning network (Duelling-DQN) with a newly designed reward function to effectively utilise prior information. Additionally, we introduce a viewpoint storage scheme to support fast retrieval and transition between viewpoints. We evaluate the proposed method on the Active Vision Dataset (AVD) and compare it with several state-of-the-art (SOTA) approaches. The experimental results show that our method achieves a superior average success rate of 81.3%, demonstrating its effectiveness in overcoming the initial state limitations of traditional AOD tasks.

查看原文本刊更多论文

基于新网络和部分先验信息的主动目标检测

主动目标检测（AOD）使系统能够在当前视觉输入不足时主动调整相机参数或计划下一个视点，以提高检测精度。然而，现有的大多数AOD方法都假设目标对象从初始视点是可见的，这往往是不现实的，并且降低了任务效率。为了解决这一限制，我们提出了一种新的AOD框架，利用部分先验信息来提高检测性能和任务效率。具体而言，我们构建了一个可扩展的先验信息库，该库描述了与目标在空间上相关的大型且易于识别的相邻对象（adjo -objects）。这使得系统可以根据Adj-object的存在启动AOD，即使目标最初在视野之外。我们的方法结合了一个duelling深度q -学习网络（duelling - dqn）和一个新设计的奖励函数来有效地利用先验信息。此外，我们还引入了一种视点存储方案，以支持视点之间的快速检索和转换。我们在主动视觉数据集（AVD）上评估了所提出的方法，并将其与几种最先进的（SOTA）方法进行了比较。实验结果表明，该方法达到了81.3%的平均成功率，有效克服了传统AOD任务的初始状态限制。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Expert Systems 工程技术-计算机：理论方法

CiteScore

7.40

自引率

6.10%

发文量

266

审稿时长

24 months

期刊介绍： Expert Systems: The Journal of Knowledge Engineering publishes papers dealing with all aspects of knowledge engineering, including individual methods and techniques in knowledge acquisition and representation, and their application in the construction of systems – including expert systems – based thereon. Detailed scientific evaluation is an essential part of any paper. As well as traditional application areas, such as Software and Requirements Engineering, Human-Computer Interaction, and Artificial Intelligence, we are aiming at the new and growing markets for these technologies, such as Business, Economy, Market Research, and Medical and Health Care. The shift towards this new focus will be marked by a series of special issues covering hot and emergent topics.