Nan Cheng;Haoran Chen;Ruijin Sun;Longfei Ma;Conghao Zhou;Yuan Zhang;Yilong Hui
{"title":"无人机无线网络中目标检测驱动的联合视频传输与处理的信息价值优化","authors":"Nan Cheng;Haoran Chen;Ruijin Sun;Longfei Ma;Conghao Zhou;Yuan Zhang;Yilong Hui","doi":"10.1109/JMASS.2025.3567087","DOIUrl":null,"url":null,"abstract":"In the wake of disasters, rapid and efficient search and rescue operations are essential. Uncrewed aerial vehicles (UAVs) have become instrumental in such scenarios, providing real-time video streaming that can be used for object detection to locate survivors. This technology, however, faces significant challenges due to the limited communication and onboard computational resources, which are critical for processing and transmitting high-quality video data. To address these issues, this article proposes a novel approach that leverages the concept of the value of information (VoI) to optimize the tradeoff between the accuracy of object detection and the associated communication costs. By dynamically adjusting the video stream’s quality, the proposed system aims to ensure that the most valuable information is transmitted within the constraints of bandwidth and computational power. To operationalize this concept, we introduce a deep reinforcement learning (DRL) algorithm that employs the soft actor-critic (SAC) method. The algorithm benefits from the integration of object features and contextual information extracted by ResNet50, which is then processed through a cross-attention structure within the critic network. Our simulation results indicate that our approach significantly enhances the VoI, achieving higher accuracy in object detection with better resource management compared to traditional strategies.","PeriodicalId":100624,"journal":{"name":"IEEE Journal on Miniaturization for Air and Space Systems","volume":"6 2","pages":"59-69"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Value-of-Information Optimization for Object Detection-Driven Joint Video Transmission and Processing in UAV-Enabled Wireless Networks\",\"authors\":\"Nan Cheng;Haoran Chen;Ruijin Sun;Longfei Ma;Conghao Zhou;Yuan Zhang;Yilong Hui\",\"doi\":\"10.1109/JMASS.2025.3567087\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the wake of disasters, rapid and efficient search and rescue operations are essential. Uncrewed aerial vehicles (UAVs) have become instrumental in such scenarios, providing real-time video streaming that can be used for object detection to locate survivors. This technology, however, faces significant challenges due to the limited communication and onboard computational resources, which are critical for processing and transmitting high-quality video data. To address these issues, this article proposes a novel approach that leverages the concept of the value of information (VoI) to optimize the tradeoff between the accuracy of object detection and the associated communication costs. By dynamically adjusting the video stream’s quality, the proposed system aims to ensure that the most valuable information is transmitted within the constraints of bandwidth and computational power. To operationalize this concept, we introduce a deep reinforcement learning (DRL) algorithm that employs the soft actor-critic (SAC) method. The algorithm benefits from the integration of object features and contextual information extracted by ResNet50, which is then processed through a cross-attention structure within the critic network. Our simulation results indicate that our approach significantly enhances the VoI, achieving higher accuracy in object detection with better resource management compared to traditional strategies.\",\"PeriodicalId\":100624,\"journal\":{\"name\":\"IEEE Journal on Miniaturization for Air and Space Systems\",\"volume\":\"6 2\",\"pages\":\"59-69\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-03-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal on Miniaturization for Air and Space Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10988650/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal on Miniaturization for Air and Space Systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10988650/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Value-of-Information Optimization for Object Detection-Driven Joint Video Transmission and Processing in UAV-Enabled Wireless Networks
In the wake of disasters, rapid and efficient search and rescue operations are essential. Uncrewed aerial vehicles (UAVs) have become instrumental in such scenarios, providing real-time video streaming that can be used for object detection to locate survivors. This technology, however, faces significant challenges due to the limited communication and onboard computational resources, which are critical for processing and transmitting high-quality video data. To address these issues, this article proposes a novel approach that leverages the concept of the value of information (VoI) to optimize the tradeoff between the accuracy of object detection and the associated communication costs. By dynamically adjusting the video stream’s quality, the proposed system aims to ensure that the most valuable information is transmitted within the constraints of bandwidth and computational power. To operationalize this concept, we introduce a deep reinforcement learning (DRL) algorithm that employs the soft actor-critic (SAC) method. The algorithm benefits from the integration of object features and contextual information extracted by ResNet50, which is then processed through a cross-attention structure within the critic network. Our simulation results indicate that our approach significantly enhances the VoI, achieving higher accuracy in object detection with better resource management compared to traditional strategies.