Joint Throughput Maximization and Energy Management for Ultralow Power Ambient Backscatter Communication in WBANs by Distributed Deep Reinforcement Learning
IF 4.3 2区 综合性期刊Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
{"title":"Joint Throughput Maximization and Energy Management for Ultralow Power Ambient Backscatter Communication in WBANs by Distributed Deep Reinforcement Learning","authors":"Youze Yang;Sen Yan","doi":"10.1109/JSEN.2024.3487354","DOIUrl":null,"url":null,"abstract":"In this article, we present an ultralow power ambient backscatter communications (AmBC) framework for wireless body area networks (WBANs). In these WBANs, the AmBC sensor nodes are energy harvesting (EH) powered and are able to transmit their own collected physiological information by backscattering the signals emitted by the Wi-Fi access point (AP). To achieve maximum throughput performance with sustainable operation for these ultralow power AmBC sensor nodes in WBANs, we propose a three-phase AmBC transmission model and formulate a joint throughput maximization and energy management (JTMEM) problem. The first-order Markov process channel model and random data collection are also developed in the system model to represent practical health monitoring application environments. Since the full real-time channel state information (CSI) is difficult to obtain at the beginning of each time slot, the optimal working mode selection (WMS) policy is not available for this problem. To overcome this issue, we propose a deep reinforcement learning (DRL)-based algorithm, which can use historical CSI to learn the potential channel correlation to infer the channel changing and decide the appropriate working mode for each AmBC sensor node. Moreover, we establish a distributed DRL structure to overcome huge action space issues and make the proposed algorithm flexible and scalable. Finally, extensive numerical simulation results demonstrate that our proposed algorithm can approach the average throughput performance and rewards of the ergodic policy, which has full real-time CSI in different scenarios, and represents favorable energy management performance after convergence.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"24 24","pages":"42484-42499"},"PeriodicalIF":4.3000,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10742315/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
In this article, we present an ultralow power ambient backscatter communications (AmBC) framework for wireless body area networks (WBANs). In these WBANs, the AmBC sensor nodes are energy harvesting (EH) powered and are able to transmit their own collected physiological information by backscattering the signals emitted by the Wi-Fi access point (AP). To achieve maximum throughput performance with sustainable operation for these ultralow power AmBC sensor nodes in WBANs, we propose a three-phase AmBC transmission model and formulate a joint throughput maximization and energy management (JTMEM) problem. The first-order Markov process channel model and random data collection are also developed in the system model to represent practical health monitoring application environments. Since the full real-time channel state information (CSI) is difficult to obtain at the beginning of each time slot, the optimal working mode selection (WMS) policy is not available for this problem. To overcome this issue, we propose a deep reinforcement learning (DRL)-based algorithm, which can use historical CSI to learn the potential channel correlation to infer the channel changing and decide the appropriate working mode for each AmBC sensor node. Moreover, we establish a distributed DRL structure to overcome huge action space issues and make the proposed algorithm flexible and scalable. Finally, extensive numerical simulation results demonstrate that our proposed algorithm can approach the average throughput performance and rewards of the ergodic policy, which has full real-time CSI in different scenarios, and represents favorable energy management performance after convergence.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice