Md Sarfraz Alam, R.S. Kurmvanshi, G. Prasad, P. Pattanayak
{"title":"自我持续的irs辅助双向通信EBi2ET协议:一种迁移学习方法","authors":"Md Sarfraz Alam, R.S. Kurmvanshi, G. Prasad, P. Pattanayak","doi":"10.1016/j.phycom.2025.102807","DOIUrl":null,"url":null,"abstract":"<div><div>Internet of Things (IoT) networks require sensors to collect real-time data for various applications, but they have limited battery capacity for ubiquitous services. In this work, we address the critical energy limitations in intelligent reflecting surface (IRS)-assisted wireless powered communication networks (WPCNs) by introducing an energy borrowing (EB) mechanism to ensure reliable bidirectional communication. We develop a novel framework, EB-assisted integrated information and energy transfer (EBi<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>ET) protocol for self-sustainable IoT networks. Here, a hybrid access point (HAP) performs bidirectional communication with a source (or sensor) node via an IRS. For self-sustainability, HAP harvests energy from the ambient solar/wind. But for ubiquitous energy supply, HAP borrows energy from the power grid (PG) and returns the energy with interest, when it has sufficient energy from the ambient energy harvesting (EH). Further, HAP transfers the energy to the source via IRS using nonlinear radio frequency EH (RF-EH). Then, both HAP and source perform bidirectional communication. To enhance self-sustainable communication, we formulate a joint optimization problem aimed at maximizing the bidirectional data rate while accounting for energy transactions. Although managing these transactions and bidirectional communications in an unknown environment presents challenges, we address them effectively by transforming the problem into a Markov Decision Process (MDP). Leveraging the Deep Deterministic Policy Gradient (DDPG) algorithm, we derive the optimal online policy within a continuous action space. To further accelerate convergence and improve stability, we incorporate a transfer learning (TL)-based DDPG approach. Lastly, we demonstrate the superiority of our proposed method through performance comparisons with existing benchmark schemes, where the EBi<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>ET protocol achieves an impressive average performance gain of 193.58%. Furthermore, TD3 without TL, IRS-WPCN, EBi<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>ER, and i<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>ER protocols also yield notable improvements of 187.33%, 95.33%, 67.02%, and 43.18%, respectively, over the benchmark.</div></div>","PeriodicalId":48707,"journal":{"name":"Physical Communication","volume":"72 ","pages":"Article 102807"},"PeriodicalIF":2.2000,"publicationDate":"2025-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"EBi2ET protocol for self-sustainable IRS-assisted bidirectional communication: A transfer learning approach\",\"authors\":\"Md Sarfraz Alam, R.S. Kurmvanshi, G. Prasad, P. Pattanayak\",\"doi\":\"10.1016/j.phycom.2025.102807\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Internet of Things (IoT) networks require sensors to collect real-time data for various applications, but they have limited battery capacity for ubiquitous services. In this work, we address the critical energy limitations in intelligent reflecting surface (IRS)-assisted wireless powered communication networks (WPCNs) by introducing an energy borrowing (EB) mechanism to ensure reliable bidirectional communication. We develop a novel framework, EB-assisted integrated information and energy transfer (EBi<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>ET) protocol for self-sustainable IoT networks. Here, a hybrid access point (HAP) performs bidirectional communication with a source (or sensor) node via an IRS. For self-sustainability, HAP harvests energy from the ambient solar/wind. But for ubiquitous energy supply, HAP borrows energy from the power grid (PG) and returns the energy with interest, when it has sufficient energy from the ambient energy harvesting (EH). Further, HAP transfers the energy to the source via IRS using nonlinear radio frequency EH (RF-EH). Then, both HAP and source perform bidirectional communication. To enhance self-sustainable communication, we formulate a joint optimization problem aimed at maximizing the bidirectional data rate while accounting for energy transactions. Although managing these transactions and bidirectional communications in an unknown environment presents challenges, we address them effectively by transforming the problem into a Markov Decision Process (MDP). Leveraging the Deep Deterministic Policy Gradient (DDPG) algorithm, we derive the optimal online policy within a continuous action space. To further accelerate convergence and improve stability, we incorporate a transfer learning (TL)-based DDPG approach. Lastly, we demonstrate the superiority of our proposed method through performance comparisons with existing benchmark schemes, where the EBi<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>ET protocol achieves an impressive average performance gain of 193.58%. Furthermore, TD3 without TL, IRS-WPCN, EBi<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>ER, and i<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span>ER protocols also yield notable improvements of 187.33%, 95.33%, 67.02%, and 43.18%, respectively, over the benchmark.</div></div>\",\"PeriodicalId\":48707,\"journal\":{\"name\":\"Physical Communication\",\"volume\":\"72 \",\"pages\":\"Article 102807\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-08-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physical Communication\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1874490725002101\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physical Communication","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1874490725002101","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
EBi2ET protocol for self-sustainable IRS-assisted bidirectional communication: A transfer learning approach
Internet of Things (IoT) networks require sensors to collect real-time data for various applications, but they have limited battery capacity for ubiquitous services. In this work, we address the critical energy limitations in intelligent reflecting surface (IRS)-assisted wireless powered communication networks (WPCNs) by introducing an energy borrowing (EB) mechanism to ensure reliable bidirectional communication. We develop a novel framework, EB-assisted integrated information and energy transfer (EBiET) protocol for self-sustainable IoT networks. Here, a hybrid access point (HAP) performs bidirectional communication with a source (or sensor) node via an IRS. For self-sustainability, HAP harvests energy from the ambient solar/wind. But for ubiquitous energy supply, HAP borrows energy from the power grid (PG) and returns the energy with interest, when it has sufficient energy from the ambient energy harvesting (EH). Further, HAP transfers the energy to the source via IRS using nonlinear radio frequency EH (RF-EH). Then, both HAP and source perform bidirectional communication. To enhance self-sustainable communication, we formulate a joint optimization problem aimed at maximizing the bidirectional data rate while accounting for energy transactions. Although managing these transactions and bidirectional communications in an unknown environment presents challenges, we address them effectively by transforming the problem into a Markov Decision Process (MDP). Leveraging the Deep Deterministic Policy Gradient (DDPG) algorithm, we derive the optimal online policy within a continuous action space. To further accelerate convergence and improve stability, we incorporate a transfer learning (TL)-based DDPG approach. Lastly, we demonstrate the superiority of our proposed method through performance comparisons with existing benchmark schemes, where the EBiET protocol achieves an impressive average performance gain of 193.58%. Furthermore, TD3 without TL, IRS-WPCN, EBiER, and iER protocols also yield notable improvements of 187.33%, 95.33%, 67.02%, and 43.18%, respectively, over the benchmark.
期刊介绍:
PHYCOM: Physical Communication is an international and archival journal providing complete coverage of all topics of interest to those involved in all aspects of physical layer communications. Theoretical research contributions presenting new techniques, concepts or analyses, applied contributions reporting on experiences and experiments, and tutorials are published.
Topics of interest include but are not limited to:
Physical layer issues of Wireless Local Area Networks, WiMAX, Wireless Mesh Networks, Sensor and Ad Hoc Networks, PCS Systems; Radio access protocols and algorithms for the physical layer; Spread Spectrum Communications; Channel Modeling; Detection and Estimation; Modulation and Coding; Multiplexing and Carrier Techniques; Broadband Wireless Communications; Wireless Personal Communications; Multi-user Detection; Signal Separation and Interference rejection: Multimedia Communications over Wireless; DSP Applications to Wireless Systems; Experimental and Prototype Results; Multiple Access Techniques; Space-time Processing; Synchronization Techniques; Error Control Techniques; Cryptography; Software Radios; Tracking; Resource Allocation and Inference Management; Multi-rate and Multi-carrier Communications; Cross layer Design and Optimization; Propagation and Channel Characterization; OFDM Systems; MIMO Systems; Ultra-Wideband Communications; Cognitive Radio System Architectures; Platforms and Hardware Implementations for the Support of Cognitive, Radio Systems; Cognitive Radio Resource Management and Dynamic Spectrum Sharing.