Maryam Karimi, M. Dehghan, Seyyed Majid Nourhoseini
{"title":"A systematic framework for dynamically optimizing delay-sensitive wireless transmission","authors":"Maryam Karimi, M. Dehghan, Seyyed Majid Nourhoseini","doi":"10.1109/AISP.2015.7123489","DOIUrl":null,"url":null,"abstract":"Delay sensitive applications need to overcome the service problems in dynamic environments with respect to both the multimedia source data (e.g., variable bit-rate) and the wireless channels (e.g., fading channel). This paper considers the problem of point to point transmission of scalable video coding over a fading channel. We formulate the rate adaptation challenge of WLAN multimedia networks as a Markov Decision Process and resolve this problem online based on reinforcement learning. The buffer state, channel state, and video state were considered as a joint state of system to maximize the average Quality of Service under delay constraints. To improve the convergence speed of learning, system's underlying dynamics were partitioned into a priori known and a priori unknown components. The proposed learning algorithm exploits known information about the system, so that less information needs to be learned compared with that in conventional reinforcement learning algorithms.","PeriodicalId":405857,"journal":{"name":"2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 The International Symposium on Artificial Intelligence and Signal Processing (AISP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AISP.2015.7123489","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Delay sensitive applications need to overcome the service problems in dynamic environments with respect to both the multimedia source data (e.g., variable bit-rate) and the wireless channels (e.g., fading channel). This paper considers the problem of point to point transmission of scalable video coding over a fading channel. We formulate the rate adaptation challenge of WLAN multimedia networks as a Markov Decision Process and resolve this problem online based on reinforcement learning. The buffer state, channel state, and video state were considered as a joint state of system to maximize the average Quality of Service under delay constraints. To improve the convergence speed of learning, system's underlying dynamics were partitioned into a priori known and a priori unknown components. The proposed learning algorithm exploits known information about the system, so that less information needs to be learned compared with that in conventional reinforcement learning algorithms.