{"title":"BLM-DTO:基于强盗学习和匹配的雾网络分布式任务卸载","authors":"Hoa Tran-Dang, Dongsung Kim","doi":"10.1109/ICEIC57457.2023.10049981","DOIUrl":null,"url":null,"abstract":"This paper proposes an algorithm called BLM-DTO that allows each fog node (FN) to implement the task offloading operations in a distributed manner in the fog computing networks (FCNs). Fundamentally, BLM-DTO leverages the principle of matching game theory to achieve the stable matching outcome based on preference relations of two sides of the game. Due to the dynamic nature of fog computing environment, the preference relation of one-side game players is unknown a priori and achieved only by iteratively interacting with the other side of players. Thus, BLM-DTO further incorporates multi-armed bandit (MAB) learning using Thompson sampling (TS) technique to adaptively learn their unknown preferences. Extensive simulation results demonstrate the potential advantages of the proposed TS-type offloading algorithm over the ϵ-greedy and upper-bound confidence (UCB)-type baselines.","PeriodicalId":373752,"journal":{"name":"2023 International Conference on Electronics, Information, and Communication (ICEIC)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"BLM-DTO: Bandit Learning and Matching based Distributed Task Offloading in Fog Networks\",\"authors\":\"Hoa Tran-Dang, Dongsung Kim\",\"doi\":\"10.1109/ICEIC57457.2023.10049981\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes an algorithm called BLM-DTO that allows each fog node (FN) to implement the task offloading operations in a distributed manner in the fog computing networks (FCNs). Fundamentally, BLM-DTO leverages the principle of matching game theory to achieve the stable matching outcome based on preference relations of two sides of the game. Due to the dynamic nature of fog computing environment, the preference relation of one-side game players is unknown a priori and achieved only by iteratively interacting with the other side of players. Thus, BLM-DTO further incorporates multi-armed bandit (MAB) learning using Thompson sampling (TS) technique to adaptively learn their unknown preferences. Extensive simulation results demonstrate the potential advantages of the proposed TS-type offloading algorithm over the ϵ-greedy and upper-bound confidence (UCB)-type baselines.\",\"PeriodicalId\":373752,\"journal\":{\"name\":\"2023 International Conference on Electronics, Information, and Communication (ICEIC)\",\"volume\":\"97 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-02-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Electronics, Information, and Communication (ICEIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEIC57457.2023.10049981\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Electronics, Information, and Communication (ICEIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEIC57457.2023.10049981","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
BLM-DTO: Bandit Learning and Matching based Distributed Task Offloading in Fog Networks
This paper proposes an algorithm called BLM-DTO that allows each fog node (FN) to implement the task offloading operations in a distributed manner in the fog computing networks (FCNs). Fundamentally, BLM-DTO leverages the principle of matching game theory to achieve the stable matching outcome based on preference relations of two sides of the game. Due to the dynamic nature of fog computing environment, the preference relation of one-side game players is unknown a priori and achieved only by iteratively interacting with the other side of players. Thus, BLM-DTO further incorporates multi-armed bandit (MAB) learning using Thompson sampling (TS) technique to adaptively learn their unknown preferences. Extensive simulation results demonstrate the potential advantages of the proposed TS-type offloading algorithm over the ϵ-greedy and upper-bound confidence (UCB)-type baselines.