FaiR-IoT: Fairness-aware Human-in-the-Loop Reinforcement Learning for Harnessing Human Variability in Personalized IoT

Proceedings of the International Conference on Internet-of-Things Design and Implementation Pub Date : 2021-03-30 DOI:10.1145/3450268.3453525

Salma Elmalaki

{"title":"FaiR-IoT: Fairness-aware Human-in-the-Loop Reinforcement Learning for Harnessing Human Variability in Personalized IoT","authors":"Salma Elmalaki","doi":"10.1145/3450268.3453525","DOIUrl":null,"url":null,"abstract":"Thanks to the rapid growth in wearable technologies, monitoring complex human context becomes feasible, paving the way to develop human-in-the-loop IoT systems that naturally evolve to adapt to the human and environment state autonomously. Nevertheless, a central challenge in designing such personalized IoT applications arises from human variability. Such variability stems from the fact that different humans exhibit different behaviors when interacting with IoT applications (intra-human variability), the same human may change the behavior over time when interacting with the same IoT application (inter-human variability), and human behavior may be affected by the behaviors of other people in the same environment (multi-human variability). To that end, we propose FaiR-IoT, a general reinforcement learning-based framework for adaptive and fairness-aware human-in-the-loop IoT applications. In FaiR-IoT, three levels of reinforcement learning agents interact to continuously learn human preferences and maximize the system's performance and fairness while taking into account the intra-, inter-, and multi-human variability. We validate the proposed framework on two applications, namely (i) Human-in-the-Loop Automotive Advanced Driver Assistance Systems and (ii) Human-in-the-Loop Smart House. Results obtained on these two applications validate the generality of FaiR-IoT and its ability to provide a personalized experience while enhancing the system's performance by 40%-60% compared to non-personalized systems and enhancing the fairness of the multi-human systems by 1.5 orders of magnitude.","PeriodicalId":130134,"journal":{"name":"Proceedings of the International Conference on Internet-of-Things Design and Implementation","volume":"177 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the International Conference on Internet-of-Things Design and Implementation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3450268.3453525","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 17

Abstract

Thanks to the rapid growth in wearable technologies, monitoring complex human context becomes feasible, paving the way to develop human-in-the-loop IoT systems that naturally evolve to adapt to the human and environment state autonomously. Nevertheless, a central challenge in designing such personalized IoT applications arises from human variability. Such variability stems from the fact that different humans exhibit different behaviors when interacting with IoT applications (intra-human variability), the same human may change the behavior over time when interacting with the same IoT application (inter-human variability), and human behavior may be affected by the behaviors of other people in the same environment (multi-human variability). To that end, we propose FaiR-IoT, a general reinforcement learning-based framework for adaptive and fairness-aware human-in-the-loop IoT applications. In FaiR-IoT, three levels of reinforcement learning agents interact to continuously learn human preferences and maximize the system's performance and fairness while taking into account the intra-, inter-, and multi-human variability. We validate the proposed framework on two applications, namely (i) Human-in-the-Loop Automotive Advanced Driver Assistance Systems and (ii) Human-in-the-Loop Smart House. Results obtained on these two applications validate the generality of FaiR-IoT and its ability to provide a personalized experience while enhancing the system's performance by 40%-60% compared to non-personalized systems and enhancing the fairness of the multi-human systems by 1.5 orders of magnitude.

查看原文本刊更多论文

公平物联网:在个性化物联网中利用人类可变性的公平感知人在环强化学习

由于可穿戴技术的快速发展，监测复杂的人类环境变得可行，为开发自然进化以自主适应人类和环境状态的人在环物联网系统铺平了道路。然而，设计这种个性化物联网应用的一个核心挑战来自于人类的可变性。这种可变性源于这样一个事实，即不同的人在与物联网应用交互时表现出不同的行为(人内可变性)，同一个人在与同一物联网应用交互时可能会随着时间的推移改变行为(人间可变性)，人类行为可能受到同一环境中其他人行为的影响(多人可变性)。为此，我们提出了FaiR-IoT，这是一种基于通用强化学习的框架，用于自适应和公平感知的人在环物联网应用。在FaiR-IoT中，三个层次的强化学习代理相互作用，不断学习人类的偏好，并在考虑到人内部、人之间和人之间的可变性的同时，最大限度地提高系统的性能和公平性。我们在两个应用上验证了所提出的框架，即(i)人在环汽车高级驾驶辅助系统和(ii)人在环智能住宅。在这两个应用中获得的结果验证了FaiR-IoT的通用性及其提供个性化体验的能力，同时与非个性化系统相比，将系统性能提高了40%-60%，并将多人系统的公平性提高了1.5个数量级。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the International Conference on Internet-of-Things Design and Implementation

自引率

0.00%

发文量