{"title":"利用数据重采样技术建立现实挖掘中公平的分类模型","authors":"Ghassan F. Bati","doi":"10.1109/AECT47998.2020.9194174","DOIUrl":null,"url":null,"abstract":"Multiple recent efforts in human-centered computing have used mobile and ubiquitous data to infer propensities of individuals (e.g. to trust others or behave altruistically). Often studied under the umbrella of “Reality Mining” multiple such efforts have reported high accuracies at the considered prediction tasks. However, there has been little work at quantifying the “fairness” of such algorithms in terms of how the quality of the predictions varies over different demographic groups (e.g. across gender). This work takes inspiration from data resampling techniques to create fair classification models. Empirical results suggest that a combination of over and under sampling technique (SmoteTomek) to the sensitive (protected) attribute (e.g. gender) yields improved model’s performance while reducing disparity across genders.","PeriodicalId":331415,"journal":{"name":"2019 International Conference on Advances in the Emerging Computing Technologies (AECT)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards Creating Fair Classification Models in Reality Mining Using Data Resampling Techniques\",\"authors\":\"Ghassan F. Bati\",\"doi\":\"10.1109/AECT47998.2020.9194174\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multiple recent efforts in human-centered computing have used mobile and ubiquitous data to infer propensities of individuals (e.g. to trust others or behave altruistically). Often studied under the umbrella of “Reality Mining” multiple such efforts have reported high accuracies at the considered prediction tasks. However, there has been little work at quantifying the “fairness” of such algorithms in terms of how the quality of the predictions varies over different demographic groups (e.g. across gender). This work takes inspiration from data resampling techniques to create fair classification models. Empirical results suggest that a combination of over and under sampling technique (SmoteTomek) to the sensitive (protected) attribute (e.g. gender) yields improved model’s performance while reducing disparity across genders.\",\"PeriodicalId\":331415,\"journal\":{\"name\":\"2019 International Conference on Advances in the Emerging Computing Technologies (AECT)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Advances in the Emerging Computing Technologies (AECT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AECT47998.2020.9194174\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Advances in the Emerging Computing Technologies (AECT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AECT47998.2020.9194174","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Towards Creating Fair Classification Models in Reality Mining Using Data Resampling Techniques
Multiple recent efforts in human-centered computing have used mobile and ubiquitous data to infer propensities of individuals (e.g. to trust others or behave altruistically). Often studied under the umbrella of “Reality Mining” multiple such efforts have reported high accuracies at the considered prediction tasks. However, there has been little work at quantifying the “fairness” of such algorithms in terms of how the quality of the predictions varies over different demographic groups (e.g. across gender). This work takes inspiration from data resampling techniques to create fair classification models. Empirical results suggest that a combination of over and under sampling technique (SmoteTomek) to the sensitive (protected) attribute (e.g. gender) yields improved model’s performance while reducing disparity across genders.