Riddhiman Bhattacharya, Thanh Nguyen, Will Wei Sun, Mohit Tawarmalani
{"title":"主动学习,实现公平稳定的在线分配","authors":"Riddhiman Bhattacharya, Thanh Nguyen, Will Wei Sun, Mohit Tawarmalani","doi":"arxiv-2406.14784","DOIUrl":null,"url":null,"abstract":"We explore an active learning approach for dynamic fair resource allocation\nproblems. Unlike previous work that assumes full feedback from all agents on\ntheir allocations, we consider feedback from a select subset of agents at each\nepoch of the online resource allocation process. Despite this restriction, our\nproposed algorithms provide regret bounds that are sub-linear in number of\ntime-periods for various measures that include fairness metrics commonly used\nin resource allocation problems and stability considerations in matching\nmechanisms. The key insight of our algorithms lies in adaptively identifying\nthe most informative feedback using dueling upper and lower confidence bounds.\nWith this strategy, we show that efficient decision-making does not require\nextensive feedback and produces efficient outcomes for a variety of problem\nclasses.","PeriodicalId":501323,"journal":{"name":"arXiv - STAT - Other Statistics","volume":"38 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Active Learning for Fair and Stable Online Allocations\",\"authors\":\"Riddhiman Bhattacharya, Thanh Nguyen, Will Wei Sun, Mohit Tawarmalani\",\"doi\":\"arxiv-2406.14784\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We explore an active learning approach for dynamic fair resource allocation\\nproblems. Unlike previous work that assumes full feedback from all agents on\\ntheir allocations, we consider feedback from a select subset of agents at each\\nepoch of the online resource allocation process. Despite this restriction, our\\nproposed algorithms provide regret bounds that are sub-linear in number of\\ntime-periods for various measures that include fairness metrics commonly used\\nin resource allocation problems and stability considerations in matching\\nmechanisms. The key insight of our algorithms lies in adaptively identifying\\nthe most informative feedback using dueling upper and lower confidence bounds.\\nWith this strategy, we show that efficient decision-making does not require\\nextensive feedback and produces efficient outcomes for a variety of problem\\nclasses.\",\"PeriodicalId\":501323,\"journal\":{\"name\":\"arXiv - STAT - Other Statistics\",\"volume\":\"38 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - STAT - Other Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2406.14784\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Other Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2406.14784","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Active Learning for Fair and Stable Online Allocations
We explore an active learning approach for dynamic fair resource allocation
problems. Unlike previous work that assumes full feedback from all agents on
their allocations, we consider feedback from a select subset of agents at each
epoch of the online resource allocation process. Despite this restriction, our
proposed algorithms provide regret bounds that are sub-linear in number of
time-periods for various measures that include fairness metrics commonly used
in resource allocation problems and stability considerations in matching
mechanisms. The key insight of our algorithms lies in adaptively identifying
the most informative feedback using dueling upper and lower confidence bounds.
With this strategy, we show that efficient decision-making does not require
extensive feedback and produces efficient outcomes for a variety of problem
classes.