{"title":"走向公平的 RAG:论检索增强生成中公平排序的影响","authors":"To Eun Kim, Fernando Diaz","doi":"arxiv-2409.11598","DOIUrl":null,"url":null,"abstract":"Many language models now enhance their responses with retrieval capabilities,\nleading to the widespread adoption of retrieval-augmented generation (RAG)\nsystems. However, despite retrieval being a core component of RAG, much of the\nresearch in this area overlooks the extensive body of work on fair ranking,\nneglecting the importance of considering all stakeholders involved. This paper\npresents the first systematic evaluation of RAG systems integrated with fair\nrankings. We focus specifically on measuring the fair exposure of each relevant\nitem across the rankings utilized by RAG systems (i.e., item-side fairness),\naiming to promote equitable growth for relevant item providers. To gain a deep\nunderstanding of the relationship between item-fairness, ranking quality, and\ngeneration quality in the context of RAG, we analyze nine different RAG systems\nthat incorporate fair rankings across seven distinct datasets. Our findings\nindicate that RAG systems with fair rankings can maintain a high level of\ngeneration quality and, in many cases, even outperform traditional RAG systems,\ndespite the general trend of a tradeoff between ensuring fairness and\nmaintaining system-effectiveness. We believe our insights lay the groundwork\nfor responsible and equitable RAG systems and open new avenues for future\nresearch. We publicly release our codebase and dataset at\nhttps://github.com/kimdanny/Fair-RAG.","PeriodicalId":501281,"journal":{"name":"arXiv - CS - Information Retrieval","volume":"27 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation\",\"authors\":\"To Eun Kim, Fernando Diaz\",\"doi\":\"arxiv-2409.11598\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Many language models now enhance their responses with retrieval capabilities,\\nleading to the widespread adoption of retrieval-augmented generation (RAG)\\nsystems. However, despite retrieval being a core component of RAG, much of the\\nresearch in this area overlooks the extensive body of work on fair ranking,\\nneglecting the importance of considering all stakeholders involved. This paper\\npresents the first systematic evaluation of RAG systems integrated with fair\\nrankings. We focus specifically on measuring the fair exposure of each relevant\\nitem across the rankings utilized by RAG systems (i.e., item-side fairness),\\naiming to promote equitable growth for relevant item providers. To gain a deep\\nunderstanding of the relationship between item-fairness, ranking quality, and\\ngeneration quality in the context of RAG, we analyze nine different RAG systems\\nthat incorporate fair rankings across seven distinct datasets. Our findings\\nindicate that RAG systems with fair rankings can maintain a high level of\\ngeneration quality and, in many cases, even outperform traditional RAG systems,\\ndespite the general trend of a tradeoff between ensuring fairness and\\nmaintaining system-effectiveness. We believe our insights lay the groundwork\\nfor responsible and equitable RAG systems and open new avenues for future\\nresearch. We publicly release our codebase and dataset at\\nhttps://github.com/kimdanny/Fair-RAG.\",\"PeriodicalId\":501281,\"journal\":{\"name\":\"arXiv - CS - Information Retrieval\",\"volume\":\"27 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Information Retrieval\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.11598\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.11598","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Towards Fair RAG: On the Impact of Fair Ranking in Retrieval-Augmented Generation
Many language models now enhance their responses with retrieval capabilities,
leading to the widespread adoption of retrieval-augmented generation (RAG)
systems. However, despite retrieval being a core component of RAG, much of the
research in this area overlooks the extensive body of work on fair ranking,
neglecting the importance of considering all stakeholders involved. This paper
presents the first systematic evaluation of RAG systems integrated with fair
rankings. We focus specifically on measuring the fair exposure of each relevant
item across the rankings utilized by RAG systems (i.e., item-side fairness),
aiming to promote equitable growth for relevant item providers. To gain a deep
understanding of the relationship between item-fairness, ranking quality, and
generation quality in the context of RAG, we analyze nine different RAG systems
that incorporate fair rankings across seven distinct datasets. Our findings
indicate that RAG systems with fair rankings can maintain a high level of
generation quality and, in many cases, even outperform traditional RAG systems,
despite the general trend of a tradeoff between ensuring fairness and
maintaining system-effectiveness. We believe our insights lay the groundwork
for responsible and equitable RAG systems and open new avenues for future
research. We publicly release our codebase and dataset at
https://github.com/kimdanny/Fair-RAG.