{"title":"社交媒体领域交织的偏见:解读媒体偏见维度的相关性","authors":"Yifan Liu, Yike Li, Dong Wang","doi":"arxiv-2408.15406","DOIUrl":null,"url":null,"abstract":"Media bias significantly shapes public perception by reinforcing stereotypes\nand exacerbating societal divisions. Prior research has often focused on\nisolated media bias dimensions such as \\textit{political bias} or\n\\textit{racial bias}, neglecting the complex interrelationships among various\nbias dimensions across different topic domains. Moreover, we observe that\nmodels trained on existing media bias benchmarks fail to generalize effectively\non recent social media posts, particularly in certain bias identification\ntasks. This shortfall primarily arises because these benchmarks do not\nadequately reflect the rapidly evolving nature of social media content, which\nis characterized by shifting user behaviors and emerging trends. In response to\nthese limitations, our research introduces a novel dataset collected from\nYouTube and Reddit over the past five years. Our dataset includes automated\nannotations for YouTube content across a broad spectrum of bias dimensions,\nsuch as gender, racial, and political biases, as well as hate speech, among\nothers. It spans diverse domains including politics, sports, healthcare,\neducation, and entertainment, reflecting the complex interplay of biases across\ndifferent societal sectors. Through comprehensive statistical analysis, we\nidentify significant differences in bias expression patterns and intra-domain\nbias correlations across these domains. By utilizing our understanding of the\ncorrelations among various bias dimensions, we lay the groundwork for creating\nadvanced systems capable of detecting multiple biases simultaneously. Overall,\nour dataset advances the field of media bias identification, contributing to\nthe development of tools that promote fairer media consumption. The\ncomprehensive awareness of existing media bias fosters more ethical journalism,\npromotes cultural sensitivity, and supports a more informed and equitable\npublic discourse.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Intertwined Biases Across Social Media Spheres: Unpacking Correlations in Media Bias Dimensions\",\"authors\":\"Yifan Liu, Yike Li, Dong Wang\",\"doi\":\"arxiv-2408.15406\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Media bias significantly shapes public perception by reinforcing stereotypes\\nand exacerbating societal divisions. Prior research has often focused on\\nisolated media bias dimensions such as \\\\textit{political bias} or\\n\\\\textit{racial bias}, neglecting the complex interrelationships among various\\nbias dimensions across different topic domains. Moreover, we observe that\\nmodels trained on existing media bias benchmarks fail to generalize effectively\\non recent social media posts, particularly in certain bias identification\\ntasks. This shortfall primarily arises because these benchmarks do not\\nadequately reflect the rapidly evolving nature of social media content, which\\nis characterized by shifting user behaviors and emerging trends. In response to\\nthese limitations, our research introduces a novel dataset collected from\\nYouTube and Reddit over the past five years. Our dataset includes automated\\nannotations for YouTube content across a broad spectrum of bias dimensions,\\nsuch as gender, racial, and political biases, as well as hate speech, among\\nothers. It spans diverse domains including politics, sports, healthcare,\\neducation, and entertainment, reflecting the complex interplay of biases across\\ndifferent societal sectors. Through comprehensive statistical analysis, we\\nidentify significant differences in bias expression patterns and intra-domain\\nbias correlations across these domains. By utilizing our understanding of the\\ncorrelations among various bias dimensions, we lay the groundwork for creating\\nadvanced systems capable of detecting multiple biases simultaneously. Overall,\\nour dataset advances the field of media bias identification, contributing to\\nthe development of tools that promote fairer media consumption. The\\ncomprehensive awareness of existing media bias fosters more ethical journalism,\\npromotes cultural sensitivity, and supports a more informed and equitable\\npublic discourse.\",\"PeriodicalId\":501032,\"journal\":{\"name\":\"arXiv - CS - Social and Information Networks\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Social and Information Networks\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.15406\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Social and Information Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.15406","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Intertwined Biases Across Social Media Spheres: Unpacking Correlations in Media Bias Dimensions
Media bias significantly shapes public perception by reinforcing stereotypes
and exacerbating societal divisions. Prior research has often focused on
isolated media bias dimensions such as \textit{political bias} or
\textit{racial bias}, neglecting the complex interrelationships among various
bias dimensions across different topic domains. Moreover, we observe that
models trained on existing media bias benchmarks fail to generalize effectively
on recent social media posts, particularly in certain bias identification
tasks. This shortfall primarily arises because these benchmarks do not
adequately reflect the rapidly evolving nature of social media content, which
is characterized by shifting user behaviors and emerging trends. In response to
these limitations, our research introduces a novel dataset collected from
YouTube and Reddit over the past five years. Our dataset includes automated
annotations for YouTube content across a broad spectrum of bias dimensions,
such as gender, racial, and political biases, as well as hate speech, among
others. It spans diverse domains including politics, sports, healthcare,
education, and entertainment, reflecting the complex interplay of biases across
different societal sectors. Through comprehensive statistical analysis, we
identify significant differences in bias expression patterns and intra-domain
bias correlations across these domains. By utilizing our understanding of the
correlations among various bias dimensions, we lay the groundwork for creating
advanced systems capable of detecting multiple biases simultaneously. Overall,
our dataset advances the field of media bias identification, contributing to
the development of tools that promote fairer media consumption. The
comprehensive awareness of existing media bias fosters more ethical journalism,
promotes cultural sensitivity, and supports a more informed and equitable
public discourse.