Yue Wang , Shizhong Yuan , Weimin Li , Yifan Feng , Xiao Yu , Fangfang Liu , Can Wang , Quanke Pan
{"title":"Heterogeneous network for Hierarchical Fine-Grained Domain Fake News Detection","authors":"Yue Wang , Shizhong Yuan , Weimin Li , Yifan Feng , Xiao Yu , Fangfang Liu , Can Wang , Quanke Pan","doi":"10.1016/j.ipm.2025.104141","DOIUrl":null,"url":null,"abstract":"<div><div>Fake news on social media has significant negative consequences for individuals and society. However, existing multi-domain detection methods exhibit two primary limitations: dependence on precise domain annotation and inherent bias arising from single-domain categorization. To address these challenges, this paper introduces the Domain-Specific Narrow-Coverage Tree-Based Taxonomy (<span><math><msup><mrow><mtext>DNT</mtext></mrow><mrow><mn>2</mn></mrow></msup></math></span>), which enables more precise domain classification and domain relationship elucidation through refined categories. The constructed dataset is annotated with multiple labels by Large Language Models (LLMs), mitigating reliance on manual efforts and reducing annotation costs while maintaining annotation quality. Furthermore, a Hierarchical Fine-Grained Domain (HFGD) Fake News Detection Method is proposed, which explicitly employs a heterogeneous network to model multi-relationships. This method can mitigate domain bias and comprehensively capture news diversity and domain interactions. Specifically, domain cohesion based on news semantics is designed to reflect the relevance of news within a domain. News items are integrated as intersection nodes into the tree structure of multilevel domains to construct the heterogeneous network. Graph representation learning then fuses directly or indirectly connected news and domain information during feature enhancement. Finally, a composite loss is designed for news and domain node classification. HFGD captures potential differences and commonalities in domains and enhances label adaptation through domain interactions. Experiments on our dataset demonstrate that HFGD outperforms state-of-the-art methods by 1.08% and 0.91% in overall accuracy and macro-F1 score, respectively. Specifically, in the education and military domains with limited sample sizes, HFGD achieves 5.74% and 4.1% improvements in macro-F1 score over the second-best method. The results demonstrate our method’s effectiveness in mitigating domain bias and enhancing detection performance, providing valuable insights for practical multi-domain fake news detection systems.</div></div>","PeriodicalId":50365,"journal":{"name":"Information Processing & Management","volume":"62 4","pages":"Article 104141"},"PeriodicalIF":7.4000,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing & Management","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306457325000822","RegionNum":1,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Fake news on social media has significant negative consequences for individuals and society. However, existing multi-domain detection methods exhibit two primary limitations: dependence on precise domain annotation and inherent bias arising from single-domain categorization. To address these challenges, this paper introduces the Domain-Specific Narrow-Coverage Tree-Based Taxonomy (), which enables more precise domain classification and domain relationship elucidation through refined categories. The constructed dataset is annotated with multiple labels by Large Language Models (LLMs), mitigating reliance on manual efforts and reducing annotation costs while maintaining annotation quality. Furthermore, a Hierarchical Fine-Grained Domain (HFGD) Fake News Detection Method is proposed, which explicitly employs a heterogeneous network to model multi-relationships. This method can mitigate domain bias and comprehensively capture news diversity and domain interactions. Specifically, domain cohesion based on news semantics is designed to reflect the relevance of news within a domain. News items are integrated as intersection nodes into the tree structure of multilevel domains to construct the heterogeneous network. Graph representation learning then fuses directly or indirectly connected news and domain information during feature enhancement. Finally, a composite loss is designed for news and domain node classification. HFGD captures potential differences and commonalities in domains and enhances label adaptation through domain interactions. Experiments on our dataset demonstrate that HFGD outperforms state-of-the-art methods by 1.08% and 0.91% in overall accuracy and macro-F1 score, respectively. Specifically, in the education and military domains with limited sample sizes, HFGD achieves 5.74% and 4.1% improvements in macro-F1 score over the second-best method. The results demonstrate our method’s effectiveness in mitigating domain bias and enhancing detection performance, providing valuable insights for practical multi-domain fake news detection systems.
期刊介绍:
Information Processing and Management is dedicated to publishing cutting-edge original research at the convergence of computing and information science. Our scope encompasses theory, methods, and applications across various domains, including advertising, business, health, information science, information technology marketing, and social computing.
We aim to cater to the interests of both primary researchers and practitioners by offering an effective platform for the timely dissemination of advanced and topical issues in this interdisciplinary field. The journal places particular emphasis on original research articles, research survey articles, research method articles, and articles addressing critical applications of research. Join us in advancing knowledge and innovation at the intersection of computing and information science.