{"title":"推特谣言验证的分层数据增强","authors":"Zhouyi Wang","doi":"10.1109/ICCWAMTIP53232.2021.9674119","DOIUrl":null,"url":null,"abstract":"Unlimited dissemination of rumors in social media has a tremendous negative impact on our society. To address this issue, many rumor verification models have been proposed and achieved reasonable verification performance. However, the imbalanced data distribution between samples heavily limit the further prosperity of the deep learning-based models. To alleviate challenges, we propose a novel hierarchical data augmentation method for the rumor verification task (termed as HDA-RV), which consists two data augmentation methods (tweet-level and thread-level data augmentation). Tweet-level data augmentation simulates the noise of text information in social media and thread-level data augmentation corresponds to the noise of the propagation structure in social networks. Experiments on the PHEME dataset show that our method can effectively alleviate the problem of data imbalance.","PeriodicalId":358772,"journal":{"name":"2021 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hierarchical Data Augmentation for Rumor Verification on Twitter\",\"authors\":\"Zhouyi Wang\",\"doi\":\"10.1109/ICCWAMTIP53232.2021.9674119\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Unlimited dissemination of rumors in social media has a tremendous negative impact on our society. To address this issue, many rumor verification models have been proposed and achieved reasonable verification performance. However, the imbalanced data distribution between samples heavily limit the further prosperity of the deep learning-based models. To alleviate challenges, we propose a novel hierarchical data augmentation method for the rumor verification task (termed as HDA-RV), which consists two data augmentation methods (tweet-level and thread-level data augmentation). Tweet-level data augmentation simulates the noise of text information in social media and thread-level data augmentation corresponds to the noise of the propagation structure in social networks. Experiments on the PHEME dataset show that our method can effectively alleviate the problem of data imbalance.\",\"PeriodicalId\":358772,\"journal\":{\"name\":\"2021 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCWAMTIP53232.2021.9674119\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 18th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCWAMTIP53232.2021.9674119","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Hierarchical Data Augmentation for Rumor Verification on Twitter
Unlimited dissemination of rumors in social media has a tremendous negative impact on our society. To address this issue, many rumor verification models have been proposed and achieved reasonable verification performance. However, the imbalanced data distribution between samples heavily limit the further prosperity of the deep learning-based models. To alleviate challenges, we propose a novel hierarchical data augmentation method for the rumor verification task (termed as HDA-RV), which consists two data augmentation methods (tweet-level and thread-level data augmentation). Tweet-level data augmentation simulates the noise of text information in social media and thread-level data augmentation corresponds to the noise of the propagation structure in social networks. Experiments on the PHEME dataset show that our method can effectively alleviate the problem of data imbalance.