{"title":"我们相信他们说的话还是做的事?多模态用户嵌入提供个性化解释","authors":"Zhicheng Ren, Zhiping Xiao, Yizhou Sun","doi":"arxiv-2409.02965","DOIUrl":null,"url":null,"abstract":"With the rapid development of social media, the importance of analyzing\nsocial network user data has also been put on the agenda. User representation\nlearning in social media is a critical area of research, based on which we can\nconduct personalized content delivery, or detect malicious actors. Being more\ncomplicated than many other types of data, social network user data has\ninherent multimodal nature. Various multimodal approaches have been proposed to\nharness both text (i.e. post content) and relation (i.e. inter-user\ninteraction) information to learn user embeddings of higher quality. The advent\nof Graph Neural Network models enables more end-to-end integration of user text\nembeddings and user interaction graphs in social networks. However, most of\nthose approaches do not adequately elucidate which aspects of the data - text\nor graph structure information - are more helpful for predicting each specific\nuser under a particular task, putting some burden on personalized downstream\nanalysis and untrustworthy information filtering. We propose a simple yet\neffective framework called Contribution-Aware Multimodal User Embedding (CAMUE)\nfor social networks. We have demonstrated with empirical evidence, that our\napproach can provide personalized explainable predictions, automatically\nmitigating the impact of unreliable information. We also conducted case studies\nto show how reasonable our results are. We observe that for most users, graph\nstructure information is more trustworthy than text information, but there are\nsome reasonable cases where text helps more. Our work paves the way for more\nexplainable, reliable, and effective social media user embedding which allows\nfor better personalized content delivery.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Do We Trust What They Say or What They Do? A Multimodal User Embedding Provides Personalized Explanations\",\"authors\":\"Zhicheng Ren, Zhiping Xiao, Yizhou Sun\",\"doi\":\"arxiv-2409.02965\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the rapid development of social media, the importance of analyzing\\nsocial network user data has also been put on the agenda. User representation\\nlearning in social media is a critical area of research, based on which we can\\nconduct personalized content delivery, or detect malicious actors. Being more\\ncomplicated than many other types of data, social network user data has\\ninherent multimodal nature. Various multimodal approaches have been proposed to\\nharness both text (i.e. post content) and relation (i.e. inter-user\\ninteraction) information to learn user embeddings of higher quality. The advent\\nof Graph Neural Network models enables more end-to-end integration of user text\\nembeddings and user interaction graphs in social networks. However, most of\\nthose approaches do not adequately elucidate which aspects of the data - text\\nor graph structure information - are more helpful for predicting each specific\\nuser under a particular task, putting some burden on personalized downstream\\nanalysis and untrustworthy information filtering. We propose a simple yet\\neffective framework called Contribution-Aware Multimodal User Embedding (CAMUE)\\nfor social networks. We have demonstrated with empirical evidence, that our\\napproach can provide personalized explainable predictions, automatically\\nmitigating the impact of unreliable information. We also conducted case studies\\nto show how reasonable our results are. We observe that for most users, graph\\nstructure information is more trustworthy than text information, but there are\\nsome reasonable cases where text helps more. Our work paves the way for more\\nexplainable, reliable, and effective social media user embedding which allows\\nfor better personalized content delivery.\",\"PeriodicalId\":501032,\"journal\":{\"name\":\"arXiv - CS - Social and Information Networks\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Social and Information Networks\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.02965\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Social and Information Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.02965","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Do We Trust What They Say or What They Do? A Multimodal User Embedding Provides Personalized Explanations
With the rapid development of social media, the importance of analyzing
social network user data has also been put on the agenda. User representation
learning in social media is a critical area of research, based on which we can
conduct personalized content delivery, or detect malicious actors. Being more
complicated than many other types of data, social network user data has
inherent multimodal nature. Various multimodal approaches have been proposed to
harness both text (i.e. post content) and relation (i.e. inter-user
interaction) information to learn user embeddings of higher quality. The advent
of Graph Neural Network models enables more end-to-end integration of user text
embeddings and user interaction graphs in social networks. However, most of
those approaches do not adequately elucidate which aspects of the data - text
or graph structure information - are more helpful for predicting each specific
user under a particular task, putting some burden on personalized downstream
analysis and untrustworthy information filtering. We propose a simple yet
effective framework called Contribution-Aware Multimodal User Embedding (CAMUE)
for social networks. We have demonstrated with empirical evidence, that our
approach can provide personalized explainable predictions, automatically
mitigating the impact of unreliable information. We also conducted case studies
to show how reasonable our results are. We observe that for most users, graph
structure information is more trustworthy than text information, but there are
some reasonable cases where text helps more. Our work paves the way for more
explainable, reliable, and effective social media user embedding which allows
for better personalized content delivery.