{"title":"基于贝叶斯推理的假新闻检测","authors":"Fatma Najar, Nuha Zamzami, N. Bouguila","doi":"10.1109/IRI.2019.00066","DOIUrl":null,"url":null,"abstract":"Given the huge volume of information available on social media, making a distinction between false information and a real one is a challenging task. In fact, several statistical models dealing with this problem are based on multinomial distributions. However, a new family of distributions that is an exponential family approximation to the Dirichlet Compound Multinomial (EDCM) has been introduced to be more adjustable to high-dimensional data and to overcome the drawbacks of the multinomial assumption. Thus, in this paper, we tackle the problem of fake news detection using finite mixture models of EDCM distributions. In particular, we develop a Bayesian approach based on Markov Chain Monte Carlo and Metropolis-Hastings algorithm for the learning of these mixture models. The proposed method is validated via extensive simulations and a comparison with multinomial-based mixture models is provided.","PeriodicalId":295028,"journal":{"name":"2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"122 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Fake News Detection Using Bayesian Inference\",\"authors\":\"Fatma Najar, Nuha Zamzami, N. Bouguila\",\"doi\":\"10.1109/IRI.2019.00066\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Given the huge volume of information available on social media, making a distinction between false information and a real one is a challenging task. In fact, several statistical models dealing with this problem are based on multinomial distributions. However, a new family of distributions that is an exponential family approximation to the Dirichlet Compound Multinomial (EDCM) has been introduced to be more adjustable to high-dimensional data and to overcome the drawbacks of the multinomial assumption. Thus, in this paper, we tackle the problem of fake news detection using finite mixture models of EDCM distributions. In particular, we develop a Bayesian approach based on Markov Chain Monte Carlo and Metropolis-Hastings algorithm for the learning of these mixture models. The proposed method is validated via extensive simulations and a comparison with multinomial-based mixture models is provided.\",\"PeriodicalId\":295028,\"journal\":{\"name\":\"2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI)\",\"volume\":\"122 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IRI.2019.00066\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI.2019.00066","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Given the huge volume of information available on social media, making a distinction between false information and a real one is a challenging task. In fact, several statistical models dealing with this problem are based on multinomial distributions. However, a new family of distributions that is an exponential family approximation to the Dirichlet Compound Multinomial (EDCM) has been introduced to be more adjustable to high-dimensional data and to overcome the drawbacks of the multinomial assumption. Thus, in this paper, we tackle the problem of fake news detection using finite mixture models of EDCM distributions. In particular, we develop a Bayesian approach based on Markov Chain Monte Carlo and Metropolis-Hastings algorithm for the learning of these mixture models. The proposed method is validated via extensive simulations and a comparison with multinomial-based mixture models is provided.