Tanishi Srivastava, Dristi De, Prerna Sharma, Debarka Sengupta
{"title":"基于经验Copula的朴素贝叶斯分类器","authors":"Tanishi Srivastava, Dristi De, Prerna Sharma, Debarka Sengupta","doi":"10.1109/ICECONF57129.2023.10083573","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a Bayesian model with enhanced performance on statistical datasets by incorporating the concept of Empirical copulas to compute the joint probability distribution of features present in the data. Copulas are defined as cumulative distribution functions deemed popular in highdimensional statistical applications since they easily enable one to model and estimate the distribution of random vectors by estimating the marginals and copulae separately. The key idea of this method is to replace the joint probability, which is defined as the probability of occurrence of two or more simultaneous events, with the cumulative distribution generated by the nonparametric empirical copula function and utilize it on bivariate and multivariate data to assess the performance of the model thus generated. Through extensive research on the topic of nonparametric empirical copulas and tuning the model with various smoothing techniques, we have achieved significant accuracy with a more robust statistical hold in the predictive analysis of different datasets in comparison to the simple Gaussian Naïve Bayes technique.","PeriodicalId":436733,"journal":{"name":"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Empirical Copula based Naive Bayes Classifier\",\"authors\":\"Tanishi Srivastava, Dristi De, Prerna Sharma, Debarka Sengupta\",\"doi\":\"10.1109/ICECONF57129.2023.10083573\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a Bayesian model with enhanced performance on statistical datasets by incorporating the concept of Empirical copulas to compute the joint probability distribution of features present in the data. Copulas are defined as cumulative distribution functions deemed popular in highdimensional statistical applications since they easily enable one to model and estimate the distribution of random vectors by estimating the marginals and copulae separately. The key idea of this method is to replace the joint probability, which is defined as the probability of occurrence of two or more simultaneous events, with the cumulative distribution generated by the nonparametric empirical copula function and utilize it on bivariate and multivariate data to assess the performance of the model thus generated. Through extensive research on the topic of nonparametric empirical copulas and tuning the model with various smoothing techniques, we have achieved significant accuracy with a more robust statistical hold in the predictive analysis of different datasets in comparison to the simple Gaussian Naïve Bayes technique.\",\"PeriodicalId\":436733,\"journal\":{\"name\":\"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECONF57129.2023.10083573\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Artificial Intelligence and Knowledge Discovery in Concurrent Engineering (ICECONF)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECONF57129.2023.10083573","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In this paper, we propose a Bayesian model with enhanced performance on statistical datasets by incorporating the concept of Empirical copulas to compute the joint probability distribution of features present in the data. Copulas are defined as cumulative distribution functions deemed popular in highdimensional statistical applications since they easily enable one to model and estimate the distribution of random vectors by estimating the marginals and copulae separately. The key idea of this method is to replace the joint probability, which is defined as the probability of occurrence of two or more simultaneous events, with the cumulative distribution generated by the nonparametric empirical copula function and utilize it on bivariate and multivariate data to assess the performance of the model thus generated. Through extensive research on the topic of nonparametric empirical copulas and tuning the model with various smoothing techniques, we have achieved significant accuracy with a more robust statistical hold in the predictive analysis of different datasets in comparison to the simple Gaussian Naïve Bayes technique.