{"title":"基于全局最小生成树的多类贝叶斯误差估计","authors":"S. Y. Sekeh, Brandon Oselio, A. Hero","doi":"10.1109/ALLERTON.2018.8635642","DOIUrl":null,"url":null,"abstract":"Henze-Penrose (HP) divergence has been used in many information theory, statistics and machine learning contexts, including the estimation of two-class Bayes classification error. Previous work has shown HP divergence can be directly estimated using the Friedman-Rafsky (FR) multivariate run test statistic. For the multi-class classification problem, HP divergence can also be used to bound the Bayes error by estimating the sum of pairwise Bayes errors between classes. In situations in which the dataset and number of classes are large, this approach is infeasible. In this paper, we present a new generalized measure that allows us to estimate the Bayes error rate without the need to compute pairwise estimates. We compare our new approach with the pairwise HP bound and the bound proposed by Lin [1], and show that our upper bound on Bayes error is tighter, while also having lower computational complexity.","PeriodicalId":299280,"journal":{"name":"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Multi-class Bayes error estimation with a global minimal spanning tree\",\"authors\":\"S. Y. Sekeh, Brandon Oselio, A. Hero\",\"doi\":\"10.1109/ALLERTON.2018.8635642\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Henze-Penrose (HP) divergence has been used in many information theory, statistics and machine learning contexts, including the estimation of two-class Bayes classification error. Previous work has shown HP divergence can be directly estimated using the Friedman-Rafsky (FR) multivariate run test statistic. For the multi-class classification problem, HP divergence can also be used to bound the Bayes error by estimating the sum of pairwise Bayes errors between classes. In situations in which the dataset and number of classes are large, this approach is infeasible. In this paper, we present a new generalized measure that allows us to estimate the Bayes error rate without the need to compute pairwise estimates. We compare our new approach with the pairwise HP bound and the bound proposed by Lin [1], and show that our upper bound on Bayes error is tighter, while also having lower computational complexity.\",\"PeriodicalId\":299280,\"journal\":{\"name\":\"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ALLERTON.2018.8635642\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ALLERTON.2018.8635642","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-class Bayes error estimation with a global minimal spanning tree
Henze-Penrose (HP) divergence has been used in many information theory, statistics and machine learning contexts, including the estimation of two-class Bayes classification error. Previous work has shown HP divergence can be directly estimated using the Friedman-Rafsky (FR) multivariate run test statistic. For the multi-class classification problem, HP divergence can also be used to bound the Bayes error by estimating the sum of pairwise Bayes errors between classes. In situations in which the dataset and number of classes are large, this approach is infeasible. In this paper, we present a new generalized measure that allows us to estimate the Bayes error rate without the need to compute pairwise estimates. We compare our new approach with the pairwise HP bound and the bound proposed by Lin [1], and show that our upper bound on Bayes error is tighter, while also having lower computational complexity.