{"title":"相互通知相关系数(MICC)——一种新的基于滤波器的特征选择方法","authors":"Ritam Guha, K. Ghosh, Showmik Bhowmik, R. Sarkar","doi":"10.1109/CALCON49167.2020.9106516","DOIUrl":null,"url":null,"abstract":"Feature selection (FS) is a well-explored domain of data pre-processing and information theory. It is the process of selecting important features from a high-dimensional feature vectors possibly having many redundant and/or non-informative features. In this paper, we have proposed a score-based filter FS approach known as Mutually Informed Correlation Coefficient (MICC) by combining two popular statistical dependence measures namely Mutual Information (MI) and Pearson Correlation Coefficient (PCC). We have evaluated MICC on different variations of Local Binary Pattern (LBP) based feature vectors used for classifying the components of handwritten document images as text or non-text. We have compared the results with some popular filter methods namely Gini Index, T-test, ReliefF, along with MI and PCC individually. The results and corresponding comparisons show that our proposed method not only does FS efficiently but also enhances the recognition accuracy of the said classification problem. The code of the proposed algorithm can be found in this link: MICC.","PeriodicalId":318478,"journal":{"name":"2020 IEEE Calcutta Conference (CALCON)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"Mutually Informed Correlation Coefficient (MICC) - a New Filter Based Feature Selection Method\",\"authors\":\"Ritam Guha, K. Ghosh, Showmik Bhowmik, R. Sarkar\",\"doi\":\"10.1109/CALCON49167.2020.9106516\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Feature selection (FS) is a well-explored domain of data pre-processing and information theory. It is the process of selecting important features from a high-dimensional feature vectors possibly having many redundant and/or non-informative features. In this paper, we have proposed a score-based filter FS approach known as Mutually Informed Correlation Coefficient (MICC) by combining two popular statistical dependence measures namely Mutual Information (MI) and Pearson Correlation Coefficient (PCC). We have evaluated MICC on different variations of Local Binary Pattern (LBP) based feature vectors used for classifying the components of handwritten document images as text or non-text. We have compared the results with some popular filter methods namely Gini Index, T-test, ReliefF, along with MI and PCC individually. The results and corresponding comparisons show that our proposed method not only does FS efficiently but also enhances the recognition accuracy of the said classification problem. The code of the proposed algorithm can be found in this link: MICC.\",\"PeriodicalId\":318478,\"journal\":{\"name\":\"2020 IEEE Calcutta Conference (CALCON)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE Calcutta Conference (CALCON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CALCON49167.2020.9106516\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE Calcutta Conference (CALCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CALCON49167.2020.9106516","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Mutually Informed Correlation Coefficient (MICC) - a New Filter Based Feature Selection Method
Feature selection (FS) is a well-explored domain of data pre-processing and information theory. It is the process of selecting important features from a high-dimensional feature vectors possibly having many redundant and/or non-informative features. In this paper, we have proposed a score-based filter FS approach known as Mutually Informed Correlation Coefficient (MICC) by combining two popular statistical dependence measures namely Mutual Information (MI) and Pearson Correlation Coefficient (PCC). We have evaluated MICC on different variations of Local Binary Pattern (LBP) based feature vectors used for classifying the components of handwritten document images as text or non-text. We have compared the results with some popular filter methods namely Gini Index, T-test, ReliefF, along with MI and PCC individually. The results and corresponding comparisons show that our proposed method not only does FS efficiently but also enhances the recognition accuracy of the said classification problem. The code of the proposed algorithm can be found in this link: MICC.