{"title":"Small Sample Estimation of Classification Metrics","authors":"S. Manna","doi":"10.1109/irtm54583.2022.9791645","DOIUrl":null,"url":null,"abstract":"Estimation of classification evaluation metrics for small sample classification data is one of the most challenging task in Machine Learning model building process. The value of an evaluation metric based on the test data is considered to be the generalized performance measure of a classification model. Measuring the performance of a model based only on the test data works well when we have a large data set and both train and test data follow the same population distribution. However, this assumption does not always work for small data set. Therefore, measuring performance of a model based on test data set may mislead us for small sample classification problems. To deal with such situations, in this paper, we propose a modified method to estimate classification evaluation metrics. The new method provides an improved and consistent estimation of classification evaluation metrics.","PeriodicalId":426354,"journal":{"name":"2022 Interdisciplinary Research in Technology and Management (IRTM)","volume":"56 5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Interdisciplinary Research in Technology and Management (IRTM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/irtm54583.2022.9791645","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Estimation of classification evaluation metrics for small sample classification data is one of the most challenging task in Machine Learning model building process. The value of an evaluation metric based on the test data is considered to be the generalized performance measure of a classification model. Measuring the performance of a model based only on the test data works well when we have a large data set and both train and test data follow the same population distribution. However, this assumption does not always work for small data set. Therefore, measuring performance of a model based on test data set may mislead us for small sample classification problems. To deal with such situations, in this paper, we propose a modified method to estimate classification evaluation metrics. The new method provides an improved and consistent estimation of classification evaluation metrics.