{"title":"职业不平衡对LVQ分类的影响","authors":"Rahmad Abdillah, Suwanto Sanjaya, Iis Afrianty","doi":"10.1109/ICon-EEI.2018.8784330","DOIUrl":null,"url":null,"abstract":"Accuracy is a measure of the capability of an algorithm, and studies use different classification methods to improve this benchmark. However, improper data collection adversely affects accuracy. In this study, we discuss how to influence the accuracy of data collection mechanisms. The learning vector quantization (LVQ) algorithm is tested to determine the effect of data sampling on accuracy. Training and test data are gathered in the data collection process. Results show that sampling techniques and retrieval of training and test data influence the accuracy of the LVQ classification method. Therefore, the chosen sampling technique can improve accuracy relative to overall data usage.","PeriodicalId":114952,"journal":{"name":"2018 2nd International Conference on Electrical Engineering and Informatics (ICon EEI)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"The Effect of Class Imbalance Against LVQ Classification\",\"authors\":\"Rahmad Abdillah, Suwanto Sanjaya, Iis Afrianty\",\"doi\":\"10.1109/ICon-EEI.2018.8784330\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accuracy is a measure of the capability of an algorithm, and studies use different classification methods to improve this benchmark. However, improper data collection adversely affects accuracy. In this study, we discuss how to influence the accuracy of data collection mechanisms. The learning vector quantization (LVQ) algorithm is tested to determine the effect of data sampling on accuracy. Training and test data are gathered in the data collection process. Results show that sampling techniques and retrieval of training and test data influence the accuracy of the LVQ classification method. Therefore, the chosen sampling technique can improve accuracy relative to overall data usage.\",\"PeriodicalId\":114952,\"journal\":{\"name\":\"2018 2nd International Conference on Electrical Engineering and Informatics (ICon EEI)\",\"volume\":\"55 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 2nd International Conference on Electrical Engineering and Informatics (ICon EEI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICon-EEI.2018.8784330\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 2nd International Conference on Electrical Engineering and Informatics (ICon EEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICon-EEI.2018.8784330","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The Effect of Class Imbalance Against LVQ Classification
Accuracy is a measure of the capability of an algorithm, and studies use different classification methods to improve this benchmark. However, improper data collection adversely affects accuracy. In this study, we discuss how to influence the accuracy of data collection mechanisms. The learning vector quantization (LVQ) algorithm is tested to determine the effect of data sampling on accuracy. Training and test data are gathered in the data collection process. Results show that sampling techniques and retrieval of training and test data influence the accuracy of the LVQ classification method. Therefore, the chosen sampling technique can improve accuracy relative to overall data usage.