{"title":"一种新的Parzen窗口方法中代表性数据样本选择机制","authors":"Jianhong Ni, J. Wang, Xiaolin Li","doi":"10.1109/ICSESS.2012.6269434","DOIUrl":null,"url":null,"abstract":"Based on the experimental observations and theoretical analysis, we validate that the significant increase of data samples may not bring about the obvious improvement of estimation performance of Parzen windows method. Thus, in this paper, we discuss a new mechanism of selecting representative data samples for Parzen windows method. An importance degree function is defined to evaluate the importance of data sample. Then, a decision threshold is optimized based on particle swarm optimization (PSO) algorithm. The data samples whose importance degrees are larger than the optimized decision threshold will be selected as the representations to estimate the underlying probability density function (PDF). Finally, the experimental results on the designed datasets obeying Uniform, Normal, Exponential, and Rayleigh distributions show that the estimation of PDF by using the representative data samples can obtain the same estimation errors (the two-tailed t-test with 95% confidence level) compared with the estimation on whole dataset. Meanwhile, the computational complexity of using representative data samples to estimate PDF is decreased evidently.","PeriodicalId":205738,"journal":{"name":"2012 IEEE International Conference on Computer Science and Automation Engineering","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A new mechanism of selecting representative data samples for Parzen windows method\",\"authors\":\"Jianhong Ni, J. Wang, Xiaolin Li\",\"doi\":\"10.1109/ICSESS.2012.6269434\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Based on the experimental observations and theoretical analysis, we validate that the significant increase of data samples may not bring about the obvious improvement of estimation performance of Parzen windows method. Thus, in this paper, we discuss a new mechanism of selecting representative data samples for Parzen windows method. An importance degree function is defined to evaluate the importance of data sample. Then, a decision threshold is optimized based on particle swarm optimization (PSO) algorithm. The data samples whose importance degrees are larger than the optimized decision threshold will be selected as the representations to estimate the underlying probability density function (PDF). Finally, the experimental results on the designed datasets obeying Uniform, Normal, Exponential, and Rayleigh distributions show that the estimation of PDF by using the representative data samples can obtain the same estimation errors (the two-tailed t-test with 95% confidence level) compared with the estimation on whole dataset. Meanwhile, the computational complexity of using representative data samples to estimate PDF is decreased evidently.\",\"PeriodicalId\":205738,\"journal\":{\"name\":\"2012 IEEE International Conference on Computer Science and Automation Engineering\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-06-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Conference on Computer Science and Automation Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSESS.2012.6269434\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Conference on Computer Science and Automation Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS.2012.6269434","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A new mechanism of selecting representative data samples for Parzen windows method
Based on the experimental observations and theoretical analysis, we validate that the significant increase of data samples may not bring about the obvious improvement of estimation performance of Parzen windows method. Thus, in this paper, we discuss a new mechanism of selecting representative data samples for Parzen windows method. An importance degree function is defined to evaluate the importance of data sample. Then, a decision threshold is optimized based on particle swarm optimization (PSO) algorithm. The data samples whose importance degrees are larger than the optimized decision threshold will be selected as the representations to estimate the underlying probability density function (PDF). Finally, the experimental results on the designed datasets obeying Uniform, Normal, Exponential, and Rayleigh distributions show that the estimation of PDF by using the representative data samples can obtain the same estimation errors (the two-tailed t-test with 95% confidence level) compared with the estimation on whole dataset. Meanwhile, the computational complexity of using representative data samples to estimate PDF is decreased evidently.