V. Kuzmin, M. Zaliskyi, R. Odarchenko, Oksana Polishchuk, O. Ivanets, O. Shcherbyna
{"title":"Method of Probability Distribution Fitting for Statistical Data with Small Sample Size","authors":"V. Kuzmin, M. Zaliskyi, R. Odarchenko, Oksana Polishchuk, O. Ivanets, O. Shcherbyna","doi":"10.1109/ACIT49673.2020.9208842","DOIUrl":null,"url":null,"abstract":"The paper deals with a new approach for probability distribution fitting for empirical data with small sample size. The proposed method includes three steps: 1) outliers detection and correction; 2) transformation basis calculation; 3) basis function optimization. For the possibility of asymmetric distributions approximation, a piecewise linear basis function is used. During basis function optimization, the dependence of squared deviations sum on switching point abscissa is calculated. The mathematical formula for this dependence can be obtained by quadratic approximation according to the least squares method. The optimum of switching point abscissa coincides with minimum of obtained parabola. Method of probability distribution fitting for statistical data with small sample size is illustrated on the real empirical data example. For this example the best probability distribution fitting corresponds to the case of optimized piecewise linear basis function.","PeriodicalId":372744,"journal":{"name":"2020 10th International Conference on Advanced Computer Information Technologies (ACIT)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 10th International Conference on Advanced Computer Information Technologies (ACIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACIT49673.2020.9208842","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
The paper deals with a new approach for probability distribution fitting for empirical data with small sample size. The proposed method includes three steps: 1) outliers detection and correction; 2) transformation basis calculation; 3) basis function optimization. For the possibility of asymmetric distributions approximation, a piecewise linear basis function is used. During basis function optimization, the dependence of squared deviations sum on switching point abscissa is calculated. The mathematical formula for this dependence can be obtained by quadratic approximation according to the least squares method. The optimum of switching point abscissa coincides with minimum of obtained parabola. Method of probability distribution fitting for statistical data with small sample size is illustrated on the real empirical data example. For this example the best probability distribution fitting corresponds to the case of optimized piecewise linear basis function.