Comparison of Methods for Testing the Hypothesis of Independence of Random Variables Based on a Nonparametric Classifier and Pearson’s Chi-Squared Test
{"title":"Comparison of Methods for Testing the Hypothesis of Independence of Random Variables Based on a Nonparametric Classifier and Pearson’s Chi-Squared Test","authors":"","doi":"10.3103/s8756699023050047","DOIUrl":null,"url":null,"abstract":"<span> <h3>Abstract</h3> <p>A technique for testing the hypothesis about the independence of random variables, based on a nonparametric pattern recognition algorithm, is used in the analysis of ambiguous dependencies. The pattern recognition algorithm meets the maximum likelihood criterion. The assessment of distribution laws in classes is carried out using initial statistical data under the assumption of independence and dependence of the random variables being compared. To estimate probability densities in classes, nonparametric Rosenblatt–Parzen statistics are used. The blurring coefficients of kernel functions in nonparametric estimates of probability densities in classes are determined from the condition of the minimum of their standard deviations. Under these conditions, estimates of the probabilities of pattern recognition errors in classes are calculated. Based on their minimum value, a decision is made on the independence or dependence of random variables. The hypothesis about a significant difference in the probabilities of pattern recognition errors in classes is tested. The use of the proposed technique allows us to bypass the problem of decomposing the range of values of random variables into intervals, which is characteristic of the Pearson criterion. The effectiveness of the proposed method is compared with the Pearson criterion. The results of computational experiments using the studied criteria in the analysis of ambiguous dependencies between random variables are presented.</p> </span>","PeriodicalId":44919,"journal":{"name":"Optoelectronics Instrumentation and Data Processing","volume":"31 1","pages":""},"PeriodicalIF":0.5000,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optoelectronics Instrumentation and Data Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3103/s8756699023050047","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
A technique for testing the hypothesis about the independence of random variables, based on a nonparametric pattern recognition algorithm, is used in the analysis of ambiguous dependencies. The pattern recognition algorithm meets the maximum likelihood criterion. The assessment of distribution laws in classes is carried out using initial statistical data under the assumption of independence and dependence of the random variables being compared. To estimate probability densities in classes, nonparametric Rosenblatt–Parzen statistics are used. The blurring coefficients of kernel functions in nonparametric estimates of probability densities in classes are determined from the condition of the minimum of their standard deviations. Under these conditions, estimates of the probabilities of pattern recognition errors in classes are calculated. Based on their minimum value, a decision is made on the independence or dependence of random variables. The hypothesis about a significant difference in the probabilities of pattern recognition errors in classes is tested. The use of the proposed technique allows us to bypass the problem of decomposing the range of values of random variables into intervals, which is characteristic of the Pearson criterion. The effectiveness of the proposed method is compared with the Pearson criterion. The results of computational experiments using the studied criteria in the analysis of ambiguous dependencies between random variables are presented.
期刊介绍:
The scope of Optoelectronics, Instrumentation and Data Processing encompasses, but is not restricted to, the following areas: analysis and synthesis of signals and images; artificial intelligence methods; automated measurement systems; physicotechnical foundations of micro- and optoelectronics; optical information technologies; systems and components; modelling in physicotechnical research; laser physics applications; computer networks and data transmission systems. The journal publishes original papers, reviews, and short communications in order to provide the widest possible coverage of latest research and development in its chosen field.