S. Jozová, M. Matowicki, O. Přibyl, M. Zachová, Sathaporn Opasanon, R. Ziółkowski
{"title":"On the analysis of discrete data finding dependencies in small sample sizes","authors":"S. Jozová, M. Matowicki, O. Přibyl, M. Zachová, Sathaporn Opasanon, R. Ziółkowski","doi":"10.14311/nnw.2021.31.017","DOIUrl":null,"url":null,"abstract":"An analysis of survey data is a fundamental part of research concerning various aspects of human behavior. Such survey data are often discrete, and the size of the collected sample is regularly insufficient for the most potent modelling tools such as logistic regression, clustering, and other data mining techniques. In this paper, we take a closer look at the results of the stated preference survey analyzing how inhabitants of cities in Thailand, Poland, and Czechia understand and perceive “smartness” of a city. An international survey was conducted, where respondents were asked 15 questions. Since the most common data modelling tools failed to provide a useful insight into the relationship between variables, so-called lambda coefficient was used and its usefulness for such challenging data was verified. It uses the principle of conditional probability and proves to be truly useful even in data sets with relatively small sample size.","PeriodicalId":49765,"journal":{"name":"Neural Network World","volume":"1 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Network World","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.14311/nnw.2021.31.017","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 2
Abstract
An analysis of survey data is a fundamental part of research concerning various aspects of human behavior. Such survey data are often discrete, and the size of the collected sample is regularly insufficient for the most potent modelling tools such as logistic regression, clustering, and other data mining techniques. In this paper, we take a closer look at the results of the stated preference survey analyzing how inhabitants of cities in Thailand, Poland, and Czechia understand and perceive “smartness” of a city. An international survey was conducted, where respondents were asked 15 questions. Since the most common data modelling tools failed to provide a useful insight into the relationship between variables, so-called lambda coefficient was used and its usefulness for such challenging data was verified. It uses the principle of conditional probability and proves to be truly useful even in data sets with relatively small sample size.
期刊介绍:
Neural Network World is a bimonthly journal providing the latest developments in the field of informatics with attention mainly devoted to the problems of:
brain science,
theory and applications of neural networks (both artificial and natural),
fuzzy-neural systems,
methods and applications of evolutionary algorithms,
methods of parallel and mass-parallel computing,
problems of soft-computing,
methods of artificial intelligence.