{"title":"A novel modification approach for the one sample Kolmogorov-Smirnov test in large sample size.","authors":"Ugurcan Sayili, Mehmet Guven Gunver","doi":"10.1080/00365513.2025.2512384","DOIUrl":null,"url":null,"abstract":"<p><p>This study aims to propose and evaluate a modified version of the One-Sample Kolmogorov-Smirnov (K-S) test that addresses its current limitations in large sample groups, with the goal of improving its accuracy and reliability in assessing normality assumptions in medical research data. In addition to the classical K-S test, a logarithmic modification was applied to reduce the impact of sample size. This modification replaces the sample size in the test calculation with a logarithmic formula (ln n<sup>2</sup>) to prevent z-values from becoming excessively small in large samples. Statistical analyses were conducted using Microsoft 365/Excel, SPSS 21.0 and STATA/MP18 with a geometric approach employed to assess data normality using the Geometric Approach to Normality Testing. The study analyzed real-world laboratory data obtained from the complete blood count (CBC) results of 122,310 adult patients (aged ≥18 years) who were treated at Cerrahpaşa Medical Faculty Hospital throughout 2022. The modified K-S test with the proposed logarithmic modification (ln n<sup>2</sup>) reduced the tendency to reject normality solely due to large sample size. The modified test was able to confirm that some hematological parameters did indeed fit normal distribution models, while discriminating those that did not. In particular, analysis of the data set trimmed by 0.5% showed further improvement in test performance. Consequently, the proposed modification is shown to provide a more sensitive method for assessing the assumption of normal distribution in large data sets. The method can be easily integrated into existing statistical software, making it accessible for routine use in large-scale data analysis.</p>","PeriodicalId":21474,"journal":{"name":"Scandinavian Journal of Clinical & Laboratory Investigation","volume":" ","pages":"1-12"},"PeriodicalIF":1.3000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scandinavian Journal of Clinical & Laboratory Investigation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/00365513.2025.2512384","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
This study aims to propose and evaluate a modified version of the One-Sample Kolmogorov-Smirnov (K-S) test that addresses its current limitations in large sample groups, with the goal of improving its accuracy and reliability in assessing normality assumptions in medical research data. In addition to the classical K-S test, a logarithmic modification was applied to reduce the impact of sample size. This modification replaces the sample size in the test calculation with a logarithmic formula (ln n2) to prevent z-values from becoming excessively small in large samples. Statistical analyses were conducted using Microsoft 365/Excel, SPSS 21.0 and STATA/MP18 with a geometric approach employed to assess data normality using the Geometric Approach to Normality Testing. The study analyzed real-world laboratory data obtained from the complete blood count (CBC) results of 122,310 adult patients (aged ≥18 years) who were treated at Cerrahpaşa Medical Faculty Hospital throughout 2022. The modified K-S test with the proposed logarithmic modification (ln n2) reduced the tendency to reject normality solely due to large sample size. The modified test was able to confirm that some hematological parameters did indeed fit normal distribution models, while discriminating those that did not. In particular, analysis of the data set trimmed by 0.5% showed further improvement in test performance. Consequently, the proposed modification is shown to provide a more sensitive method for assessing the assumption of normal distribution in large data sets. The method can be easily integrated into existing statistical software, making it accessible for routine use in large-scale data analysis.
期刊介绍:
The Scandinavian Journal of Clinical and Laboratory Investigation is an international scientific journal covering clinically oriented biochemical and physiological research. Since the launch of the journal in 1949, it has been a forum for international laboratory medicine, closely related to, and edited by, The Scandinavian Society for Clinical Chemistry.
The journal contains peer-reviewed articles, editorials, invited reviews, and short technical notes, as well as several supplements each year. Supplements consist of monographs, and symposium and congress reports covering subjects within clinical chemistry and clinical physiology.