A novel modification approach for the one sample Kolmogorov-Smirnov test in large sample size.

IF 1.3 4区 医学 Q4 MEDICINE, RESEARCH & EXPERIMENTAL
Ugurcan Sayili, Mehmet Guven Gunver
{"title":"A novel modification approach for the one sample Kolmogorov-Smirnov test in large sample size.","authors":"Ugurcan Sayili, Mehmet Guven Gunver","doi":"10.1080/00365513.2025.2512384","DOIUrl":null,"url":null,"abstract":"<p><p>This study aims to propose and evaluate a modified version of the One-Sample Kolmogorov-Smirnov (K-S) test that addresses its current limitations in large sample groups, with the goal of improving its accuracy and reliability in assessing normality assumptions in medical research data. In addition to the classical K-S test, a logarithmic modification was applied to reduce the impact of sample size. This modification replaces the sample size in the test calculation with a logarithmic formula (ln n<sup>2</sup>) to prevent z-values from becoming excessively small in large samples. Statistical analyses were conducted using Microsoft 365/Excel, SPSS 21.0 and STATA/MP18 with a geometric approach employed to assess data normality using the Geometric Approach to Normality Testing. The study analyzed real-world laboratory data obtained from the complete blood count (CBC) results of 122,310 adult patients (aged ≥18 years) who were treated at Cerrahpaşa Medical Faculty Hospital throughout 2022. The modified K-S test with the proposed logarithmic modification (ln n<sup>2</sup>) reduced the tendency to reject normality solely due to large sample size. The modified test was able to confirm that some hematological parameters did indeed fit normal distribution models, while discriminating those that did not. In particular, analysis of the data set trimmed by 0.5% showed further improvement in test performance. Consequently, the proposed modification is shown to provide a more sensitive method for assessing the assumption of normal distribution in large data sets. The method can be easily integrated into existing statistical software, making it accessible for routine use in large-scale data analysis.</p>","PeriodicalId":21474,"journal":{"name":"Scandinavian Journal of Clinical & Laboratory Investigation","volume":" ","pages":"1-12"},"PeriodicalIF":1.3000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scandinavian Journal of Clinical & Laboratory Investigation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/00365513.2025.2512384","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

This study aims to propose and evaluate a modified version of the One-Sample Kolmogorov-Smirnov (K-S) test that addresses its current limitations in large sample groups, with the goal of improving its accuracy and reliability in assessing normality assumptions in medical research data. In addition to the classical K-S test, a logarithmic modification was applied to reduce the impact of sample size. This modification replaces the sample size in the test calculation with a logarithmic formula (ln n2) to prevent z-values from becoming excessively small in large samples. Statistical analyses were conducted using Microsoft 365/Excel, SPSS 21.0 and STATA/MP18 with a geometric approach employed to assess data normality using the Geometric Approach to Normality Testing. The study analyzed real-world laboratory data obtained from the complete blood count (CBC) results of 122,310 adult patients (aged ≥18 years) who were treated at Cerrahpaşa Medical Faculty Hospital throughout 2022. The modified K-S test with the proposed logarithmic modification (ln n2) reduced the tendency to reject normality solely due to large sample size. The modified test was able to confirm that some hematological parameters did indeed fit normal distribution models, while discriminating those that did not. In particular, analysis of the data set trimmed by 0.5% showed further improvement in test performance. Consequently, the proposed modification is shown to provide a more sensitive method for assessing the assumption of normal distribution in large data sets. The method can be easily integrated into existing statistical software, making it accessible for routine use in large-scale data analysis.

大样本量单样本Kolmogorov-Smirnov检验的一种新的修正方法。
本研究旨在提出并评估单样本Kolmogorov-Smirnov (K-S)检验的改进版本,以解决其目前在大样本组中的局限性,目的是提高其在评估医学研究数据中正态性假设时的准确性和可靠性。除了经典的K-S检验外,还采用对数修正来减少样本量的影响。这种修改将测试计算中的样本量替换为对数公式(ln n2),以防止z值在大样本中变得过小。采用Microsoft 365/Excel、SPSS 21.0和STATA/MP18进行统计分析,采用几何方法评估数据正态性,采用几何方法进行正态性检验。该研究分析了2022年期间在cerrahpa医学院医院接受治疗的122,310名成年患者(年龄≥18岁)的全血细胞计数(CBC)结果获得的真实实验室数据。采用对数修正(ln n2)的改进K-S检验降低了仅仅由于样本量大而拒绝正态性的倾向。修改后的检验能够确认某些血液学参数确实符合正态分布模型,而区分那些不符合正态分布模型的参数。特别地,对数据集的分析显示,测试性能进一步提高了0.5%。结果表明,所提出的修正为评估大数据集的正态分布假设提供了一种更灵敏的方法。该方法可以很容易地集成到现有的统计软件中,使其可用于大规模数据分析的常规使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.50
自引率
4.80%
发文量
85
审稿时长
4-8 weeks
期刊介绍: The Scandinavian Journal of Clinical and Laboratory Investigation is an international scientific journal covering clinically oriented biochemical and physiological research. Since the launch of the journal in 1949, it has been a forum for international laboratory medicine, closely related to, and edited by, The Scandinavian Society for Clinical Chemistry. The journal contains peer-reviewed articles, editorials, invited reviews, and short technical notes, as well as several supplements each year. Supplements consist of monographs, and symposium and congress reports covering subjects within clinical chemistry and clinical physiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信