Statistical Issues in Small and Large Sample: Need of Optimum Upper Bound for the Sample Size

Subramanian Chandrasekharan, J. Sreedharan, A. Gopakumar
{"title":"Statistical Issues in Small and Large Sample: Need of Optimum Upper Bound for the Sample Size","authors":"Subramanian Chandrasekharan, J. Sreedharan, A. Gopakumar","doi":"10.12785/ijcts/060201","DOIUrl":null,"url":null,"abstract":"As fewer samples are meaningless and lead to fallacious conclusions, researchers are used to calculate minimum sample size before the conduct of any study. Although the larger samples can yield more accurate results, an extent for maximum sample size is not fixed. Though large samples are able to give précised and accurate estimates, the studies that collect more samples than the minimum required, may lead to fallacious conclusions. Generally, the test statistics are increasing functions of sample size and limit of the p value (as ‘n’ tents to infinity) results the statistical significance. The current paper investigated the pattern of changes in the estimates and testing results for varying sample sizes. The assessment of this type of patterns in the data and an extended study on this topic will help to find an interval for the sample size. Study concluded with a finding that larger sample does not make differences on the values of descriptive statistics, but has significant impact on the values of inferential statistics and therefore an upper bound for the sample size needs to be fixed. Hence this article gives relevant information about the need of finding adequate sample size interval (n1, n2) within which valid statistical conclusions can be derived, that assures significance of real difference.","PeriodicalId":130559,"journal":{"name":"International Journal of Computational & Theoretical Statistics","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computational & Theoretical Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.12785/ijcts/060201","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

As fewer samples are meaningless and lead to fallacious conclusions, researchers are used to calculate minimum sample size before the conduct of any study. Although the larger samples can yield more accurate results, an extent for maximum sample size is not fixed. Though large samples are able to give précised and accurate estimates, the studies that collect more samples than the minimum required, may lead to fallacious conclusions. Generally, the test statistics are increasing functions of sample size and limit of the p value (as ‘n’ tents to infinity) results the statistical significance. The current paper investigated the pattern of changes in the estimates and testing results for varying sample sizes. The assessment of this type of patterns in the data and an extended study on this topic will help to find an interval for the sample size. Study concluded with a finding that larger sample does not make differences on the values of descriptive statistics, but has significant impact on the values of inferential statistics and therefore an upper bound for the sample size needs to be fixed. Hence this article gives relevant information about the need of finding adequate sample size interval (n1, n2) within which valid statistical conclusions can be derived, that assures significance of real difference.
小样本和大样本的统计问题:需要样本容量的最佳上界
由于较少的样本是没有意义的,会导致错误的结论,研究人员在进行任何研究之前都会计算最小样本量。虽然较大的样本可以产生更准确的结果,但最大样本量的范围并不是固定的。虽然大样本能够给出准确的估计,但收集的样本多于所需的最少样本的研究可能会得出错误的结论。一般来说,检验统计量是样本量的递增函数,p值的极限(当n趋近于无穷大时)产生统计显著性。本文研究了不同样本量的估计和测试结果的变化模式。对数据中这类模式的评估和对这一主题的扩展研究将有助于找到样本量的区间。研究得出结论,较大的样本对描述性统计的值没有影响,但对推断性统计的值有显著影响,因此需要确定样本量的上限。因此,本文给出了需要找到足够的样本量区间(n1, n2)的相关信息,在这个区间内可以得出有效的统计结论,以保证真实差异的显著性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信