Mohammed Bappah Mohammed , Ishaq Abdullahi Baba , Hauwa Danjuma Salihu , Isah Abubakar Ibrahim
{"title":"New class width rule for continuous frequency tables","authors":"Mohammed Bappah Mohammed , Ishaq Abdullahi Baba , Hauwa Danjuma Salihu , Isah Abubakar Ibrahim","doi":"10.1016/j.rico.2024.100506","DOIUrl":null,"url":null,"abstract":"<div><div>The most significant parameter which must be determined before constructing a frequency table or a histogram is the number of classes or class width. Choosing the appropriate number of classes or class remains a long-lasting problem in statistics. Apart from the rules of thumb several more sophisticated rules were reported in the literature. However, none of them has been proven to be better in all situations. In this research, we proposed a new class width rule which can be used when building a frequency table or a histogram. The new class width rule is compared with nine existing classification rules, Sturges, Scott, Freedman and Diaconis, Doane, Terrel and Scott, Cencov, Cochran, Square root, and Rice rules, using the root mean-squared-error (RMSE). The accuracy of the classification rules is assessed using simulations from normal, uniform, exponential, log-normal, and gamma distributions, and also real data. The findings indicated that the proposed rule outperformed the other binning rules for simulations using normal, exponential, log-normal, and gamma distributions. Meanwhile, the square root rule performed better relative to the other classification rules for simulations from the uniform distribution. Comparison using real data showed that the proposed rule performed better than the other classification rules.</div></div>","PeriodicalId":34733,"journal":{"name":"Results in Control and Optimization","volume":"18 ","pages":"Article 100506"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Results in Control and Optimization","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666720724001358","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 0
Abstract
The most significant parameter which must be determined before constructing a frequency table or a histogram is the number of classes or class width. Choosing the appropriate number of classes or class remains a long-lasting problem in statistics. Apart from the rules of thumb several more sophisticated rules were reported in the literature. However, none of them has been proven to be better in all situations. In this research, we proposed a new class width rule which can be used when building a frequency table or a histogram. The new class width rule is compared with nine existing classification rules, Sturges, Scott, Freedman and Diaconis, Doane, Terrel and Scott, Cencov, Cochran, Square root, and Rice rules, using the root mean-squared-error (RMSE). The accuracy of the classification rules is assessed using simulations from normal, uniform, exponential, log-normal, and gamma distributions, and also real data. The findings indicated that the proposed rule outperformed the other binning rules for simulations using normal, exponential, log-normal, and gamma distributions. Meanwhile, the square root rule performed better relative to the other classification rules for simulations from the uniform distribution. Comparison using real data showed that the proposed rule performed better than the other classification rules.