利用STATISTICA软件包构造分组

V. S. Fetisov
{"title":"利用STATISTICA软件包构造分组","authors":"V. S. Fetisov","doi":"10.31767/su.4(83)2018.04.14","DOIUrl":null,"url":null,"abstract":"STATISTICA software package for statistical analysis incorporates a wide range of advanced statistical methods. Quite often they are preceded by aggregating statistical survey data, which main component is their grouping. Although this phase of statistical data processing is relatively simple, the manual process of aggregation can be time-consuming given the need to process large data arrays, not mentioning a high probability of errors. Therefore, the all-purpose STATISTICA software package is a logical and reasonable tool for grouping of data.     \nThe article shows the grouping algorithm in STATISTICA software package, with focus on setup when constructing tables of frequencies of discrete and continual characters. Various options of grouping are scrutinized, with providing examples of their visualization.     \nA large number of STATISTICA parameters offers ample opportunities for constructing user tables, but users often are not aware of these options or do not know how they can be applied. Yet, the apparently simple grouping process in STATISTICA software package can sometimes require the knowledge of fine mechanisms for its setup. The article gives a detailed description of the mechanisms for creating interval margins when applying the parameter “approximate number of intervals”. \nThe standard algorithm for selection is analyzed, allowing a user to limit the number of groups in a grouping. STATISTICA allows for using a number of grouping parameters, enabling to produce more convenient results or filter them. Thus, setting the clicker for label field “Grouping” in the position “Integer Categories” (integer intervals (categories)) initiates the grouping only for integer values of a variable, by excluding the observations containing its fractional values. \nWhen only standard parameters are used, it will be impossible to form uneven or open intervals.  This issue is out of focus in specialized literature and Internet sources. The article shows the algorithm for constructing open intervals by user-set conditions and the process of creating these conditions. This option allows for forming both closed and open intervals by solving all the problems in time of grouping. Because creating such conditions is time consuming, they should be preserved if they are required for further use. \nSetting up of STATISTICA software with missing data is analyzed. Its application will be advisable when a grouping for two or more variables is constructed. In this case, a separate sheet with a grouping is to be created in the worksheet for each variable.      ","PeriodicalId":52812,"journal":{"name":"Statistika Ukrayini","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2018-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Constructing Groupings by Use of STATISTICA Software Package\",\"authors\":\"V. S. Fetisov\",\"doi\":\"10.31767/su.4(83)2018.04.14\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"STATISTICA software package for statistical analysis incorporates a wide range of advanced statistical methods. Quite often they are preceded by aggregating statistical survey data, which main component is their grouping. Although this phase of statistical data processing is relatively simple, the manual process of aggregation can be time-consuming given the need to process large data arrays, not mentioning a high probability of errors. Therefore, the all-purpose STATISTICA software package is a logical and reasonable tool for grouping of data.     \\nThe article shows the grouping algorithm in STATISTICA software package, with focus on setup when constructing tables of frequencies of discrete and continual characters. Various options of grouping are scrutinized, with providing examples of their visualization.     \\nA large number of STATISTICA parameters offers ample opportunities for constructing user tables, but users often are not aware of these options or do not know how they can be applied. Yet, the apparently simple grouping process in STATISTICA software package can sometimes require the knowledge of fine mechanisms for its setup. The article gives a detailed description of the mechanisms for creating interval margins when applying the parameter “approximate number of intervals”. \\nThe standard algorithm for selection is analyzed, allowing a user to limit the number of groups in a grouping. STATISTICA allows for using a number of grouping parameters, enabling to produce more convenient results or filter them. Thus, setting the clicker for label field “Grouping” in the position “Integer Categories” (integer intervals (categories)) initiates the grouping only for integer values of a variable, by excluding the observations containing its fractional values. \\nWhen only standard parameters are used, it will be impossible to form uneven or open intervals.  This issue is out of focus in specialized literature and Internet sources. The article shows the algorithm for constructing open intervals by user-set conditions and the process of creating these conditions. This option allows for forming both closed and open intervals by solving all the problems in time of grouping. Because creating such conditions is time consuming, they should be preserved if they are required for further use. \\nSetting up of STATISTICA software with missing data is analyzed. Its application will be advisable when a grouping for two or more variables is constructed. In this case, a separate sheet with a grouping is to be created in the worksheet for each variable.      \",\"PeriodicalId\":52812,\"journal\":{\"name\":\"Statistika Ukrayini\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistika Ukrayini\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.31767/su.4(83)2018.04.14\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistika Ukrayini","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31767/su.4(83)2018.04.14","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

用于统计分析的STATISTICA软件包包含了广泛的先进统计方法。通常情况下,在他们之前汇总统计调查数据,其主要组成部分是他们的分组。虽然统计数据处理的这一阶段相对简单,但考虑到需要处理大型数据数组,更不用说错误的高概率,手动聚合过程可能会很耗时。因此,通用的STATISTICA软件包是一个逻辑合理的数据分组工具。本文介绍了STATISTICA软件包中的分组算法,重点介绍了在构造离散和连续字符频率表时的设置。详细介绍了分组的各种选项,并提供了可视化的示例。大量的STATISTICA参数为构造用户表提供了充分的机会,但是用户通常不知道这些选项,或者不知道如何应用它们。然而,STATISTICA软件包中看似简单的分组过程有时需要了解其设置的良好机制。本文详细描述了应用参数“近似区间数”时创建区间边际的机制。分析了选择的标准算法,允许用户限制分组中的组数。STATISTICA允许使用许多分组参数,从而能够生成更方便的结果或过滤它们。因此,将标签字段“Grouping”的点击器设置在“Integer Categories”(整数间隔(类别))位置,通过排除包含其小数值的观测值,只对变量的整数值进行分组。当只使用标准参数时,将不可能形成不均匀或开放的间隔。这个问题在专业文献和网络资源中没有得到关注。本文展示了通过用户设置条件构造开区间的算法以及创建这些条件的过程。此选项允许通过在分组时间内解决所有问题来形成封闭和开放区间。由于创建这样的条件非常耗时,因此如果需要进一步使用,则应保留这些条件。分析了缺失数据下STATISTICA软件的设置问题。当构造两个或多个变量的分组时,它的应用将是可取的。在这种情况下,将在工作表中为每个变量创建具有分组的单独工作表。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Constructing Groupings by Use of STATISTICA Software Package
STATISTICA software package for statistical analysis incorporates a wide range of advanced statistical methods. Quite often they are preceded by aggregating statistical survey data, which main component is their grouping. Although this phase of statistical data processing is relatively simple, the manual process of aggregation can be time-consuming given the need to process large data arrays, not mentioning a high probability of errors. Therefore, the all-purpose STATISTICA software package is a logical and reasonable tool for grouping of data.     The article shows the grouping algorithm in STATISTICA software package, with focus on setup when constructing tables of frequencies of discrete and continual characters. Various options of grouping are scrutinized, with providing examples of their visualization.     A large number of STATISTICA parameters offers ample opportunities for constructing user tables, but users often are not aware of these options or do not know how they can be applied. Yet, the apparently simple grouping process in STATISTICA software package can sometimes require the knowledge of fine mechanisms for its setup. The article gives a detailed description of the mechanisms for creating interval margins when applying the parameter “approximate number of intervals”. The standard algorithm for selection is analyzed, allowing a user to limit the number of groups in a grouping. STATISTICA allows for using a number of grouping parameters, enabling to produce more convenient results or filter them. Thus, setting the clicker for label field “Grouping” in the position “Integer Categories” (integer intervals (categories)) initiates the grouping only for integer values of a variable, by excluding the observations containing its fractional values. When only standard parameters are used, it will be impossible to form uneven or open intervals.  This issue is out of focus in specialized literature and Internet sources. The article shows the algorithm for constructing open intervals by user-set conditions and the process of creating these conditions. This option allows for forming both closed and open intervals by solving all the problems in time of grouping. Because creating such conditions is time consuming, they should be preserved if they are required for further use. Setting up of STATISTICA software with missing data is analyzed. Its application will be advisable when a grouping for two or more variables is constructed. In this case, a separate sheet with a grouping is to be created in the worksheet for each variable.      
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
10
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信