Statistical techniques vs. SEE5 algorithm: an application to a small business environment

J. Andrés
The International Journal of Digital Accounting Research. DOI: 10.4192/1577-8517-V1_8 (https://doi.org/10.4192/1577-8517-V1_8)
Citations: 17

Abstract

The aim of this research is to compare the accuracy of a rule induction classifier system, Quinlan's SEE5, with that of linear discriminant analysis and logit. The classification task is to distinguish the most efficient companies from the least efficient ones on the basis of a set of financial variables. The sample is drawn from a database containing the annual accounts of companies located in the Principality of Asturias (Spain), most of which are small businesses. The main results indicate that SEE5 outperforms logit but is not clearly better than discriminant analysis. However, SEE5 models suffer larger increases in error rate when tested on validation samples. Another interesting finding is that, in SEE5 systems, both the number of variables selected and the number of rules inferred grow as sample size increases.
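The comparison described in the abstract (fit each classifier on a training sample, then see how much its error rate rises on a validation sample) can be sketched as follows. This is a minimal illustration under stated assumptions, not the author's procedure: the data are synthetic rather than the Asturias accounts, the function name `compare_error_rates` is hypothetical, and a scikit-learn decision tree stands in for SEE5, which scikit-learn does not provide.

```python
from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier


def compare_error_rates(n_samples=600, seed=0):
    """Return {model name: (training error, validation error)} on synthetic data."""
    # Assumption: synthetic features stand in for the financial variables.
    X, y = make_classification(
        n_samples=n_samples, n_features=10, n_informative=5, random_state=seed
    )
    X_train, X_val, y_train, y_val = train_test_split(
        X, y, test_size=0.3, random_state=seed, stratify=y
    )
    models = {
        "discriminant analysis": LinearDiscriminantAnalysis(),
        "logit": LogisticRegression(max_iter=1000),
        # Assumption: a CART-style tree is used in place of SEE5.
        "decision tree (SEE5 stand-in)": DecisionTreeClassifier(random_state=seed),
    }
    results = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        # score() returns accuracy, so the error rate is 1 - accuracy.
        train_error = 1.0 - model.score(X_train, y_train)
        val_error = 1.0 - model.score(X_val, y_val)
        results[name] = (train_error, val_error)
    return results


if __name__ == "__main__":
    for name, (train_error, val_error) in compare_error_rates().items():
        gap = val_error - train_error
        print(f"{name}: train={train_error:.3f} validation={val_error:.3f} gap={gap:+.3f}")
```

The gap between validation and training error is the quantity the abstract refers to when it says SEE5 models suffer larger increases in error rate on validation samples. Calling the function with different `n_samples` values gives a rough way to explore how the results change with sample size.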