Data mining approach for automatic discovering success factors relationship statements in full text articles

Worarat Krathu, P. Padungweang, Chakarida Nukoolkit
{"title":"Data mining approach for automatic discovering success factors relationship statements in full text articles","authors":"Worarat Krathu, P. Padungweang, Chakarida Nukoolkit","doi":"10.1109/ICACI.2016.7449820","DOIUrl":null,"url":null,"abstract":"In the context of Business-to-Business (B2B), an understanding of inter-organizational success factors and their impacts is crucial for effective strategic management. Several studies regarding those success factors and their influences have been conducted and published as articles. We aim at applying existing techniques, especially data mining, to automatically classify relevant sentences describing an influencing relationship between success factors. This paper presents the experiment method and results to find the optimal data mining workflow for our classification task. In particular, we apply several well-known data mining techniques based on different control factors. Then all discovered models are evaluated and compared to find the optimal data mining workflow. The main contributions include (i) the application of data mining for discovering success factors and their relationships, and (ii) the optimal workflow as a standardized flow for further similar classification tasks. The major challenge of this work is that there exists no mature corpus in this context, and hence our approach is implemented without a supporting corpus. The result shows that the models derived from the workflows that consider a section where a sentence is located perform better than the others in term of average performance. Furthermore, we found that the Support Vector Machine (SVM) performs better than other classifiers.","PeriodicalId":211040,"journal":{"name":"2016 Eighth International Conference on Advanced Computational Intelligence (ICACI)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Eighth International Conference on Advanced Computational Intelligence (ICACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACI.2016.7449820","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

In the context of Business-to-Business (B2B), an understanding of inter-organizational success factors and their impacts is crucial for effective strategic management. Several studies regarding those success factors and their influences have been conducted and published as articles. We aim at applying existing techniques, especially data mining, to automatically classify relevant sentences describing an influencing relationship between success factors. This paper presents the experiment method and results to find the optimal data mining workflow for our classification task. In particular, we apply several well-known data mining techniques based on different control factors. Then all discovered models are evaluated and compared to find the optimal data mining workflow. The main contributions include (i) the application of data mining for discovering success factors and their relationships, and (ii) the optimal workflow as a standardized flow for further similar classification tasks. The major challenge of this work is that there exists no mature corpus in this context, and hence our approach is implemented without a supporting corpus. The result shows that the models derived from the workflows that consider a section where a sentence is located perform better than the others in term of average performance. Furthermore, we found that the Support Vector Machine (SVM) performs better than other classifiers.
全文文章中成功因素关系语句自动发现的数据挖掘方法
在企业对企业(B2B)的背景下,对组织间成功因素及其影响的理解对于有效的战略管理至关重要。关于这些成功因素及其影响的一些研究已经进行并作为文章发表。我们的目标是应用现有的技术,特别是数据挖掘,来自动分类描述成功因素之间影响关系的相关句子。本文给出了为我们的分类任务寻找最优数据挖掘工作流的实验方法和结果。特别是,我们应用了几种基于不同控制因素的知名数据挖掘技术。然后对所有发现的模型进行评估和比较,以找到最优的数据挖掘工作流。主要贡献包括(i)应用数据挖掘来发现成功因素及其关系,以及(ii)将最佳工作流程作为进一步类似分类任务的标准化流程。这项工作的主要挑战是在这种情况下没有成熟的语料库,因此我们的方法是在没有支持语料库的情况下实现的。结果表明,考虑句子所在部分的工作流模型在平均性能方面优于其他模型。此外,我们发现支持向量机(SVM)比其他分类器表现更好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信