Data mining approach for automatic discovering success factors relationship statements in full text articles

2016 Eighth International Conference on Advanced Computational Intelligence (ICACI) Pub Date : 2016-02-01 DOI:10.1109/ICACI.2016.7449820

Worarat Krathu, P. Padungweang, Chakarida Nukoolkit

{"title":"Data mining approach for automatic discovering success factors relationship statements in full text articles","authors":"Worarat Krathu, P. Padungweang, Chakarida Nukoolkit","doi":"10.1109/ICACI.2016.7449820","DOIUrl":null,"url":null,"abstract":"In the context of Business-to-Business (B2B), an understanding of inter-organizational success factors and their impacts is crucial for effective strategic management. Several studies regarding those success factors and their influences have been conducted and published as articles. We aim at applying existing techniques, especially data mining, to automatically classify relevant sentences describing an influencing relationship between success factors. This paper presents the experiment method and results to find the optimal data mining workflow for our classification task. In particular, we apply several well-known data mining techniques based on different control factors. Then all discovered models are evaluated and compared to find the optimal data mining workflow. The main contributions include (i) the application of data mining for discovering success factors and their relationships, and (ii) the optimal workflow as a standardized flow for further similar classification tasks. The major challenge of this work is that there exists no mature corpus in this context, and hence our approach is implemented without a supporting corpus. The result shows that the models derived from the workflows that consider a section where a sentence is located perform better than the others in term of average performance. Furthermore, we found that the Support Vector Machine (SVM) performs better than other classifiers.","PeriodicalId":211040,"journal":{"name":"2016 Eighth International Conference on Advanced Computational Intelligence (ICACI)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Eighth International Conference on Advanced Computational Intelligence (ICACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACI.2016.7449820","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

In the context of Business-to-Business (B2B), an understanding of inter-organizational success factors and their impacts is crucial for effective strategic management. Several studies regarding those success factors and their influences have been conducted and published as articles. We aim at applying existing techniques, especially data mining, to automatically classify relevant sentences describing an influencing relationship between success factors. This paper presents the experiment method and results to find the optimal data mining workflow for our classification task. In particular, we apply several well-known data mining techniques based on different control factors. Then all discovered models are evaluated and compared to find the optimal data mining workflow. The main contributions include (i) the application of data mining for discovering success factors and their relationships, and (ii) the optimal workflow as a standardized flow for further similar classification tasks. The major challenge of this work is that there exists no mature corpus in this context, and hence our approach is implemented without a supporting corpus. The result shows that the models derived from the workflows that consider a section where a sentence is located perform better than the others in term of average performance. Furthermore, we found that the Support Vector Machine (SVM) performs better than other classifiers.

查看原文本刊更多论文

全文文章中成功因素关系语句自动发现的数据挖掘方法

在企业对企业(B2B)的背景下，对组织间成功因素及其影响的理解对于有效的战略管理至关重要。关于这些成功因素及其影响的一些研究已经进行并作为文章发表。我们的目标是应用现有的技术，特别是数据挖掘，来自动分类描述成功因素之间影响关系的相关句子。本文给出了为我们的分类任务寻找最优数据挖掘工作流的实验方法和结果。特别是，我们应用了几种基于不同控制因素的知名数据挖掘技术。然后对所有发现的模型进行评估和比较，以找到最优的数据挖掘工作流。主要贡献包括(i)应用数据挖掘来发现成功因素及其关系，以及(ii)将最佳工作流程作为进一步类似分类任务的标准化流程。这项工作的主要挑战是在这种情况下没有成熟的语料库，因此我们的方法是在没有支持语料库的情况下实现的。结果表明，考虑句子所在部分的工作流模型在平均性能方面优于其他模型。此外，我们发现支持向量机(SVM)比其他分类器表现更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2016 Eighth International Conference on Advanced Computational Intelligence (ICACI)

自引率

0.00%

发文量