Discovering the Causal Network of Terms from the Text Corpus

Yue Wang
{"title":"Discovering the Causal Network of Terms from the Text Corpus","authors":"Yue Wang","doi":"10.1109/ICEBE.2015.11","DOIUrl":null,"url":null,"abstract":"As the \"bag of words\" approaches cut down the linkage between the words, they are hardly to be applied to explore the causal relations between the terms described from the text corpuses. To discover the networked causal knowledge from the corpus, we (1) propose the algorithm pv-swapping and the PV-parse-tree to adjust the term orders for the sentences by observing the relationship between the grammatical voice and causal relation, provide the NLP approaches to transform the corpus to the sets of ordered terms sequences (OTS) by preserving the semantic orders of the phrases in the sentences, (2) formalize the causal network extracting problem in the corpus and show that it is a NP-hard problem, (3) propose the algorithms NE-IC and heuristic-majority-vote to extract the causal network of the terms based on the OTS sequences. We provide sufficient experiments on the real data sets. The experimental results show that our methods are both effective and efficient to discover the causal network of the terms, and the resulted causal networks of heuristic-majority-vote with less conflict causal relations or cycles than the results of NE-IC. At the last, we also provide experiments on several causal knowledge discovering tasks based on the resulted causal networks to show their interesting applications.","PeriodicalId":153535,"journal":{"name":"2015 IEEE 12th International Conference on e-Business Engineering","volume":"165 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 12th International Conference on e-Business Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEBE.2015.11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

As the "bag of words" approaches cut down the linkage between the words, they are hardly to be applied to explore the causal relations between the terms described from the text corpuses. To discover the networked causal knowledge from the corpus, we (1) propose the algorithm pv-swapping and the PV-parse-tree to adjust the term orders for the sentences by observing the relationship between the grammatical voice and causal relation, provide the NLP approaches to transform the corpus to the sets of ordered terms sequences (OTS) by preserving the semantic orders of the phrases in the sentences, (2) formalize the causal network extracting problem in the corpus and show that it is a NP-hard problem, (3) propose the algorithms NE-IC and heuristic-majority-vote to extract the causal network of the terms based on the OTS sequences. We provide sufficient experiments on the real data sets. The experimental results show that our methods are both effective and efficient to discover the causal network of the terms, and the resulted causal networks of heuristic-majority-vote with less conflict causal relations or cycles than the results of NE-IC. At the last, we also provide experiments on several causal knowledge discovering tasks based on the resulted causal networks to show their interesting applications.
从文本语料库中发现词的因果网络
由于“词包法”切断了词与词之间的联系,因此很难应用“词包法”来探讨语篇语料库中词与词之间的因果关系。为了从语料库中发现网络化的因果知识,我们(1)提出了pv-swap算法和PV-parse-tree算法,通过观察语法语态和因果关系之间的关系来调整句子的术语顺序,提供了通过保持句子中短语的语义顺序将语料库转换为有序术语序列集(OTS)的NLP方法;(2)形式化了语料库中的因果网络提取问题,并表明其是一个NP-hard问题;(3)提出了基于OTS序列的术语因果网络提取算法NE-IC和启发式多数投票算法。我们在真实的数据集上提供了足够的实验。实验结果表明,我们的方法能够有效地发现术语的因果网络,并且得到的启发式多数投票因果网络比NE-IC的结果具有更少的冲突因果关系或循环。最后,我们还提供了基于结果因果网络的几个因果知识发现任务的实验,以展示它们的有趣应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信