从文本中自动提取因果链

Aliaksandr Huminski, Yan Bin Ng
{"title":"从文本中自动提取因果链","authors":"Aliaksandr Huminski, Yan Bin Ng","doi":"10.32655/libres.2020.29.2.3","DOIUrl":null,"url":null,"abstract":"Background. Automatic extraction of causal chains is valuable for discovering previously unknown and hidden connections between events. However, there is only a handful of works devoted to automatic extraction of causal chains from text. Objective. To develop a method for automatic extraction of causal chains from text. Method. A new approach based on linguistic templates is suggested for causal chain extraction. It is domain-independent, not restricted to extraction from single sentences and unfolded on big data. For implementation, a sequence of four modules was deployed. These are verb restriction, part-of-speech tagging, extracting causal relations, and unification and matching events. Results. 14,821 causal chains (with length=2) have been extracted from 100,000 English Wikipedia articles. Contributions. The extracted causal chains can contribute to developing commonsense knowledge bases, reasoning resources, problem-solving, and generally in discovering previously unknown relationships between entities/events.","PeriodicalId":129706,"journal":{"name":"Library and Information Science Research E-Journal","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Automatic extraction of causal chains from text\",\"authors\":\"Aliaksandr Huminski, Yan Bin Ng\",\"doi\":\"10.32655/libres.2020.29.2.3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background. Automatic extraction of causal chains is valuable for discovering previously unknown and hidden connections between events. However, there is only a handful of works devoted to automatic extraction of causal chains from text. Objective. To develop a method for automatic extraction of causal chains from text. Method. A new approach based on linguistic templates is suggested for causal chain extraction. It is domain-independent, not restricted to extraction from single sentences and unfolded on big data. For implementation, a sequence of four modules was deployed. These are verb restriction, part-of-speech tagging, extracting causal relations, and unification and matching events. Results. 14,821 causal chains (with length=2) have been extracted from 100,000 English Wikipedia articles. Contributions. The extracted causal chains can contribute to developing commonsense knowledge bases, reasoning resources, problem-solving, and generally in discovering previously unknown relationships between entities/events.\",\"PeriodicalId\":129706,\"journal\":{\"name\":\"Library and Information Science Research E-Journal\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Library and Information Science Research E-Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.32655/libres.2020.29.2.3\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Library and Information Science Research E-Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32655/libres.2020.29.2.3","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

背景。因果链的自动提取对于发现事件之间先前未知和隐藏的联系是有价值的。然而,只有少数的作品致力于从文本中自动提取因果链。目标。开发一种自动抽取文本因果链的方法。方法。提出了一种基于语言模板的因果链抽取方法。它是独立于领域的,不局限于从单句中提取,而是在大数据上展开。为了实现,部署了四个模块的序列。这些包括动词限制、词性标注、提取因果关系以及统一和匹配事件。结果:从100,000篇英文维基百科文章中提取了14,821条因果链(长度=2)。的贡献。抽取出的因果链有助于开发常识性知识库、推理资源、解决问题,通常有助于发现实体/事件之间以前未知的关系。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Automatic extraction of causal chains from text
Background. Automatic extraction of causal chains is valuable for discovering previously unknown and hidden connections between events. However, there is only a handful of works devoted to automatic extraction of causal chains from text. Objective. To develop a method for automatic extraction of causal chains from text. Method. A new approach based on linguistic templates is suggested for causal chain extraction. It is domain-independent, not restricted to extraction from single sentences and unfolded on big data. For implementation, a sequence of four modules was deployed. These are verb restriction, part-of-speech tagging, extracting causal relations, and unification and matching events. Results. 14,821 causal chains (with length=2) have been extracted from 100,000 English Wikipedia articles. Contributions. The extracted causal chains can contribute to developing commonsense knowledge bases, reasoning resources, problem-solving, and generally in discovering previously unknown relationships between entities/events.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信