根据修辞关系实现阿拉伯语文本的自动摘要

Samira Lagrini, Nabiha Azizi, M. Redjimi, M. Aldwairi
{"title":"根据修辞关系实现阿拉伯语文本的自动摘要","authors":"Samira Lagrini, Nabiha Azizi, M. Redjimi, M. Aldwairi","doi":"10.1504/IJRIS.2019.10023432","DOIUrl":null,"url":null,"abstract":"Rhetorical relations between two text segments are crucial information and have been proven useful for many natural language processing applications. In this paper, we propose a supervised approach for automatic identifying of rhetorical relations in Arabic texts. Our model attempts to identify both implicit and explicit rhetorical relations between elementary discourse units which will be exploited in automatic summarisation of Arabic texts. To carry out this research, we developed a discourse annotated corpus following the rhetorical structure theory framework with high reliability. Relations annotation was done using a set of 23 fine-grained relations enriched with nuclearity annotation. To automatically learn these relations, we reuse some state of the arts features and contribute new lexical and semantics' features. The experimental results on fine-grained and coarse-grained relations show that our model achieved best performance relative to all baselines.","PeriodicalId":360794,"journal":{"name":"Int. J. Reason. based Intell. Syst.","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Toward an automatic summarisation of Arabic text depending on rhetorical relations\",\"authors\":\"Samira Lagrini, Nabiha Azizi, M. Redjimi, M. Aldwairi\",\"doi\":\"10.1504/IJRIS.2019.10023432\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Rhetorical relations between two text segments are crucial information and have been proven useful for many natural language processing applications. In this paper, we propose a supervised approach for automatic identifying of rhetorical relations in Arabic texts. Our model attempts to identify both implicit and explicit rhetorical relations between elementary discourse units which will be exploited in automatic summarisation of Arabic texts. To carry out this research, we developed a discourse annotated corpus following the rhetorical structure theory framework with high reliability. Relations annotation was done using a set of 23 fine-grained relations enriched with nuclearity annotation. To automatically learn these relations, we reuse some state of the arts features and contribute new lexical and semantics' features. The experimental results on fine-grained and coarse-grained relations show that our model achieved best performance relative to all baselines.\",\"PeriodicalId\":360794,\"journal\":{\"name\":\"Int. J. Reason. based Intell. Syst.\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Reason. based Intell. Syst.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1504/IJRIS.2019.10023432\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Reason. based Intell. Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJRIS.2019.10023432","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

两个文本段之间的修辞关系是重要的信息,并已被证明在许多自然语言处理应用中是有用的。本文提出了一种监督式的阿拉伯语语篇修辞关系自动识别方法。我们的模型试图识别基本话语单元之间隐含和明确的修辞关系,这些关系将在阿拉伯语文本的自动摘要中被利用。为了开展这一研究,我们根据修辞结构理论框架开发了一个高可靠性的语篇标注语料库。关系注释使用一组23个细粒度的关系来完成,这些关系丰富了核注释。为了自动学习这些关系,我们重用了一些最先进的特征,并贡献了新的词汇和语义特征。细粒度和粗粒度关系的实验结果表明,我们的模型相对于所有基线都取得了最好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Toward an automatic summarisation of Arabic text depending on rhetorical relations
Rhetorical relations between two text segments are crucial information and have been proven useful for many natural language processing applications. In this paper, we propose a supervised approach for automatic identifying of rhetorical relations in Arabic texts. Our model attempts to identify both implicit and explicit rhetorical relations between elementary discourse units which will be exploited in automatic summarisation of Arabic texts. To carry out this research, we developed a discourse annotated corpus following the rhetorical structure theory framework with high reliability. Relations annotation was done using a set of 23 fine-grained relations enriched with nuclearity annotation. To automatically learn these relations, we reuse some state of the arts features and contribute new lexical and semantics' features. The experimental results on fine-grained and coarse-grained relations show that our model achieved best performance relative to all baselines.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信