根据修辞关系实现阿拉伯语文本的自动摘要

Int. J. Reason. based Intell. Syst. Pub Date : 2019-08-23 DOI:10.1504/IJRIS.2019.10023432

Samira Lagrini, Nabiha Azizi, M. Redjimi, M. Aldwairi

{"title":"根据修辞关系实现阿拉伯语文本的自动摘要","authors":"Samira Lagrini, Nabiha Azizi, M. Redjimi, M. Aldwairi","doi":"10.1504/IJRIS.2019.10023432","DOIUrl":null,"url":null,"abstract":"Rhetorical relations between two text segments are crucial information and have been proven useful for many natural language processing applications. In this paper, we propose a supervised approach for automatic identifying of rhetorical relations in Arabic texts. Our model attempts to identify both implicit and explicit rhetorical relations between elementary discourse units which will be exploited in automatic summarisation of Arabic texts. To carry out this research, we developed a discourse annotated corpus following the rhetorical structure theory framework with high reliability. Relations annotation was done using a set of 23 fine-grained relations enriched with nuclearity annotation. To automatically learn these relations, we reuse some state of the arts features and contribute new lexical and semantics' features. The experimental results on fine-grained and coarse-grained relations show that our model achieved best performance relative to all baselines.","PeriodicalId":360794,"journal":{"name":"Int. J. Reason. based Intell. Syst.","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Toward an automatic summarisation of Arabic text depending on rhetorical relations\",\"authors\":\"Samira Lagrini, Nabiha Azizi, M. Redjimi, M. Aldwairi\",\"doi\":\"10.1504/IJRIS.2019.10023432\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Rhetorical relations between two text segments are crucial information and have been proven useful for many natural language processing applications. In this paper, we propose a supervised approach for automatic identifying of rhetorical relations in Arabic texts. Our model attempts to identify both implicit and explicit rhetorical relations between elementary discourse units which will be exploited in automatic summarisation of Arabic texts. To carry out this research, we developed a discourse annotated corpus following the rhetorical structure theory framework with high reliability. Relations annotation was done using a set of 23 fine-grained relations enriched with nuclearity annotation. To automatically learn these relations, we reuse some state of the arts features and contribute new lexical and semantics' features. The experimental results on fine-grained and coarse-grained relations show that our model achieved best performance relative to all baselines.\",\"PeriodicalId\":360794,\"journal\":{\"name\":\"Int. J. Reason. based Intell. Syst.\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Reason. based Intell. Syst.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1504/IJRIS.2019.10023432\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Reason. based Intell. Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJRIS.2019.10023432","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

两个文本段之间的修辞关系是重要的信息，并已被证明在许多自然语言处理应用中是有用的。本文提出了一种监督式的阿拉伯语语篇修辞关系自动识别方法。我们的模型试图识别基本话语单元之间隐含和明确的修辞关系，这些关系将在阿拉伯语文本的自动摘要中被利用。为了开展这一研究，我们根据修辞结构理论框架开发了一个高可靠性的语篇标注语料库。关系注释使用一组23个细粒度的关系来完成，这些关系丰富了核注释。为了自动学习这些关系，我们重用了一些最先进的特征，并贡献了新的词汇和语义特征。细粒度和粗粒度关系的实验结果表明，我们的模型相对于所有基线都取得了最好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Toward an automatic summarisation of Arabic text depending on rhetorical relations

Rhetorical relations between two text segments are crucial information and have been proven useful for many natural language processing applications. In this paper, we propose a supervised approach for automatic identifying of rhetorical relations in Arabic texts. Our model attempts to identify both implicit and explicit rhetorical relations between elementary discourse units which will be exploited in automatic summarisation of Arabic texts. To carry out this research, we developed a discourse annotated corpus following the rhetorical structure theory framework with high reliability. Relations annotation was done using a set of 23 fine-grained relations enriched with nuclearity annotation. To automatically learn these relations, we reuse some state of the arts features and contribute new lexical and semantics' features. The experimental results on fine-grained and coarse-grained relations show that our model achieved best performance relative to all baselines.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Int. J. Reason. based Intell. Syst.

自引率

0.00%

发文量