Samira Lagrini, Nabiha Azizi, M. Redjimi, M. Aldwairi
{"title":"根据修辞关系实现阿拉伯语文本的自动摘要","authors":"Samira Lagrini, Nabiha Azizi, M. Redjimi, M. Aldwairi","doi":"10.1504/IJRIS.2019.10023432","DOIUrl":null,"url":null,"abstract":"Rhetorical relations between two text segments are crucial information and have been proven useful for many natural language processing applications. In this paper, we propose a supervised approach for automatic identifying of rhetorical relations in Arabic texts. Our model attempts to identify both implicit and explicit rhetorical relations between elementary discourse units which will be exploited in automatic summarisation of Arabic texts. To carry out this research, we developed a discourse annotated corpus following the rhetorical structure theory framework with high reliability. Relations annotation was done using a set of 23 fine-grained relations enriched with nuclearity annotation. To automatically learn these relations, we reuse some state of the arts features and contribute new lexical and semantics' features. The experimental results on fine-grained and coarse-grained relations show that our model achieved best performance relative to all baselines.","PeriodicalId":360794,"journal":{"name":"Int. J. Reason. based Intell. Syst.","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Toward an automatic summarisation of Arabic text depending on rhetorical relations\",\"authors\":\"Samira Lagrini, Nabiha Azizi, M. Redjimi, M. Aldwairi\",\"doi\":\"10.1504/IJRIS.2019.10023432\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Rhetorical relations between two text segments are crucial information and have been proven useful for many natural language processing applications. In this paper, we propose a supervised approach for automatic identifying of rhetorical relations in Arabic texts. Our model attempts to identify both implicit and explicit rhetorical relations between elementary discourse units which will be exploited in automatic summarisation of Arabic texts. To carry out this research, we developed a discourse annotated corpus following the rhetorical structure theory framework with high reliability. Relations annotation was done using a set of 23 fine-grained relations enriched with nuclearity annotation. To automatically learn these relations, we reuse some state of the arts features and contribute new lexical and semantics' features. The experimental results on fine-grained and coarse-grained relations show that our model achieved best performance relative to all baselines.\",\"PeriodicalId\":360794,\"journal\":{\"name\":\"Int. J. Reason. based Intell. Syst.\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Int. J. Reason. based Intell. Syst.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1504/IJRIS.2019.10023432\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Reason. based Intell. Syst.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJRIS.2019.10023432","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Toward an automatic summarisation of Arabic text depending on rhetorical relations
Rhetorical relations between two text segments are crucial information and have been proven useful for many natural language processing applications. In this paper, we propose a supervised approach for automatic identifying of rhetorical relations in Arabic texts. Our model attempts to identify both implicit and explicit rhetorical relations between elementary discourse units which will be exploited in automatic summarisation of Arabic texts. To carry out this research, we developed a discourse annotated corpus following the rhetorical structure theory framework with high reliability. Relations annotation was done using a set of 23 fine-grained relations enriched with nuclearity annotation. To automatically learn these relations, we reuse some state of the arts features and contribute new lexical and semantics' features. The experimental results on fine-grained and coarse-grained relations show that our model achieved best performance relative to all baselines.