{"title":"运用修辞结构理论改进阿拉伯语文本自动摘要","authors":"A. Ibrahim, T. Elghazaly","doi":"10.1109/MICAI.2013.35","DOIUrl":null,"url":null,"abstract":"This paper uses a semantic technique by adopting a Rhetorical Structure Theory (RST) for summarization purpose, to discover the most significant paragraphs based on functional and semantic criteria. However, the quality of RST summarization suffers when dealing with large documents. This paper proposes a new hybrid summarization model for Arabic text, which mingles two sub-models: The first sub-model produces a primary summary by using Rhetorical Structure Theory for identifying a range of the most significant parts of the text (the nucleus). Then the second sub-model ranks the significant parts in the primary rhetorical-summary based on the cosine similarity feature. To evaluate the proposed model, a prototype was developed on a range of articles, which have been classified into three groups different in size. The final output summary was evaluated in relation to its manual counterpart. In terms of enhancement of the rhetorical-summary precision, the experiment shows that proposed model HSM average precision is 71.6%, superior over the primary rhetorical-summary precision 56.3%.","PeriodicalId":340039,"journal":{"name":"2013 12th Mexican International Conference on Artificial Intelligence","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":"{\"title\":\"Improve the Automatic Summarization of Arabic Text Depending on Rhetorical Structure Theory\",\"authors\":\"A. Ibrahim, T. Elghazaly\",\"doi\":\"10.1109/MICAI.2013.35\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper uses a semantic technique by adopting a Rhetorical Structure Theory (RST) for summarization purpose, to discover the most significant paragraphs based on functional and semantic criteria. However, the quality of RST summarization suffers when dealing with large documents. This paper proposes a new hybrid summarization model for Arabic text, which mingles two sub-models: The first sub-model produces a primary summary by using Rhetorical Structure Theory for identifying a range of the most significant parts of the text (the nucleus). Then the second sub-model ranks the significant parts in the primary rhetorical-summary based on the cosine similarity feature. To evaluate the proposed model, a prototype was developed on a range of articles, which have been classified into three groups different in size. The final output summary was evaluated in relation to its manual counterpart. In terms of enhancement of the rhetorical-summary precision, the experiment shows that proposed model HSM average precision is 71.6%, superior over the primary rhetorical-summary precision 56.3%.\",\"PeriodicalId\":340039,\"journal\":{\"name\":\"2013 12th Mexican International Conference on Artificial Intelligence\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"12\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 12th Mexican International Conference on Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MICAI.2013.35\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 12th Mexican International Conference on Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MICAI.2013.35","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improve the Automatic Summarization of Arabic Text Depending on Rhetorical Structure Theory
This paper uses a semantic technique by adopting a Rhetorical Structure Theory (RST) for summarization purpose, to discover the most significant paragraphs based on functional and semantic criteria. However, the quality of RST summarization suffers when dealing with large documents. This paper proposes a new hybrid summarization model for Arabic text, which mingles two sub-models: The first sub-model produces a primary summary by using Rhetorical Structure Theory for identifying a range of the most significant parts of the text (the nucleus). Then the second sub-model ranks the significant parts in the primary rhetorical-summary based on the cosine similarity feature. To evaluate the proposed model, a prototype was developed on a range of articles, which have been classified into three groups different in size. The final output summary was evaluated in relation to its manual counterpart. In terms of enhancement of the rhetorical-summary precision, the experiment shows that proposed model HSM average precision is 71.6%, superior over the primary rhetorical-summary precision 56.3%.