{"title":"孟加拉语语篇生成中的话语标记生成与句法聚合","authors":"Sumit Das, A. Basu, S. Sarkar","doi":"10.1109/TECHSYM.2010.5469163","DOIUrl":null,"url":null,"abstract":"In discourse, the elementary text spans are semantically connected by coherence relations. Discourse markers linguistically realize the coherence relations in the surface form. On the other hand, by text aggregation, redundant entities are eliminated, resulting in more fluent, coherent, and concise text. For any but the most application of text generation, appropriate discourse marker selection and text aggregation are two important aspects for coherent text generation. In this paper, we explore the prevalent syntactic aggregation constructs in Bengali and present a rule based approach towards generating Bengali compound sentences using the identified constructs. We present a user based evaluation to validate our approach. At the end, we have also given an outline of a corpus based approach for generating suitable discourse marker in Bengali.","PeriodicalId":262830,"journal":{"name":"2010 IEEE Students Technology Symposium (TechSym)","volume":"106 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Discourse marker generation and syntactic aggregation in Bengali text generation\",\"authors\":\"Sumit Das, A. Basu, S. Sarkar\",\"doi\":\"10.1109/TECHSYM.2010.5469163\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In discourse, the elementary text spans are semantically connected by coherence relations. Discourse markers linguistically realize the coherence relations in the surface form. On the other hand, by text aggregation, redundant entities are eliminated, resulting in more fluent, coherent, and concise text. For any but the most application of text generation, appropriate discourse marker selection and text aggregation are two important aspects for coherent text generation. In this paper, we explore the prevalent syntactic aggregation constructs in Bengali and present a rule based approach towards generating Bengali compound sentences using the identified constructs. We present a user based evaluation to validate our approach. At the end, we have also given an outline of a corpus based approach for generating suitable discourse marker in Bengali.\",\"PeriodicalId\":262830,\"journal\":{\"name\":\"2010 IEEE Students Technology Symposium (TechSym)\",\"volume\":\"106 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-04-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 IEEE Students Technology Symposium (TechSym)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TECHSYM.2010.5469163\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE Students Technology Symposium (TechSym)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TECHSYM.2010.5469163","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Discourse marker generation and syntactic aggregation in Bengali text generation
In discourse, the elementary text spans are semantically connected by coherence relations. Discourse markers linguistically realize the coherence relations in the surface form. On the other hand, by text aggregation, redundant entities are eliminated, resulting in more fluent, coherent, and concise text. For any but the most application of text generation, appropriate discourse marker selection and text aggregation are two important aspects for coherent text generation. In this paper, we explore the prevalent syntactic aggregation constructs in Bengali and present a rule based approach towards generating Bengali compound sentences using the identified constructs. We present a user based evaluation to validate our approach. At the end, we have also given an outline of a corpus based approach for generating suitable discourse marker in Bengali.