{"title":"Discourse marker generation and syntactic aggregation in Bengali text generation","authors":"Sumit Das, A. Basu, S. Sarkar","doi":"10.1109/TECHSYM.2010.5469163","DOIUrl":null,"url":null,"abstract":"In discourse, the elementary text spans are semantically connected by coherence relations. Discourse markers linguistically realize the coherence relations in the surface form. On the other hand, by text aggregation, redundant entities are eliminated, resulting in more fluent, coherent, and concise text. For any but the most application of text generation, appropriate discourse marker selection and text aggregation are two important aspects for coherent text generation. In this paper, we explore the prevalent syntactic aggregation constructs in Bengali and present a rule based approach towards generating Bengali compound sentences using the identified constructs. We present a user based evaluation to validate our approach. At the end, we have also given an outline of a corpus based approach for generating suitable discourse marker in Bengali.","PeriodicalId":262830,"journal":{"name":"2010 IEEE Students Technology Symposium (TechSym)","volume":"106 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE Students Technology Symposium (TechSym)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TECHSYM.2010.5469163","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In discourse, the elementary text spans are semantically connected by coherence relations. Discourse markers linguistically realize the coherence relations in the surface form. On the other hand, by text aggregation, redundant entities are eliminated, resulting in more fluent, coherent, and concise text. For any but the most application of text generation, appropriate discourse marker selection and text aggregation are two important aspects for coherent text generation. In this paper, we explore the prevalent syntactic aggregation constructs in Bengali and present a rule based approach towards generating Bengali compound sentences using the identified constructs. We present a user based evaluation to validate our approach. At the end, we have also given an outline of a corpus based approach for generating suitable discourse marker in Bengali.