使用条件语言模型的上下文驱动的孟加拉语文本生成

Statistics, optimization & information computing Pub Date : 2021-03-10 DOI:10.19139/SOIC-2310-5070-1061

Md. Raisul Kibria, M. Yousuf

{"title":"使用条件语言模型的上下文驱动的孟加拉语文本生成","authors":"Md. Raisul Kibria, M. Yousuf","doi":"10.19139/SOIC-2310-5070-1061","DOIUrl":null,"url":null,"abstract":"Text generation is a rapidly evolving field of Natural Language Processing (NLP) with larger Language models proposed very often setting new state-of-the-art. These models are extremely effective in learning the representation of words and their internal coherence in a particular language. However, an established context-driven, end to end text generation model is very rare, even more so for the Bengali language. In this paper, we have proposed a Bidirectional gated recurrent unit (GRU) based architecture that simulates the conditional language model or the decoder portion of the sequence to sequence (seq2seq) model and is further conditioned upon the target context vectors. We have explored several ways of combining multiple context words into a fixed dimensional vector representation that is extracted from the same GloVe language model which is used to generate the embedding matrix. We have used beam search optimization to generate the sentence with the maximum cumulative log probability score. In addition, we have proposed a human scoring based evaluation metric and used it to compare the performance of the model with unidirectional LSTM and GRU networks. Empirical results prove that the proposed model performs exceedingly well in producing meaningful outcomes depicting the target context. The experiment leads to an architecture that can be applied to an extensive domain of context-driven text generation based applications and which is also a key contribution to the NLP based literature of the Bengali language.","PeriodicalId":93376,"journal":{"name":"Statistics, optimization & information computing","volume":"9 1","pages":"334-350"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Context-driven Bengali Text Generation using Conditional Language Model\",\"authors\":\"Md. Raisul Kibria, M. Yousuf\",\"doi\":\"10.19139/SOIC-2310-5070-1061\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text generation is a rapidly evolving field of Natural Language Processing (NLP) with larger Language models proposed very often setting new state-of-the-art. These models are extremely effective in learning the representation of words and their internal coherence in a particular language. However, an established context-driven, end to end text generation model is very rare, even more so for the Bengali language. In this paper, we have proposed a Bidirectional gated recurrent unit (GRU) based architecture that simulates the conditional language model or the decoder portion of the sequence to sequence (seq2seq) model and is further conditioned upon the target context vectors. We have explored several ways of combining multiple context words into a fixed dimensional vector representation that is extracted from the same GloVe language model which is used to generate the embedding matrix. We have used beam search optimization to generate the sentence with the maximum cumulative log probability score. In addition, we have proposed a human scoring based evaluation metric and used it to compare the performance of the model with unidirectional LSTM and GRU networks. Empirical results prove that the proposed model performs exceedingly well in producing meaningful outcomes depicting the target context. The experiment leads to an architecture that can be applied to an extensive domain of context-driven text generation based applications and which is also a key contribution to the NLP based literature of the Bengali language.\",\"PeriodicalId\":93376,\"journal\":{\"name\":\"Statistics, optimization & information computing\",\"volume\":\"9 1\",\"pages\":\"334-350\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-03-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistics, optimization & information computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.19139/SOIC-2310-5070-1061\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics, optimization & information computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.19139/SOIC-2310-5070-1061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

文本生成是自然语言处理（NLP）的一个快速发展的领域，提出的更大的语言模型往往会带来新的技术。这些模型在学习特定语言中单词的表达及其内部连贯性方面非常有效。然而，已经建立的上下文驱动的端到端文本生成模型非常罕见，孟加拉语更是如此。在本文中，我们提出了一种基于双向门控递归单元（GRU）的架构，该架构模拟条件语言模型或序列到序列（seq2seq）模型的解码器部分，并进一步以目标上下文向量为条件。我们已经探索了几种将多个上下文单词组合成固定维向量表示的方法，该向量表示是从用于生成嵌入矩阵的同一GloVe语言模型中提取的。我们使用波束搜索优化来生成具有最大累积对数概率得分的句子。此外，我们还提出了一种基于人工评分的评估指标，并将其用于将模型的性能与单向LSTM和GRU网络进行比较。经验结果证明，所提出的模型在产生描述目标上下文的有意义的结果方面表现得非常好。该实验产生了一种可以应用于基于上下文驱动的文本生成应用程序的广泛领域的架构，这也是对孟加拉语基于NLP的文献的关键贡献。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Context-driven Bengali Text Generation using Conditional Language Model

Text generation is a rapidly evolving field of Natural Language Processing (NLP) with larger Language models proposed very often setting new state-of-the-art. These models are extremely effective in learning the representation of words and their internal coherence in a particular language. However, an established context-driven, end to end text generation model is very rare, even more so for the Bengali language. In this paper, we have proposed a Bidirectional gated recurrent unit (GRU) based architecture that simulates the conditional language model or the decoder portion of the sequence to sequence (seq2seq) model and is further conditioned upon the target context vectors. We have explored several ways of combining multiple context words into a fixed dimensional vector representation that is extracted from the same GloVe language model which is used to generate the embedding matrix. We have used beam search optimization to generate the sentence with the maximum cumulative log probability score. In addition, we have proposed a human scoring based evaluation metric and used it to compare the performance of the model with unidirectional LSTM and GRU networks. Empirical results prove that the proposed model performs exceedingly well in producing meaningful outcomes depicting the target context. The experiment leads to an architecture that can be applied to an extensive domain of context-driven text generation based applications and which is also a key contribution to the NLP based literature of the Bengali language.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Statistics, optimization & information computing

自引率

0.00%

发文量