用快速文本转换网络生成中文故事

2019 International Conference on Artificial Intelligence in Information and Communication (ICAIIC) Pub Date : 2019-02-01 DOI:10.1109/ICAIIC.2019.8669087

Jhe-Wei Lin, Yunwen Gao, Rong-Guey Chang

{"title":"用快速文本转换网络生成中文故事","authors":"Jhe-Wei Lin, Yunwen Gao, Rong-Guey Chang","doi":"10.1109/ICAIIC.2019.8669087","DOIUrl":null,"url":null,"abstract":"The sequence transformer models are based on complex recurrent neural network or convolutional networks that include an encoder and a decoder. High-accuracy models are usually represented by used connect the encoder and decoder through an attention mechanism. Story generation is an important thing. If we can let computers learn the ability of story-telling, computers can help people do more things. Actually, the squence2squence model combine attention mechanism is being used to Chinese poetry generation. However, it difficult to apply in Chinese story generation, because there are some rules in Chinese poetry generation. Therefore, we trying to use 1372 human-labeled summarization of paragraphs from a classic novel named “Demi-Gods and Semi-Devils” (天龍八部) to train the transformer network. In our experiment, we use FastText to combine Demi-Gods and Semi-Devils Dataset and A Large Scale Chinese Short Text Summarization Dataset to be input data. In addition, we got a lower loss rate by using two layer of self-attention mechanism.","PeriodicalId":273383,"journal":{"name":"2019 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Chinese Story Generation with FastText Transformer Network\",\"authors\":\"Jhe-Wei Lin, Yunwen Gao, Rong-Guey Chang\",\"doi\":\"10.1109/ICAIIC.2019.8669087\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The sequence transformer models are based on complex recurrent neural network or convolutional networks that include an encoder and a decoder. High-accuracy models are usually represented by used connect the encoder and decoder through an attention mechanism. Story generation is an important thing. If we can let computers learn the ability of story-telling, computers can help people do more things. Actually, the squence2squence model combine attention mechanism is being used to Chinese poetry generation. However, it difficult to apply in Chinese story generation, because there are some rules in Chinese poetry generation. Therefore, we trying to use 1372 human-labeled summarization of paragraphs from a classic novel named “Demi-Gods and Semi-Devils” (天龍八部) to train the transformer network. In our experiment, we use FastText to combine Demi-Gods and Semi-Devils Dataset and A Large Scale Chinese Short Text Summarization Dataset to be input data. In addition, we got a lower loss rate by using two layer of self-attention mechanism.\",\"PeriodicalId\":273383,\"journal\":{\"name\":\"2019 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIIC.2019.8669087\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIIC.2019.8669087","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

序列变压器模型是基于复杂的递归神经网络或卷积网络，包括一个编码器和一个解码器。高精度模型通常通过注意机制将编码器和解码器连接起来。故事生成是一件重要的事情。如果我们能让电脑学会讲故事的能力，电脑就能帮助人们做更多的事情。事实上，squence2squence模型结合注意机制已被应用到汉语诗歌生成中。然而，由于中国诗歌的生成有一定的规律，因此很难应用到中国的故事生成中。因此，我们尝试使用经典小说《半神半魔》中的1372段人工标记摘要来训练变压器网络。在我们的实验中，我们使用FastText将半神半魔数据集和大规模中文短文本摘要数据集结合起来作为输入数据。此外，通过采用两层自注意机制，我们获得了较低的损失率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Chinese Story Generation with FastText Transformer Network

The sequence transformer models are based on complex recurrent neural network or convolutional networks that include an encoder and a decoder. High-accuracy models are usually represented by used connect the encoder and decoder through an attention mechanism. Story generation is an important thing. If we can let computers learn the ability of story-telling, computers can help people do more things. Actually, the squence2squence model combine attention mechanism is being used to Chinese poetry generation. However, it difficult to apply in Chinese story generation, because there are some rules in Chinese poetry generation. Therefore, we trying to use 1372 human-labeled summarization of paragraphs from a classic novel named “Demi-Gods and Semi-Devils” (天龍八部) to train the transformer network. In our experiment, we use FastText to combine Demi-Gods and Semi-Devils Dataset and A Large Scale Chinese Short Text Summarization Dataset to be input data. In addition, we got a lower loss rate by using two layer of self-attention mechanism.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)

自引率

0.00%

发文量