{"title":"使用基于层次结构的单词替换的文本增强","authors":"kimmuseong, Namgyu Kim","doi":"10.9708/JKSCI.2021.26.01.057","DOIUrl":null,"url":null,"abstract":"[Abstract] Recently, multi-modal deep learning techniques that combine heterogeneous data for deep learning analysis have been utilized a lot. In particular, studies on the synthesis of Text to Image that automatically generate images from text are being actively conducted. Deep learning for image synthesis requires a vast amount of data consisting of pairs of images and text describing the image. Therefore, various data augmentation techniques have been devised to generate a large amount of data from small data. A number of text augmentation techniques based on synonym replacement have been proposed so far. However, these techniques have a common limitation in that there is a possibility of generating a incorrect text from the content of an image when replacing the synonym for a noun word. In this study","PeriodicalId":17254,"journal":{"name":"Journal of the Korea Society of Computer and Information","volume":"20 1","pages":"57-67"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Text Augmentation Using Hierarchy-based Word Replacement\",\"authors\":\"kimmuseong, Namgyu Kim\",\"doi\":\"10.9708/JKSCI.2021.26.01.057\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"[Abstract] Recently, multi-modal deep learning techniques that combine heterogeneous data for deep learning analysis have been utilized a lot. In particular, studies on the synthesis of Text to Image that automatically generate images from text are being actively conducted. Deep learning for image synthesis requires a vast amount of data consisting of pairs of images and text describing the image. Therefore, various data augmentation techniques have been devised to generate a large amount of data from small data. A number of text augmentation techniques based on synonym replacement have been proposed so far. However, these techniques have a common limitation in that there is a possibility of generating a incorrect text from the content of an image when replacing the synonym for a noun word. In this study\",\"PeriodicalId\":17254,\"journal\":{\"name\":\"Journal of the Korea Society of Computer and Information\",\"volume\":\"20 1\",\"pages\":\"57-67\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the Korea Society of Computer and Information\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.9708/JKSCI.2021.26.01.057\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Korea Society of Computer and Information","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.9708/JKSCI.2021.26.01.057","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
摘要
[摘要]近年来,结合异构数据进行深度学习分析的多模态深度学习技术得到了广泛的应用。特别是从文本自动生成图像的Text to Image合成的研究正在积极进行。图像合成的深度学习需要大量的数据,包括成对的图像和描述图像的文本。因此,人们设计了各种数据增强技术,以便从小数据中生成大量数据。目前已经提出了许多基于同义词替换的文本增强技术。然而,这些技术有一个共同的局限性,即在替换名词词的同义词时,有可能从图像的内容生成不正确的文本。在这项研究中
Text Augmentation Using Hierarchy-based Word Replacement
[Abstract] Recently, multi-modal deep learning techniques that combine heterogeneous data for deep learning analysis have been utilized a lot. In particular, studies on the synthesis of Text to Image that automatically generate images from text are being actively conducted. Deep learning for image synthesis requires a vast amount of data consisting of pairs of images and text describing the image. Therefore, various data augmentation techniques have been devised to generate a large amount of data from small data. A number of text augmentation techniques based on synonym replacement have been proposed so far. However, these techniques have a common limitation in that there is a possibility of generating a incorrect text from the content of an image when replacing the synonym for a noun word. In this study