{"title":"部分同步GAN在图像和标题数据生成中的应用","authors":"Kovtun Valery, Sajjad Kamali Siahroudi, Wei Gang","doi":"10.1145/3373419.3373431","DOIUrl":null,"url":null,"abstract":"A common problem that is seen in most practical applications of the general machine learning algorithms is the insufficient amount of data that has been already tagged for the application in hand. The abundance of unsyncronized (untagged) data is growing exponentially, yet its useability for practical applications is limited due to the amount of time it would take to process each data entry and classify it for each specific task. PSyncGAN is a novel approach that is able to suppliment existing sets of data that are either incomplete or insufficient for practical big data and machine learning applications. In our model, we are solving two multi-modal generative problems at the same time, Image captioning and Image Generation from natural language description with a limited amount of synchronized textual descriptions and images. The benefit of such a model is that there is no strict requirement of having synchronized corpora of text and image, but can actually make heavy use of the large amounts of singular data freely available on the internet, thus to be able to train on an almost infinite amount of data, including the data generated by the model itself. The results described below are indeed very promising, and could constitute a deeper research direction. Besides being able to train on the limited amount of training data, in this paper we also show that this model can be used as a basis for various other deep generative methods that are learning data distribution correlations (both symmetric and asymmetric) as a training data extension, by providing endless training examples for each proposed training data pair.","PeriodicalId":352528,"journal":{"name":"Proceedings of the 2019 3rd International Conference on Advances in Image Processing","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Psyncgan for Data Generation Application of Partially Syncronized GAN to Image and Caption Data Generation\",\"authors\":\"Kovtun Valery, Sajjad Kamali Siahroudi, Wei Gang\",\"doi\":\"10.1145/3373419.3373431\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A common problem that is seen in most practical applications of the general machine learning algorithms is the insufficient amount of data that has been already tagged for the application in hand. The abundance of unsyncronized (untagged) data is growing exponentially, yet its useability for practical applications is limited due to the amount of time it would take to process each data entry and classify it for each specific task. PSyncGAN is a novel approach that is able to suppliment existing sets of data that are either incomplete or insufficient for practical big data and machine learning applications. In our model, we are solving two multi-modal generative problems at the same time, Image captioning and Image Generation from natural language description with a limited amount of synchronized textual descriptions and images. The benefit of such a model is that there is no strict requirement of having synchronized corpora of text and image, but can actually make heavy use of the large amounts of singular data freely available on the internet, thus to be able to train on an almost infinite amount of data, including the data generated by the model itself. The results described below are indeed very promising, and could constitute a deeper research direction. Besides being able to train on the limited amount of training data, in this paper we also show that this model can be used as a basis for various other deep generative methods that are learning data distribution correlations (both symmetric and asymmetric) as a training data extension, by providing endless training examples for each proposed training data pair.\",\"PeriodicalId\":352528,\"journal\":{\"name\":\"Proceedings of the 2019 3rd International Conference on Advances in Image Processing\",\"volume\":\"34 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2019 3rd International Conference on Advances in Image Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3373419.3373431\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 3rd International Conference on Advances in Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3373419.3373431","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Psyncgan for Data Generation Application of Partially Syncronized GAN to Image and Caption Data Generation
A common problem that is seen in most practical applications of the general machine learning algorithms is the insufficient amount of data that has been already tagged for the application in hand. The abundance of unsyncronized (untagged) data is growing exponentially, yet its useability for practical applications is limited due to the amount of time it would take to process each data entry and classify it for each specific task. PSyncGAN is a novel approach that is able to suppliment existing sets of data that are either incomplete or insufficient for practical big data and machine learning applications. In our model, we are solving two multi-modal generative problems at the same time, Image captioning and Image Generation from natural language description with a limited amount of synchronized textual descriptions and images. The benefit of such a model is that there is no strict requirement of having synchronized corpora of text and image, but can actually make heavy use of the large amounts of singular data freely available on the internet, thus to be able to train on an almost infinite amount of data, including the data generated by the model itself. The results described below are indeed very promising, and could constitute a deeper research direction. Besides being able to train on the limited amount of training data, in this paper we also show that this model can be used as a basis for various other deep generative methods that are learning data distribution correlations (both symmetric and asymmetric) as a training data extension, by providing endless training examples for each proposed training data pair.