Mansi Bende, Mayank Khandelwal, Dhanashree Borgaonkar, Prashant K. Khobragade
{"title":"VISMA:一种图像处理的机器学习方法","authors":"Mansi Bende, Mayank Khandelwal, Dhanashree Borgaonkar, Prashant K. Khobragade","doi":"10.1109/ISCON57294.2023.10112168","DOIUrl":null,"url":null,"abstract":"Powerful tools for semantic picture modification have recently been developed thanks to recent improvements in image production. Existing methods, however, may need a large amount of supplementary data. They are unable to handle all editing actions, including the insertion, modification, or deletion of semantic attributes of the image. The proposed method, a new Image Semantic Manipulator pair for Adding, Moving, or Erasing Objects in Scenes during Semantic Editing, to overcome these restrictions. In our arrangement, the generator generates the relevant pixels and edits the image when the end user gives the semantic identifiers of the regions to be changed. The paper aims to present results for two jobs: (a) picture editing and (b) image production with semantic label constraint using General Adversarial Networks (GAN) machine learning approach. A pre-trained neural network can also adjust to images of various sizes. Making the image fit the needs of the user by altering the overall aesthetic and emotional tone also helps. When given appropriate training input/output pairings, our technique may be expanded to provide a range of outputs when label mappings are changed, enabling interactive image modification that enables users to interact with changing an object’s appearance in real time. Last but not least, our suggested approach will raise resolution while establishing a new standard for realistic photos.","PeriodicalId":280183,"journal":{"name":"2023 6th International Conference on Information Systems and Computer Networks (ISCON)","volume":"95 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"VISMA: A Machine Learning Approach to Image Manipulation\",\"authors\":\"Mansi Bende, Mayank Khandelwal, Dhanashree Borgaonkar, Prashant K. Khobragade\",\"doi\":\"10.1109/ISCON57294.2023.10112168\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Powerful tools for semantic picture modification have recently been developed thanks to recent improvements in image production. Existing methods, however, may need a large amount of supplementary data. They are unable to handle all editing actions, including the insertion, modification, or deletion of semantic attributes of the image. The proposed method, a new Image Semantic Manipulator pair for Adding, Moving, or Erasing Objects in Scenes during Semantic Editing, to overcome these restrictions. In our arrangement, the generator generates the relevant pixels and edits the image when the end user gives the semantic identifiers of the regions to be changed. The paper aims to present results for two jobs: (a) picture editing and (b) image production with semantic label constraint using General Adversarial Networks (GAN) machine learning approach. A pre-trained neural network can also adjust to images of various sizes. Making the image fit the needs of the user by altering the overall aesthetic and emotional tone also helps. When given appropriate training input/output pairings, our technique may be expanded to provide a range of outputs when label mappings are changed, enabling interactive image modification that enables users to interact with changing an object’s appearance in real time. Last but not least, our suggested approach will raise resolution while establishing a new standard for realistic photos.\",\"PeriodicalId\":280183,\"journal\":{\"name\":\"2023 6th International Conference on Information Systems and Computer Networks (ISCON)\",\"volume\":\"95 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 6th International Conference on Information Systems and Computer Networks (ISCON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCON57294.2023.10112168\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 6th International Conference on Information Systems and Computer Networks (ISCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCON57294.2023.10112168","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
VISMA: A Machine Learning Approach to Image Manipulation
Powerful tools for semantic picture modification have recently been developed thanks to recent improvements in image production. Existing methods, however, may need a large amount of supplementary data. They are unable to handle all editing actions, including the insertion, modification, or deletion of semantic attributes of the image. The proposed method, a new Image Semantic Manipulator pair for Adding, Moving, or Erasing Objects in Scenes during Semantic Editing, to overcome these restrictions. In our arrangement, the generator generates the relevant pixels and edits the image when the end user gives the semantic identifiers of the regions to be changed. The paper aims to present results for two jobs: (a) picture editing and (b) image production with semantic label constraint using General Adversarial Networks (GAN) machine learning approach. A pre-trained neural network can also adjust to images of various sizes. Making the image fit the needs of the user by altering the overall aesthetic and emotional tone also helps. When given appropriate training input/output pairings, our technique may be expanded to provide a range of outputs when label mappings are changed, enabling interactive image modification that enables users to interact with changing an object’s appearance in real time. Last but not least, our suggested approach will raise resolution while establishing a new standard for realistic photos.