Zibo Nie, Jianjun Cao, Nianfeng Weng, Xu Yu, Mengda Wang
{"title":"面向目标检测的基于对象的透视变换数据增强","authors":"Zibo Nie, Jianjun Cao, Nianfeng Weng, Xu Yu, Mengda Wang","doi":"10.1109/FAIML57028.2022.00043","DOIUrl":null,"url":null,"abstract":"Perspective transformation can mimic the perspective phenomenon in captured images, while few researches focused on utilizing perspective transformation to augment image data. To well exploit its potential in data augmentation, a novel object-based perspective transformation data augmentation framework is proposed in this paper. First, this framework automatically cuts out objects from images based on bounding boxes provided by corresponding annotations. Next, a simulated random perspective transformation is performed on only those cut objects rather than the whole images. The final step consists of the combination of transformed objects and backgrounds, together with calculation of new annotation. Our framework can generate new images by mimicking the perspective phenomenon caused by different orientations of objects and can also introduce new backgrounds at the same time, hence the further augmentation towards insufficient or imbalanced image data without requirements of additional manual manipulations. Experiments are carried out on the Pascal VOC2007 datasets and the results show the effectiveness of our framework on insufficient or imbalanced datasets.","PeriodicalId":307172,"journal":{"name":"2022 International Conference on Frontiers of Artificial Intelligence and Machine Learning (FAIML)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Object-Based Perspective Transformation Data Augmentation for Object Detection\",\"authors\":\"Zibo Nie, Jianjun Cao, Nianfeng Weng, Xu Yu, Mengda Wang\",\"doi\":\"10.1109/FAIML57028.2022.00043\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Perspective transformation can mimic the perspective phenomenon in captured images, while few researches focused on utilizing perspective transformation to augment image data. To well exploit its potential in data augmentation, a novel object-based perspective transformation data augmentation framework is proposed in this paper. First, this framework automatically cuts out objects from images based on bounding boxes provided by corresponding annotations. Next, a simulated random perspective transformation is performed on only those cut objects rather than the whole images. The final step consists of the combination of transformed objects and backgrounds, together with calculation of new annotation. Our framework can generate new images by mimicking the perspective phenomenon caused by different orientations of objects and can also introduce new backgrounds at the same time, hence the further augmentation towards insufficient or imbalanced image data without requirements of additional manual manipulations. Experiments are carried out on the Pascal VOC2007 datasets and the results show the effectiveness of our framework on insufficient or imbalanced datasets.\",\"PeriodicalId\":307172,\"journal\":{\"name\":\"2022 International Conference on Frontiers of Artificial Intelligence and Machine Learning (FAIML)\",\"volume\":\"74 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Frontiers of Artificial Intelligence and Machine Learning (FAIML)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FAIML57028.2022.00043\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Frontiers of Artificial Intelligence and Machine Learning (FAIML)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FAIML57028.2022.00043","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Object-Based Perspective Transformation Data Augmentation for Object Detection
Perspective transformation can mimic the perspective phenomenon in captured images, while few researches focused on utilizing perspective transformation to augment image data. To well exploit its potential in data augmentation, a novel object-based perspective transformation data augmentation framework is proposed in this paper. First, this framework automatically cuts out objects from images based on bounding boxes provided by corresponding annotations. Next, a simulated random perspective transformation is performed on only those cut objects rather than the whole images. The final step consists of the combination of transformed objects and backgrounds, together with calculation of new annotation. Our framework can generate new images by mimicking the perspective phenomenon caused by different orientations of objects and can also introduce new backgrounds at the same time, hence the further augmentation towards insufficient or imbalanced image data without requirements of additional manual manipulations. Experiments are carried out on the Pascal VOC2007 datasets and the results show the effectiveness of our framework on insufficient or imbalanced datasets.