{"title":"基于 CNN 的智能预训练模型在 DOTA 上进行物体检测的对比分析","authors":"Hina Hashmi, Rakesh Kumar Dwivedi, Anil Kumar","doi":"10.14313/jamris/2-2024/11","DOIUrl":null,"url":null,"abstract":"In this paper, we proposed comparative research on the classification of various objects in satellite images using some pre-trained models of CNN (VGG-16, Inception-V3, ResNet-50, EfficientNet-B7) and R-CNN. In this research work, we have used the DOTA dataset, which combines data from 14 classes. We have implemented above mentioned pre-trained models of CNN, and R-CNN to achieve optimal results for accuracy as well as productivity. To detect objects like ships, tennis courts, swimming pools, vehicles, and harbors from remotely accessed images. In this study, we have used a convolutional neural network (CNN) as the base model. The transfer learning mechanism is employed to speed up the results and for complex computations. We have discovered with the help of experimental analysis that R-CNN and Inception-V3 are performing best out of the five pre-trained models.","PeriodicalId":37910,"journal":{"name":"Journal of Automation, Mobile Robotics and Intelligent Systems","volume":"7 12","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparative Analysis of CNN-Based Smart Pre-Trained Models for Object Detection on DOTA\",\"authors\":\"Hina Hashmi, Rakesh Kumar Dwivedi, Anil Kumar\",\"doi\":\"10.14313/jamris/2-2024/11\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we proposed comparative research on the classification of various objects in satellite images using some pre-trained models of CNN (VGG-16, Inception-V3, ResNet-50, EfficientNet-B7) and R-CNN. In this research work, we have used the DOTA dataset, which combines data from 14 classes. We have implemented above mentioned pre-trained models of CNN, and R-CNN to achieve optimal results for accuracy as well as productivity. To detect objects like ships, tennis courts, swimming pools, vehicles, and harbors from remotely accessed images. In this study, we have used a convolutional neural network (CNN) as the base model. The transfer learning mechanism is employed to speed up the results and for complex computations. We have discovered with the help of experimental analysis that R-CNN and Inception-V3 are performing best out of the five pre-trained models.\",\"PeriodicalId\":37910,\"journal\":{\"name\":\"Journal of Automation, Mobile Robotics and Intelligent Systems\",\"volume\":\"7 12\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Automation, Mobile Robotics and Intelligent Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.14313/jamris/2-2024/11\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Engineering\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Automation, Mobile Robotics and Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14313/jamris/2-2024/11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Engineering","Score":null,"Total":0}
Comparative Analysis of CNN-Based Smart Pre-Trained Models for Object Detection on DOTA
In this paper, we proposed comparative research on the classification of various objects in satellite images using some pre-trained models of CNN (VGG-16, Inception-V3, ResNet-50, EfficientNet-B7) and R-CNN. In this research work, we have used the DOTA dataset, which combines data from 14 classes. We have implemented above mentioned pre-trained models of CNN, and R-CNN to achieve optimal results for accuracy as well as productivity. To detect objects like ships, tennis courts, swimming pools, vehicles, and harbors from remotely accessed images. In this study, we have used a convolutional neural network (CNN) as the base model. The transfer learning mechanism is employed to speed up the results and for complex computations. We have discovered with the help of experimental analysis that R-CNN and Inception-V3 are performing best out of the five pre-trained models.
期刊介绍:
Fundamentals of automation and robotics Applied automatics Mobile robots control Distributed systems Navigation Mechatronics systems in robotics Sensors and actuators Data transmission Biomechatronics Mobile computing