{"title":"Intersection over Union based analysis of Image detection/segmentation using CNN model","authors":"Amitkumar N Gajjar, Jignesh B. Jethva","doi":"10.1109/ICPC2T53885.2022.9776896","DOIUrl":null,"url":null,"abstract":"Neural networks are capable of learning high-dimensional hierarchical structures of objects from huge quantities Deep-learning systems can learn to recognize photographs based on a large amount of training data. Artificial intelligence has this as one of its features. Deep-learning algorithms for picture interpretation may be divided into two groups. SegNet, U-Net, and SharpMask are examples of fully convolutional methods that use an encoder-decoder architecture. Region-based methods, on the other hand, use a convolutional neural network (CNNs) stack to extract features, such as Mask-RCNN, PSP Net and DeepLab. When the networks are trained on a large enough number of annotated datasets, region-based methods beat for most image segmentation tasks, fully convolutional techniques are used. We designed and incorporated deep-learning techniques based on Mask-RCNN to detect 2D images while creating a segmentation for each mask item in this paper.","PeriodicalId":283298,"journal":{"name":"2022 Second International Conference on Power, Control and Computing Technologies (ICPC2T)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 Second International Conference on Power, Control and Computing Technologies (ICPC2T)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPC2T53885.2022.9776896","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Neural networks are capable of learning high-dimensional hierarchical structures of objects from huge quantities Deep-learning systems can learn to recognize photographs based on a large amount of training data. Artificial intelligence has this as one of its features. Deep-learning algorithms for picture interpretation may be divided into two groups. SegNet, U-Net, and SharpMask are examples of fully convolutional methods that use an encoder-decoder architecture. Region-based methods, on the other hand, use a convolutional neural network (CNNs) stack to extract features, such as Mask-RCNN, PSP Net and DeepLab. When the networks are trained on a large enough number of annotated datasets, region-based methods beat for most image segmentation tasks, fully convolutional techniques are used. We designed and incorporated deep-learning techniques based on Mask-RCNN to detect 2D images while creating a segmentation for each mask item in this paper.