Abdulaziz Anorboev, Javokhir Musaev, Sarvinoz Anorboeva, Jeongkyu Hong, Yeong-Seok Seo, N. Nguyen, D. Hwang
Computer Science and Information Systems, vol. 20, pp. 1503-1517, 2023. DOI: 10.2298/csis230223056a (https://doi.org/10.2298/csis230223056a)
Ensemble of top3 prediction with image pixel interval method using deep learning
Computer vision (CV) has been successfully applied to image classification in various fields, including medicine, production quality control, and transportation systems. CV models typically require a very large number of images for training. Because image acquisition is usually expensive and time-consuming, in this study we propose a multistep strategy to improve image classification accuracy with less data. In the first stage, we constructed multiple datasets from a single dataset. Given that image pixels take values from 0 to 255, the images were separated into pixel intervals according to the type of dataset: the pixel range was split into two portions for grayscale datasets and five portions for RGB datasets. Next, we trained the model on both the original and the newly constructed datasets. Each image in the training process showed a non-identical prediction space, so we propose a top-three prediction probability ensemble technique, in which the top three predictions for the newly created images are combined with the corresponding probability for the original image. The results showed that learning patterns from each pixel interval and ensembling the top three predictions significantly improve performance and accuracy, and that this strategy can be used with any model.
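The abstract does not give implementation details, so the following is only a minimal NumPy sketch of one possible reading of the two steps it describes. It assumes equal-width intervals over 0 to 255, assumes that pixels outside an interval are set to zero, and assumes that "combined" means adding each interval model's three largest class probabilities to the original model's probabilities. The function names `split_by_pixel_interval` and `top3_ensemble` are hypothetical and do not come from the paper.

```python
import numpy as np


def split_by_pixel_interval(image, n_intervals):
    """Return n_intervals copies of `image`, one per pixel-value interval.

    Assumption: intervals are equal-width over [0, 256) and pixels whose
    value falls outside the interval are set to 0.
    """
    edges = np.linspace(0, 256, n_intervals + 1)
    parts = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (image >= lo) & (image < hi)
        parts.append(np.where(mask, image, 0).astype(image.dtype))
    return parts


def top3_ensemble(original_probs, interval_probs):
    """Combine top-3 probabilities of interval models with the original model.

    Assumption: for each interval model, the probabilities of its three
    highest-scoring classes are added to the original model's probabilities
    for those same classes; the predicted class is the argmax of the sum.
    """
    combined = np.asarray(original_probs, dtype=float).copy()
    for probs in interval_probs:
        probs = np.asarray(probs, dtype=float)
        top3 = np.argsort(probs)[-3:]  # indices of the three largest values
        combined[top3] += probs[top3]
    return int(np.argmax(combined)), combined


# Toy usage with made-up numbers (not results from the paper).
gray = np.array([[0, 100], [128, 255]], dtype=np.uint8)
low_part, high_part = split_by_pixel_interval(gray, n_intervals=2)

label, scores = top3_ensemble(
    original_probs=[0.40, 0.35, 0.15, 0.10],
    interval_probs=[[0.05, 0.60, 0.25, 0.10], [0.20, 0.50, 0.25, 0.05]],
)
print(label, scores)
```

In this sketch each interval image would be fed to a model trained on the matching interval dataset; the probability vectors above stand in for those model outputs. For RGB data the same split would be called with `n_intervals=5`, applied here element-wise across all channels, which is also an assumption.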
About the journal:
Aims and scope
Computer Science and Information Systems (ComSIS) is an international refereed journal, published in Serbia. The objective of ComSIS is to communicate important research and development results in the areas of computer science, software engineering, and information systems.