Balanced Mini-Batch Training for Imbalanced Image Data Classification with Neural Network

Ryota Shimizu, Kosuke Asako, Hiroki Ojima, Shohei Morinaga, M. Hamada, T. Kuroda
{"title":"Balanced Mini-Batch Training for Imbalanced Image Data Classification with Neural Network","authors":"Ryota Shimizu, Kosuke Asako, Hiroki Ojima, Shohei Morinaga, M. Hamada, T. Kuroda","doi":"10.1109/AI4I.2018.8665709","DOIUrl":null,"url":null,"abstract":"We propose a novel method of training neural networks for industrial image classification that can reduce the effect of imbalanced data in supervised training. We considered visual quality inspection of industrial products as an image-classification task and attempted to solve this with a convolutional neural network; however, a problem of imbalanced data emerged in supervised training in which the neural network cannot optimize parameters. Since most industrial products are not defective, samples of defective products were fewer than those of the non-defective products; this difference in the number of samples causes an imbalance in training data. A neural network trained with imbalanced data often has varied levels of precision in determining each class depending on the difference in the number of class samples in the training data, which is a significant problem in industrial quality inspection. As a solution to this problem, we propose a balanced mini-batch training method that can virtually balance the class ratio of training samples. In an experiment, the neural network trained with the proposed method achieved higher classification ability than that trained with over-sampled or undersampled data for two types of imbalanced image datasets.","PeriodicalId":133657,"journal":{"name":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 First International Conference on Artificial Intelligence for Industries (AI4I)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AI4I.2018.8665709","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

Abstract

We propose a novel method of training neural networks for industrial image classification that can reduce the effect of imbalanced data in supervised training. We considered visual quality inspection of industrial products as an image-classification task and attempted to solve this with a convolutional neural network; however, a problem of imbalanced data emerged in supervised training in which the neural network cannot optimize parameters. Since most industrial products are not defective, samples of defective products were fewer than those of the non-defective products; this difference in the number of samples causes an imbalance in training data. A neural network trained with imbalanced data often has varied levels of precision in determining each class depending on the difference in the number of class samples in the training data, which is a significant problem in industrial quality inspection. As a solution to this problem, we propose a balanced mini-batch training method that can virtually balance the class ratio of training samples. In an experiment, the neural network trained with the proposed method achieved higher classification ability than that trained with over-sampled or undersampled data for two types of imbalanced image datasets.
基于神经网络的不平衡图像数据分类的平衡小批训练
提出了一种新的工业图像分类神经网络训练方法,可以减少监督训练中不平衡数据的影响。我们认为工业产品的视觉质量检测是一个图像分类任务,并试图用卷积神经网络来解决这个问题;然而,在监督训练中出现了数据不平衡的问题,神经网络无法优化参数。由于大多数工业产品不存在缺陷,因此缺陷产品的样品少于非缺陷产品的样品;样本数量的差异导致了训练数据的不平衡。用不平衡数据训练的神经网络,由于训练数据中类别样本数量的不同,在确定每个类别时往往具有不同程度的精度,这是工业质量检测中的一个重要问题。为了解决这一问题,我们提出了一种平衡的小批量训练方法,该方法实际上可以平衡训练样本的类比。在实验中,使用本文方法训练的神经网络对两类不平衡图像数据集的分类能力优于使用过采样或欠采样数据训练的神经网络。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信