Yasuhiro Miura, Yuki Sawamura, Yuki Shinomiya, Shinichi Yoshida
{"title":"Vegetable Mass Estimation based on Monocular Camera using Convolutional Neural Network","authors":"Yasuhiro Miura, Yuki Sawamura, Yuki Shinomiya, Shinichi Yoshida","doi":"10.1109/SMC42975.2020.9282930","DOIUrl":null,"url":null,"abstract":"Vegetable mass estimation from monocular RGB camera images is proposed. Vegetables are fragmented and placed on a conveyor belt of food processing machine and the monocular camera placed over the belt take pictures of vegetables on the belt. The proposed system does not employ any scale, load cell, and other mass scaling equipment. We apply pre-trained convolutional neural networks to estimate the mass of vegetables. Transfer learning including various levels of fine-tuning is also applied. For pre-trained network, we use Xception, VGG16, ResNet50, and Inception_v3, which are pre-trained using ImageNet. The result shows that the best estimation accuracy is achieved by VGG16, whose MAPE (mean average percentage error) is 11.1%. Additionally, we fine-tune VGG16 and the accuracy reduces to 7.9% for MAPE. From this result, the performance of CNN model can improve by fine-tuning. The proposed system can be applied to low-cost, high-speed, and efficient measurement of foods replaced to load cells.","PeriodicalId":6718,"journal":{"name":"2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC)","volume":"61 1","pages":"2106-2112"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SMC42975.2020.9282930","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Vegetable mass estimation from monocular RGB camera images is proposed. Vegetables are fragmented and placed on a conveyor belt of food processing machine and the monocular camera placed over the belt take pictures of vegetables on the belt. The proposed system does not employ any scale, load cell, and other mass scaling equipment. We apply pre-trained convolutional neural networks to estimate the mass of vegetables. Transfer learning including various levels of fine-tuning is also applied. For pre-trained network, we use Xception, VGG16, ResNet50, and Inception_v3, which are pre-trained using ImageNet. The result shows that the best estimation accuracy is achieved by VGG16, whose MAPE (mean average percentage error) is 11.1%. Additionally, we fine-tune VGG16 and the accuracy reduces to 7.9% for MAPE. From this result, the performance of CNN model can improve by fine-tuning. The proposed system can be applied to low-cost, high-speed, and efficient measurement of foods replaced to load cells.