Yasuhiro Miura, Yuki Sawamura, Yuki Shinomiya, Shinichi Yoshida
{"title":"基于卷积神经网络的单目摄像机蔬菜质量估计","authors":"Yasuhiro Miura, Yuki Sawamura, Yuki Shinomiya, Shinichi Yoshida","doi":"10.1109/SMC42975.2020.9282930","DOIUrl":null,"url":null,"abstract":"Vegetable mass estimation from monocular RGB camera images is proposed. Vegetables are fragmented and placed on a conveyor belt of food processing machine and the monocular camera placed over the belt take pictures of vegetables on the belt. The proposed system does not employ any scale, load cell, and other mass scaling equipment. We apply pre-trained convolutional neural networks to estimate the mass of vegetables. Transfer learning including various levels of fine-tuning is also applied. For pre-trained network, we use Xception, VGG16, ResNet50, and Inception_v3, which are pre-trained using ImageNet. The result shows that the best estimation accuracy is achieved by VGG16, whose MAPE (mean average percentage error) is 11.1%. Additionally, we fine-tune VGG16 and the accuracy reduces to 7.9% for MAPE. From this result, the performance of CNN model can improve by fine-tuning. The proposed system can be applied to low-cost, high-speed, and efficient measurement of foods replaced to load cells.","PeriodicalId":6718,"journal":{"name":"2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC)","volume":"61 1","pages":"2106-2112"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Vegetable Mass Estimation based on Monocular Camera using Convolutional Neural Network\",\"authors\":\"Yasuhiro Miura, Yuki Sawamura, Yuki Shinomiya, Shinichi Yoshida\",\"doi\":\"10.1109/SMC42975.2020.9282930\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Vegetable mass estimation from monocular RGB camera images is proposed. Vegetables are fragmented and placed on a conveyor belt of food processing machine and the monocular camera placed over the belt take pictures of vegetables on the belt. The proposed system does not employ any scale, load cell, and other mass scaling equipment. We apply pre-trained convolutional neural networks to estimate the mass of vegetables. Transfer learning including various levels of fine-tuning is also applied. For pre-trained network, we use Xception, VGG16, ResNet50, and Inception_v3, which are pre-trained using ImageNet. The result shows that the best estimation accuracy is achieved by VGG16, whose MAPE (mean average percentage error) is 11.1%. Additionally, we fine-tune VGG16 and the accuracy reduces to 7.9% for MAPE. From this result, the performance of CNN model can improve by fine-tuning. The proposed system can be applied to low-cost, high-speed, and efficient measurement of foods replaced to load cells.\",\"PeriodicalId\":6718,\"journal\":{\"name\":\"2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC)\",\"volume\":\"61 1\",\"pages\":\"2106-2112\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SMC42975.2020.9282930\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SMC42975.2020.9282930","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Vegetable Mass Estimation based on Monocular Camera using Convolutional Neural Network
Vegetable mass estimation from monocular RGB camera images is proposed. Vegetables are fragmented and placed on a conveyor belt of food processing machine and the monocular camera placed over the belt take pictures of vegetables on the belt. The proposed system does not employ any scale, load cell, and other mass scaling equipment. We apply pre-trained convolutional neural networks to estimate the mass of vegetables. Transfer learning including various levels of fine-tuning is also applied. For pre-trained network, we use Xception, VGG16, ResNet50, and Inception_v3, which are pre-trained using ImageNet. The result shows that the best estimation accuracy is achieved by VGG16, whose MAPE (mean average percentage error) is 11.1%. Additionally, we fine-tune VGG16 and the accuracy reduces to 7.9% for MAPE. From this result, the performance of CNN model can improve by fine-tuning. The proposed system can be applied to low-cost, high-speed, and efficient measurement of foods replaced to load cells.