Cassava stalk detection for a cassava harvesting robot based on YOLO v4 and Mask R-CNN

IF 2.4 4区 农林科学 Q2 AGRICULTURAL ENGINEERING
T. Singhpoo, K. Saengprachatanarug, S. Wongpichet, J. Posom, Kanda Runapongsa Saikaew
{"title":"Cassava stalk detection for a cassava harvesting robot based on YOLO v4 and Mask R-CNN","authors":"T. Singhpoo, K. Saengprachatanarug, S. Wongpichet, J. Posom, Kanda Runapongsa Saikaew","doi":"10.4081/jae.2023.1301","DOIUrl":null,"url":null,"abstract":"The quality of fresh cassava roots can be increased through the use of precision equipment. As a first step towards developing an automatic cassava root cutting system, this study demonstrates the use of a computer vision system with deep learning for cassava stalk detection. An RGB image of a cassava tree mounted on a cassava-pulling machine was captured, and the YOLO v4 model and two Mask R-CNN models with ResNet 101 and ResNet 50 base architectures were employed to train the weights to predict the position of the cassava stalk. One hundred test images of stalks of various shapes and sizes were used to determine the grasping point and inclination, and the results from manual annotation were compared with the predicted results. Regarding localisation, Mask R-CNN with ResNet 101 gave a significantly higher performance than the other models, with an F1 score and a mean IoU of 0.81 and 0.70, respectively. YOLO v4 showed the highest correlation for the x- and y-coordinates for the prediction of the grasping point, with values for R2 of 0.89 and 0.53, respectively. For inclination prediction, Mask R-CNN with ResNet 101 and Mask R-CNN with ResNet 50 gave the same level of correlation, with values for R2 of 0.50 and 0.61, respectively. These results were acceptable for use as design criteria for developing a cassava rootcutting robot.","PeriodicalId":48507,"journal":{"name":"Journal of Agricultural Engineering","volume":"36 1","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Agricultural Engineering","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.4081/jae.2023.1301","RegionNum":4,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0

Abstract

The quality of fresh cassava roots can be increased through the use of precision equipment. As a first step towards developing an automatic cassava root cutting system, this study demonstrates the use of a computer vision system with deep learning for cassava stalk detection. An RGB image of a cassava tree mounted on a cassava-pulling machine was captured, and the YOLO v4 model and two Mask R-CNN models with ResNet 101 and ResNet 50 base architectures were employed to train the weights to predict the position of the cassava stalk. One hundred test images of stalks of various shapes and sizes were used to determine the grasping point and inclination, and the results from manual annotation were compared with the predicted results. Regarding localisation, Mask R-CNN with ResNet 101 gave a significantly higher performance than the other models, with an F1 score and a mean IoU of 0.81 and 0.70, respectively. YOLO v4 showed the highest correlation for the x- and y-coordinates for the prediction of the grasping point, with values for R2 of 0.89 and 0.53, respectively. For inclination prediction, Mask R-CNN with ResNet 101 and Mask R-CNN with ResNet 50 gave the same level of correlation, with values for R2 of 0.50 and 0.61, respectively. These results were acceptable for use as design criteria for developing a cassava rootcutting robot.
基于YOLO v4和Mask R-CNN的木薯收割机器人木薯秸秆检测
通过精密设备的使用,可以提高新鲜木薯根的质量。作为开发木薯自动切根系统的第一步,本研究展示了使用具有深度学习的计算机视觉系统进行木薯茎检测。采集了一棵挂在木薯采摘机上的木薯树的RGB图像,采用YOLO v4模型和两个基于ResNet 101和ResNet 50基础架构的Mask R-CNN模型训练权值来预测木薯茎秆的位置。利用100张不同形状和大小的秸秆试验图像确定抓取点和抓取倾角,并将人工标注结果与预测结果进行比较。在本地化方面,基于ResNet 101的Mask R-CNN的表现明显高于其他模型,其F1得分和平均IoU分别为0.81和0.70。YOLO v4与抓取点预测的x坐标和y坐标相关性最高,R2分别为0.89和0.53。对于倾角预测,使用ResNet 101的Mask R-CNN和使用ResNet 50的Mask R-CNN具有相同的相关性,R2分别为0.50和0.61。这些结果可以作为开发木薯切根机器人的设计标准。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Agricultural Engineering
Journal of Agricultural Engineering AGRICULTURAL ENGINEERING-
CiteScore
2.30
自引率
5.60%
发文量
40
审稿时长
10 weeks
期刊介绍: The Journal of Agricultural Engineering (JAE) is the official journal of the Italian Society of Agricultural Engineering supported by University of Bologna, Italy. The subject matter covers a complete and interdisciplinary range of research in engineering for agriculture and biosystems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信