MixSegNext: A CNN-Transformer hybrid model for semantic segmentation and picking point localization algorithm of Sichuan pepper in natural environments

IF 7.7 1区农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY

Computers and Electronics in Agriculture Pub Date : 2025-05-24 DOI:10.1016/j.compag.2025.110564

Pengjun Xiang , Fei Pan , Tao Liu , Xiaoyu Zhao , Mengdie Hu , Dawei He , Boda Zhang

{"title":"MixSegNext: A CNN-Transformer hybrid model for semantic segmentation and picking point localization algorithm of Sichuan pepper in natural environments","authors":"Pengjun Xiang , Fei Pan , Tao Liu , Xiaoyu Zhao , Mengdie Hu , Dawei He , Boda Zhang","doi":"10.1016/j.compag.2025.110564","DOIUrl":null,"url":null,"abstract":"<div><div>Precise identification of Sichuan pepper picking points is a prerequisite for the robotic harvesting of the crop. Picking robots typically operate in open, dynamic natural environments, which demands robustness in the Sichuan pepper picking point localization algorithm. Generally, the growth environment of Sichuan pepper is complex, and the growth posture varies. The branches of the pepper clusters are similar to the pepper branches, which can easily lead to misjudgment and omission in the localization process, making accurate visual picking point localization challenging. To rapidly and accurately locate target Sichuan pepper picking points in natural environments, this paper proposes a Sichuan pepper segmentation model and picking point localization algorithm based on MixSegNext. The algorithm is divided into three main parts. First, the MixSegNext network performs semantic segmentation on Sichuan pepper clusters and fruits to extract the picking targets. Then, by subtracting the extracted pepper fruit mask from the pepper cluster mask, the Sichuan pepper branch mask is obtained, and the main pepper branch mask is acquired through morphological operations and maximal connectivity analysis. Finally, edge extraction is performed on the main pepper branch mask, and the picking point is determined by finding the intersection between the central line of the contour and the edge. This paper compares MixSegNext with typical semantic segmentation networks and conducts picking point localization experiments. The results show that the network has better segmentation precision and high picking point localization accuracy. Furthermore, this paper deploys the network on embedded devices to perform Sichuan pepper inference segmentation, verifying the application effect of the algorithm, which can provide a reference for the visual positioning system of Sichuan pepper-picking robots.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"237 ","pages":"Article 110564"},"PeriodicalIF":7.7000,"publicationDate":"2025-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925006702","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Precise identification of Sichuan pepper picking points is a prerequisite for the robotic harvesting of the crop. Picking robots typically operate in open, dynamic natural environments, which demands robustness in the Sichuan pepper picking point localization algorithm. Generally, the growth environment of Sichuan pepper is complex, and the growth posture varies. The branches of the pepper clusters are similar to the pepper branches, which can easily lead to misjudgment and omission in the localization process, making accurate visual picking point localization challenging. To rapidly and accurately locate target Sichuan pepper picking points in natural environments, this paper proposes a Sichuan pepper segmentation model and picking point localization algorithm based on MixSegNext. The algorithm is divided into three main parts. First, the MixSegNext network performs semantic segmentation on Sichuan pepper clusters and fruits to extract the picking targets. Then, by subtracting the extracted pepper fruit mask from the pepper cluster mask, the Sichuan pepper branch mask is obtained, and the main pepper branch mask is acquired through morphological operations and maximal connectivity analysis. Finally, edge extraction is performed on the main pepper branch mask, and the picking point is determined by finding the intersection between the central line of the contour and the edge. This paper compares MixSegNext with typical semantic segmentation networks and conducts picking point localization experiments. The results show that the network has better segmentation precision and high picking point localization accuracy. Furthermore, this paper deploys the network on embedded devices to perform Sichuan pepper inference segmentation, verifying the application effect of the algorithm, which can provide a reference for the visual positioning system of Sichuan pepper-picking robots.

查看原文本刊更多论文

MixSegNext：一种CNN-Transformer混合模型用于自然环境下花椒语义分割和采摘点定位算法

准确识别花椒采摘点是实现机器人收割的先决条件。采摘机器人通常在开放、动态的自然环境中工作，这就要求花椒采摘点定位算法具有鲁棒性。一般来说，花椒的生长环境是复杂的，生长姿势也各不相同。辣椒聚类的分支与辣椒的分支相似，容易导致定位过程中的误判和遗漏，给准确的视觉拾取点定位带来挑战。为了在自然环境中快速准确地定位目标花椒采摘点，本文提出了一种基于MixSegNext的花椒分割模型和采摘点定位算法。该算法主要分为三个部分。首先，MixSegNext网络对花椒聚类和果实进行语义分割，提取采摘目标；然后，从辣椒簇掩码中减去提取的辣椒果掩码，得到花椒枝掩码，并通过形态学运算和最大连通性分析得到主辣椒枝掩码。最后对主辣椒枝掩模进行边缘提取，通过寻找轮廓中心线与边缘的交点确定采摘点。本文将MixSegNext与典型的语义分割网络进行了比较，并进行了拾取点定位实验。结果表明，该网络具有较好的分割精度和较高的拾取点定位精度。并将该网络部署在嵌入式设备上进行花椒推理分割，验证了算法的应用效果，为花椒采摘机器人的视觉定位系统提供参考。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computers and Electronics in Agriculture 工程技术-计算机：跨学科应用

CiteScore

15.30

自引率

14.50%

发文量

800

审稿时长

62 days

期刊介绍： Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.