{"title":"使用视觉变换器进行空间关联的农业应用中的多目标跟踪","authors":"","doi":"10.1016/j.compag.2024.109379","DOIUrl":null,"url":null,"abstract":"<div><p>This paper introduces a Multi-Object Tracking (MOT) framework for agricultural applications that estimates global positions in pixel coordinates using the local feature matching transformer — LoFTR. We design an efficient tracker that augments the capabilities of a state-of-the-art tracking algorithm by incorporating a novel association strategy based on spatial information of targets leaving and returning the camera field of view. We evaluate our framework using the publicly available LettuceMOT benchmark dataset and an adapted version of the AppleMOTS benchmark dataset that we denominate AppleMOT. Our experimental results demonstrate that our method outperforms cutting-edge algorithms for robotic plant tracking in the LettuceMOT dataset. The evaluation metrics show average improvements of up to 25% compared to the best publicly available results, demonstrating the benefits of our spatial association approach. For the AppleMOT dataset, we obtained bounding-box-based MOT evaluation metrics comparable to the segmentation-based (MOTS) counterparts presented in the original AppleMOTS paper. These findings highlight the effectiveness and potential of our approach in addressing the unique challenges posed by agricultural environments.</p></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":null,"pages":null},"PeriodicalIF":7.7000,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-Object Tracking in Agricultural Applications using a Vision Transformer for Spatial Association\",\"authors\":\"\",\"doi\":\"10.1016/j.compag.2024.109379\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This paper introduces a Multi-Object Tracking (MOT) framework for agricultural applications that estimates global positions in pixel coordinates using the local feature matching transformer — LoFTR. We design an efficient tracker that augments the capabilities of a state-of-the-art tracking algorithm by incorporating a novel association strategy based on spatial information of targets leaving and returning the camera field of view. We evaluate our framework using the publicly available LettuceMOT benchmark dataset and an adapted version of the AppleMOTS benchmark dataset that we denominate AppleMOT. Our experimental results demonstrate that our method outperforms cutting-edge algorithms for robotic plant tracking in the LettuceMOT dataset. The evaluation metrics show average improvements of up to 25% compared to the best publicly available results, demonstrating the benefits of our spatial association approach. For the AppleMOT dataset, we obtained bounding-box-based MOT evaluation metrics comparable to the segmentation-based (MOTS) counterparts presented in the original AppleMOTS paper. These findings highlight the effectiveness and potential of our approach in addressing the unique challenges posed by agricultural environments.</p></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2024-09-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169924007701\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169924007701","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
本文介绍了一种用于农业应用的多目标跟踪(MOT)框架,该框架利用局部特征匹配变换器(LoFTR)以像素坐标估算全局位置。我们设计了一种高效的跟踪器,通过结合基于目标离开和返回相机视场的空间信息的新型关联策略,增强了最先进跟踪算法的功能。我们使用公开的 LettuceMOT 基准数据集和 AppleMOTS 基准数据集的改编版(我们称之为 AppleMOT)对我们的框架进行了评估。实验结果表明,在 LettuceMOT 数据集中,我们的方法优于用于机器人植物跟踪的先进算法。评估指标显示,与公开的最佳结果相比,我们的方法平均提高了 25%,这证明了我们的空间关联方法的优势。对于 AppleMOT 数据集,我们获得的基于边界框的 MOT 评估指标与最初 AppleMOTS 论文中提出的基于分割的 MOTS 评估指标相当。这些发现凸显了我们的方法在应对农业环境带来的独特挑战方面的有效性和潜力。
Multi-Object Tracking in Agricultural Applications using a Vision Transformer for Spatial Association
This paper introduces a Multi-Object Tracking (MOT) framework for agricultural applications that estimates global positions in pixel coordinates using the local feature matching transformer — LoFTR. We design an efficient tracker that augments the capabilities of a state-of-the-art tracking algorithm by incorporating a novel association strategy based on spatial information of targets leaving and returning the camera field of view. We evaluate our framework using the publicly available LettuceMOT benchmark dataset and an adapted version of the AppleMOTS benchmark dataset that we denominate AppleMOT. Our experimental results demonstrate that our method outperforms cutting-edge algorithms for robotic plant tracking in the LettuceMOT dataset. The evaluation metrics show average improvements of up to 25% compared to the best publicly available results, demonstrating the benefits of our spatial association approach. For the AppleMOT dataset, we obtained bounding-box-based MOT evaluation metrics comparable to the segmentation-based (MOTS) counterparts presented in the original AppleMOTS paper. These findings highlight the effectiveness and potential of our approach in addressing the unique challenges posed by agricultural environments.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.