Adaptive spatial aggregation and viewpoint alignment for three-dimensional online multiple fish tracking

IF 7.7 1区农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY

Computers and Electronics in Agriculture Pub Date : 2025-04-19 DOI:10.1016/j.compag.2025.110408

Yiran Liu , Beibei Li , Dingshuo Liu , Qingling Duan

{"title":"Adaptive spatial aggregation and viewpoint alignment for three-dimensional online multiple fish tracking","authors":"Yiran Liu , Beibei Li , Dingshuo Liu , Qingling Duan","doi":"10.1016/j.compag.2025.110408","DOIUrl":null,"url":null,"abstract":"<div><div>Three-dimensional (3D) multi-object tracking can simultaneously capture the movement trajectories of multiple fish, which is essential for understanding and analysing their movements and behavioural patterns in 3D space. It also provides essential data for applications such as water-quality monitoring, disease diagnosis, and ecological assessment. However, the multi-object tracking of fish in 3D space requires data associations across different perspectives. Variations in scale and appearance across perspectives can lead to inaccurate object positioning and low identification rates. In response to these challenges, in this study, an online 3D multi-object tracking method for fish is proposed based on adaptive spatial aggregation and viewpoint alignment. Dynamic deformable convolution networks (DCNv3) and upsampling techniques are employed to adaptively fuse the fixed-scale features generated by the backbone network, addressing the difficulties in object positioning caused by scale differences. The trajectories of the fish from both the top and side views are then obtained using a cascade tracker. Finally, a viewpoint-alignment approach is proposed to reconstruct the trajectories in 3D space using the two-dimensional (2D) trajectories, thereby avoiding the identity recognition issues caused by drastic changes in appearance. In verifying the effectiveness of the proposed algorithm on the 3D-ZeF20 zebrafish dataset, multi-object tracking accuracy (MOTA) reached 95.03 %; identification F1-score (IDF1) was 97.40 %; and monotonic mean time between failures (MTBFm) was 172 frames. The results demonstrate that this method addresses the difficulties in cross-view matching caused by changes in appearance and scale differences. It enables the simultaneous acquisition of fish multi-object trajectories from front view, top view, and in 3D space, thereby achieving precise online tracking of multiple fish.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"236 ","pages":"Article 110408"},"PeriodicalIF":7.7000,"publicationDate":"2025-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925005149","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Three-dimensional (3D) multi-object tracking can simultaneously capture the movement trajectories of multiple fish, which is essential for understanding and analysing their movements and behavioural patterns in 3D space. It also provides essential data for applications such as water-quality monitoring, disease diagnosis, and ecological assessment. However, the multi-object tracking of fish in 3D space requires data associations across different perspectives. Variations in scale and appearance across perspectives can lead to inaccurate object positioning and low identification rates. In response to these challenges, in this study, an online 3D multi-object tracking method for fish is proposed based on adaptive spatial aggregation and viewpoint alignment. Dynamic deformable convolution networks (DCNv3) and upsampling techniques are employed to adaptively fuse the fixed-scale features generated by the backbone network, addressing the difficulties in object positioning caused by scale differences. The trajectories of the fish from both the top and side views are then obtained using a cascade tracker. Finally, a viewpoint-alignment approach is proposed to reconstruct the trajectories in 3D space using the two-dimensional (2D) trajectories, thereby avoiding the identity recognition issues caused by drastic changes in appearance. In verifying the effectiveness of the proposed algorithm on the 3D-ZeF20 zebrafish dataset, multi-object tracking accuracy (MOTA) reached 95.03 %; identification F1-score (IDF1) was 97.40 %; and monotonic mean time between failures (MTBFm) was 172 frames. The results demonstrate that this method addresses the difficulties in cross-view matching caused by changes in appearance and scale differences. It enables the simultaneous acquisition of fish multi-object trajectories from front view, top view, and in 3D space, thereby achieving precise online tracking of multiple fish.

查看原文本刊更多论文

三维在线多鱼跟踪的自适应空间聚合和视点对齐

三维（3D）多目标跟踪可以同时捕获多条鱼的运动轨迹，这对于理解和分析它们在3D空间中的运动和行为模式至关重要。它还为水质监测、疾病诊断和生态评价等应用提供了必要的数据。然而，在三维空间中对鱼类进行多目标跟踪需要跨不同视角的数据关联。不同视角的尺度和外观变化可能导致物体定位不准确和识别率低。针对这些挑战，本研究提出了一种基于自适应空间聚合和视点对齐的鱼类在线三维多目标跟踪方法。采用动态变形卷积网络（DCNv3）和上采样技术自适应融合主干网产生的固定尺度特征，解决了尺度差异带来的目标定位困难。然后使用级联跟踪器从顶部和侧面视图获得鱼的轨迹。最后，提出了一种视点对齐方法，利用二维轨迹在三维空间中重建轨迹，从而避免了由于外观剧烈变化而导致的身份识别问题。在3D-ZeF20斑马鱼数据集上验证算法的有效性，多目标跟踪精度（MOTA）达到95.03%；鉴定f1评分（IDF1）为97.40%；单调平均故障间隔时间（MTBFm）为172帧。结果表明，该方法解决了因外观变化和尺度差异引起的交叉视匹配困难。它可以同时从前视图、俯视图和3D空间中获取鱼的多目标轨迹，从而实现对多条鱼的精确在线跟踪。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computers and Electronics in Agriculture 工程技术-计算机：跨学科应用

CiteScore

15.30

自引率

14.50%

发文量

800

审稿时长

62 days

期刊介绍： Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.