Bit-STED:一个轻量级的转换器,用于精确的龙舌兰计数与无人机图像

IF 8.9 1区 农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY
Diego Villatoro-Geronimo, Gildardo Sanchez-Ante, Luis E. Falcon-Morales
{"title":"Bit-STED:一个轻量级的转换器,用于精确的龙舌兰计数与无人机图像","authors":"Diego Villatoro-Geronimo,&nbsp;Gildardo Sanchez-Ante,&nbsp;Luis E. Falcon-Morales","doi":"10.1016/j.compag.2025.111047","DOIUrl":null,"url":null,"abstract":"<div><div>This paper presented Bit-STED, a novel and simplified transformer encoder architecture for efficient agave plant detection and accurate counting using unmanned aerial vehicle (UAV) imagery. Addressing the critical need for accessible and cost-efficient solutions in agricultural monitoring, this approach automates a process that is typically time-consuming, labor-intensive, and prone to human error in manual practices. The Bit-STED model features a lightweight transformer design that incorporates innovative techniques for efficient feature extraction, model compression through quantization, and shape-aware object localization using circular bounding boxes for the roughly circular shape of the agave rosettes. To complement the detection model, a novel counting algorithm was developed to manage plants spanning multiple image tiles accurately. The experimental results demonstrated that the Bit-STED model outperformed the baseline models in terms of detection and agave plant count performance. Specifically, the Bit-STED nano model achieved F1 scores of 96.66% on a map with younger plants and 96.43% on a map with larger, highly overlapping plants. These scores surpassed state-of-the-art baselines, such as YOLOv8 Nano (F1 scores of 96.42% and 96.38%, respectively) and DETR (F1 scores of 93.03% and 85.61%, respectively). Furthermore, the Bit-STED nano model was significantly smaller, being less than one-eighth the size of the YOLOv8 nano model (1.4 MB compared to 12.0 MB), had fewer trainable parameters (0.35M compared to 3.01M), and was faster in average inference times (14.62 ms compared to 18.28 ms).</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"239 ","pages":"Article 111047"},"PeriodicalIF":8.9000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Bit-STED: A lightweight transformer for accurate agave counting with UAV imagery\",\"authors\":\"Diego Villatoro-Geronimo,&nbsp;Gildardo Sanchez-Ante,&nbsp;Luis E. Falcon-Morales\",\"doi\":\"10.1016/j.compag.2025.111047\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper presented Bit-STED, a novel and simplified transformer encoder architecture for efficient agave plant detection and accurate counting using unmanned aerial vehicle (UAV) imagery. Addressing the critical need for accessible and cost-efficient solutions in agricultural monitoring, this approach automates a process that is typically time-consuming, labor-intensive, and prone to human error in manual practices. The Bit-STED model features a lightweight transformer design that incorporates innovative techniques for efficient feature extraction, model compression through quantization, and shape-aware object localization using circular bounding boxes for the roughly circular shape of the agave rosettes. To complement the detection model, a novel counting algorithm was developed to manage plants spanning multiple image tiles accurately. The experimental results demonstrated that the Bit-STED model outperformed the baseline models in terms of detection and agave plant count performance. Specifically, the Bit-STED nano model achieved F1 scores of 96.66% on a map with younger plants and 96.43% on a map with larger, highly overlapping plants. These scores surpassed state-of-the-art baselines, such as YOLOv8 Nano (F1 scores of 96.42% and 96.38%, respectively) and DETR (F1 scores of 93.03% and 85.61%, respectively). Furthermore, the Bit-STED nano model was significantly smaller, being less than one-eighth the size of the YOLOv8 nano model (1.4 MB compared to 12.0 MB), had fewer trainable parameters (0.35M compared to 3.01M), and was faster in average inference times (14.62 ms compared to 18.28 ms).</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"239 \",\"pages\":\"Article 111047\"},\"PeriodicalIF\":8.9000,\"publicationDate\":\"2025-10-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169925011536\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925011536","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

本文提出了Bit-STED,一种新型的简化变压器编码器结构,用于利用无人机(UAV)图像高效检测龙舌兰植物并进行精确计数。该方法解决了对农业监测中可访问且具有成本效益的解决方案的关键需求,使人工操作中通常耗时、劳动密集型且容易出现人为错误的过程自动化。Bit-STED模型采用了轻量级的变压器设计,结合了高效特征提取的创新技术,通过量化进行模型压缩,并使用圆形边界盒对龙舌兰花环的大致圆形形状进行形状感知对象定位。为了补充检测模型,开发了一种新的计数算法来准确地管理跨越多个图像块的植物。实验结果表明,Bit-STED模型在检测和龙舌兰植物计数性能方面优于基线模型。具体而言,Bit-STED纳米模型在具有较年轻植物的地图上获得了96.66%的F1分数,在具有较大,高度重叠的植物的地图上获得了96.43%的F1分数。这些分数超过了最先进的基线,如YOLOv8 Nano (F1分数分别为96.42%和96.38%)和DETR (F1分数分别为93.03%和85.61%)。此外,Bit-STED纳米模型明显更小,小于YOLOv8纳米模型的八分之一(1.4 MB比12.0 MB),可训练参数更少(0.35M比3.01M),平均推理时间更快(14.62 ms比18.28 ms)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Bit-STED: A lightweight transformer for accurate agave counting with UAV imagery
This paper presented Bit-STED, a novel and simplified transformer encoder architecture for efficient agave plant detection and accurate counting using unmanned aerial vehicle (UAV) imagery. Addressing the critical need for accessible and cost-efficient solutions in agricultural monitoring, this approach automates a process that is typically time-consuming, labor-intensive, and prone to human error in manual practices. The Bit-STED model features a lightweight transformer design that incorporates innovative techniques for efficient feature extraction, model compression through quantization, and shape-aware object localization using circular bounding boxes for the roughly circular shape of the agave rosettes. To complement the detection model, a novel counting algorithm was developed to manage plants spanning multiple image tiles accurately. The experimental results demonstrated that the Bit-STED model outperformed the baseline models in terms of detection and agave plant count performance. Specifically, the Bit-STED nano model achieved F1 scores of 96.66% on a map with younger plants and 96.43% on a map with larger, highly overlapping plants. These scores surpassed state-of-the-art baselines, such as YOLOv8 Nano (F1 scores of 96.42% and 96.38%, respectively) and DETR (F1 scores of 93.03% and 85.61%, respectively). Furthermore, the Bit-STED nano model was significantly smaller, being less than one-eighth the size of the YOLOv8 nano model (1.4 MB compared to 12.0 MB), had fewer trainable parameters (0.35M compared to 3.01M), and was faster in average inference times (14.62 ms compared to 18.28 ms).
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computers and Electronics in Agriculture
Computers and Electronics in Agriculture 工程技术-计算机:跨学科应用
CiteScore
15.30
自引率
14.50%
发文量
800
审稿时长
62 days
期刊介绍: Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信