通过弱语言监督进行零次昆虫探测

Ben Feuer, Ameya Joshi, Minsu Cho, Kewal Jani, Shivani Chiranjeevi, Ziwei Deng, Aditya Balu, Ashutosh Kumar Singh, S. Sarkar, Nirav C. Merchant, Arti Singh, B. Ganapathysubramanian, C. Hegde
{"title":"通过弱语言监督进行零次昆虫探测","authors":"Ben Feuer, Ameya Joshi, Minsu Cho, Kewal Jani, Shivani Chiranjeevi, Ziwei Deng, Aditya Balu, Ashutosh Kumar Singh, S. Sarkar, Nirav C. Merchant, Arti Singh, B. Ganapathysubramanian, C. Hegde","doi":"10.1002/ppj2.20107","DOIUrl":null,"url":null,"abstract":"Cheap and ubiquitous sensing has made collecting large agricultural datasets relatively straightforward. These large datasets (for instance, citizen science data curation platforms like iNaturalist) can pave the way for developing powerful artificial intelligence (AI) models for detection and counting. However, traditional supervised learning methods require labeled data, and manual annotation of these raw datasets with useful labels (such as bounding boxes or segmentation masks) can be extremely laborious, expensive, and error‐prone. In this paper, we demonstrate the power of zero‐shot computer vision methods—a new family of approaches that require (almost) no manual supervision—for plant phenomics applications. Focusing on insect detection as the primary use case, we show that our models enable highly accurate detection of insects in a variety of challenging imaging environments. Our technical contributions are two‐fold: (a) We curate the Insecta rank class of iNaturalist to form a new benchmark dataset of approximately 6 million images consisting of 2526 agriculturally and ecologically important species, including pests and beneficial insects. (b) Using a vision‐language object detection method coupled with weak language supervision, we are able to automatically annotate images in this dataset with bounding box information localizing the insect within each image. Our method succeeds in detecting diverse insect species present in a wide variety of backgrounds, producing high‐quality bounding boxes in a zero‐shot manner with no additional training cost. This open dataset can serve as a use‐inspired benchmark for the AI community. We demonstrate that our method can also be used for other applications in plant phenomics, such as fruit detection in images of strawberry and apple trees. Overall, our framework highlights the promise of zero‐shot approaches to make high‐throughput plant phenotyping more affordable.","PeriodicalId":504448,"journal":{"name":"The Plant Phenome Journal","volume":"140 34","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Zero‐shot insect detection via weak language supervision\",\"authors\":\"Ben Feuer, Ameya Joshi, Minsu Cho, Kewal Jani, Shivani Chiranjeevi, Ziwei Deng, Aditya Balu, Ashutosh Kumar Singh, S. Sarkar, Nirav C. Merchant, Arti Singh, B. Ganapathysubramanian, C. Hegde\",\"doi\":\"10.1002/ppj2.20107\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Cheap and ubiquitous sensing has made collecting large agricultural datasets relatively straightforward. These large datasets (for instance, citizen science data curation platforms like iNaturalist) can pave the way for developing powerful artificial intelligence (AI) models for detection and counting. However, traditional supervised learning methods require labeled data, and manual annotation of these raw datasets with useful labels (such as bounding boxes or segmentation masks) can be extremely laborious, expensive, and error‐prone. In this paper, we demonstrate the power of zero‐shot computer vision methods—a new family of approaches that require (almost) no manual supervision—for plant phenomics applications. Focusing on insect detection as the primary use case, we show that our models enable highly accurate detection of insects in a variety of challenging imaging environments. Our technical contributions are two‐fold: (a) We curate the Insecta rank class of iNaturalist to form a new benchmark dataset of approximately 6 million images consisting of 2526 agriculturally and ecologically important species, including pests and beneficial insects. (b) Using a vision‐language object detection method coupled with weak language supervision, we are able to automatically annotate images in this dataset with bounding box information localizing the insect within each image. Our method succeeds in detecting diverse insect species present in a wide variety of backgrounds, producing high‐quality bounding boxes in a zero‐shot manner with no additional training cost. This open dataset can serve as a use‐inspired benchmark for the AI community. We demonstrate that our method can also be used for other applications in plant phenomics, such as fruit detection in images of strawberry and apple trees. Overall, our framework highlights the promise of zero‐shot approaches to make high‐throughput plant phenotyping more affordable.\",\"PeriodicalId\":504448,\"journal\":{\"name\":\"The Plant Phenome Journal\",\"volume\":\"140 34\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Plant Phenome Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/ppj2.20107\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Plant Phenome Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/ppj2.20107","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

廉价且无处不在的传感技术使得收集大型农业数据集变得相对简单。这些大型数据集(例如 iNaturalist 等公民科学数据整理平台)可以为开发强大的人工智能(AI)检测和计数模型铺平道路。然而,传统的监督学习方法需要标注数据,而为这些原始数据集手动标注有用的标签(如边界框或分割掩码)可能极其费力、昂贵且容易出错。在本文中,我们展示了零镜头计算机视觉方法--无需(几乎)人工监督的新方法系列--在植物表型组学应用中的威力。我们以昆虫检测为主要应用案例,展示了我们的模型能够在各种具有挑战性的成像环境中高度准确地检测昆虫。我们的技术贡献有两方面:(a) 我们对 iNaturalist 的昆虫等级类别进行了整理,形成了一个新的基准数据集,该数据集包含约 600 万张图像,其中有 2526 种具有重要农业和生态意义的物种,包括害虫和益虫。(b) 利用视觉语言对象检测方法和弱语言监督,我们能够为该数据集中的图像自动注释边界框信息,定位每张图像中的昆虫。我们的方法能成功地检测出各种背景下的不同昆虫种类,并以零拍摄的方式生成高质量的边界框,无需额外的训练成本。这一开放式数据集可作为人工智能界的使用启发基准。我们证明,我们的方法还可用于植物表型组学的其他应用,如草莓和苹果树图像中的果实检测。总之,我们的框架凸显了零镜头方法的前景,使高通量植物表型分析更加经济实惠。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Zero‐shot insect detection via weak language supervision
Cheap and ubiquitous sensing has made collecting large agricultural datasets relatively straightforward. These large datasets (for instance, citizen science data curation platforms like iNaturalist) can pave the way for developing powerful artificial intelligence (AI) models for detection and counting. However, traditional supervised learning methods require labeled data, and manual annotation of these raw datasets with useful labels (such as bounding boxes or segmentation masks) can be extremely laborious, expensive, and error‐prone. In this paper, we demonstrate the power of zero‐shot computer vision methods—a new family of approaches that require (almost) no manual supervision—for plant phenomics applications. Focusing on insect detection as the primary use case, we show that our models enable highly accurate detection of insects in a variety of challenging imaging environments. Our technical contributions are two‐fold: (a) We curate the Insecta rank class of iNaturalist to form a new benchmark dataset of approximately 6 million images consisting of 2526 agriculturally and ecologically important species, including pests and beneficial insects. (b) Using a vision‐language object detection method coupled with weak language supervision, we are able to automatically annotate images in this dataset with bounding box information localizing the insect within each image. Our method succeeds in detecting diverse insect species present in a wide variety of backgrounds, producing high‐quality bounding boxes in a zero‐shot manner with no additional training cost. This open dataset can serve as a use‐inspired benchmark for the AI community. We demonstrate that our method can also be used for other applications in plant phenomics, such as fruit detection in images of strawberry and apple trees. Overall, our framework highlights the promise of zero‐shot approaches to make high‐throughput plant phenotyping more affordable.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信