SALMA: A machine learning tool for precise leaf morphology measurements

IF 7.3 2区 环境科学与生态学 Q1 ECOLOGY
Ecological Informatics Pub Date : 2026-02-01 Epub Date: 2026-01-02 DOI:10.1016/j.ecoinf.2025.103592
Ilya Shabanov , Julie R Deslippe , Andrew Lensen
{"title":"SALMA: A machine learning tool for precise leaf morphology measurements","authors":"Ilya Shabanov ,&nbsp;Julie R Deslippe ,&nbsp;Andrew Lensen","doi":"10.1016/j.ecoinf.2025.103592","DOIUrl":null,"url":null,"abstract":"<div><div>Leaf area is a critical plant functional trait, widely used for understanding plant responses to climate change, ecosystem productivity, and species' adaptive strategies. Inaccurate leaf area measurements compromise the accuracy of other traits normalised by area, such as foliar chemical traits, respiration, and photosynthesis. However, existing measurement methods are ineffective for small-leaved plants and often necessitate manual processing, which limits sample sizes and potentially obscures subtle trait-environment relationships. We developed SALMA (Semi-Automated Leaf Morphological Analysis), which employs logistic regression trained on one to four human-generated examples per species to delineate leaf boundaries for that species accurately. SALMA's training step adapts to species-specific features by integrating multiple characteristics, such as colour variations and edge details. The approach is validated on an extensive dataset (64 species, 3332 images) that covers 91.4 % of the worldwide leaf area variation, as well as two smaller datasets comprising low-quality photographs of morphologically complex or damaged leaves. SALMA consistently achieved leaf area errors 2 to 15 times lower than existing algorithms and a theoretical upper bound of any grayscale intensity-based method. Critically, we identify a previously overlooked power-law relationship between leaf area and measurement error, demonstrating that existing methods may overestimate leaf area by at least 5 % for 43 % of global species, whereas SALMA achieves comparable errors for only 2.1 % of species. SALMA is a standalone software with an intuitive interface that supports parallel processing, making it accessible for large-scale ecological studies globally. It can potentially enhance the quality of trait datasets and facilitate large-scale sampling, thereby advancing our understanding of plant-environment interactions. Our published dataset contains manually created binary segmentations of leaves and background, providing a baseline for future leaf measurement algorithms.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"93 ","pages":"Article 103592"},"PeriodicalIF":7.3000,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574954125006016","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/1/2 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Leaf area is a critical plant functional trait, widely used for understanding plant responses to climate change, ecosystem productivity, and species' adaptive strategies. Inaccurate leaf area measurements compromise the accuracy of other traits normalised by area, such as foliar chemical traits, respiration, and photosynthesis. However, existing measurement methods are ineffective for small-leaved plants and often necessitate manual processing, which limits sample sizes and potentially obscures subtle trait-environment relationships. We developed SALMA (Semi-Automated Leaf Morphological Analysis), which employs logistic regression trained on one to four human-generated examples per species to delineate leaf boundaries for that species accurately. SALMA's training step adapts to species-specific features by integrating multiple characteristics, such as colour variations and edge details. The approach is validated on an extensive dataset (64 species, 3332 images) that covers 91.4 % of the worldwide leaf area variation, as well as two smaller datasets comprising low-quality photographs of morphologically complex or damaged leaves. SALMA consistently achieved leaf area errors 2 to 15 times lower than existing algorithms and a theoretical upper bound of any grayscale intensity-based method. Critically, we identify a previously overlooked power-law relationship between leaf area and measurement error, demonstrating that existing methods may overestimate leaf area by at least 5 % for 43 % of global species, whereas SALMA achieves comparable errors for only 2.1 % of species. SALMA is a standalone software with an intuitive interface that supports parallel processing, making it accessible for large-scale ecological studies globally. It can potentially enhance the quality of trait datasets and facilitate large-scale sampling, thereby advancing our understanding of plant-environment interactions. Our published dataset contains manually created binary segmentations of leaves and background, providing a baseline for future leaf measurement algorithms.
SALMA:用于精确测量叶片形态的机器学习工具
叶面积是一项重要的植物功能性状,广泛用于了解植物对气候变化的响应、生态系统生产力和物种适应策略。不准确的叶面积测量会损害其他性状的准确性,如叶化学性状、呼吸作用和光合作用。然而,现有的测量方法对小叶植物是无效的,通常需要人工处理,这限制了样本量,并可能模糊微妙的性状-环境关系。我们开发了SALMA(半自动叶片形态分析),它使用逻辑回归训练每个物种的一到四个人类生成的例子来准确地描绘该物种的叶片边界。SALMA的训练步骤通过整合多种特征(如颜色变化和边缘细节)来适应物种特定的特征。该方法在一个广泛的数据集(64种,3332张图像)上进行了验证,该数据集覆盖了全球91.4%的叶面积变化,以及两个较小的数据集,其中包括形态复杂或受损叶片的低质量照片。SALMA始终实现叶面积误差比现有算法低2到15倍,并且是任何基于灰度强度的方法的理论上限。重要的是,我们确定了以前被忽视的叶面积与测量误差之间的幂律关系,表明现有方法可能对全球43%的物种高估了至少5%的叶面积,而SALMA仅对2.1%的物种实现了类似的误差。SALMA是一个独立的软件,具有直观的界面,支持并行处理,使其可用于全球大规模的生态研究。它可以潜在地提高性状数据集的质量,促进大规模采样,从而提高我们对植物-环境相互作用的理解。我们发布的数据集包含手动创建的叶片和背景的二值分割,为未来的叶片测量算法提供基线。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Ecological Informatics
Ecological Informatics 环境科学-生态学
CiteScore
8.30
自引率
11.80%
发文量
346
审稿时长
46 days
期刊介绍: The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change. The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书