Thais Vasconcelos,William N Weaver,Aly Baumgartner,Zoë Bugnaski,James Boyko
{"title":"从数字化植物标本中自动提取每面积叶质量。","authors":"Thais Vasconcelos,William N Weaver,Aly Baumgartner,Zoë Bugnaski,James Boyko","doi":"10.1111/nph.70292","DOIUrl":null,"url":null,"abstract":"The digitization of vast herbarium collections has made millions of plant specimen images freely available online, which can now be used to generate phenotypic datasets of unprecedented scope. Here, we assess the potential of computer vision tools to automate the extraction of predicted leaf mass per area (LMApred) from digitized herbarium specimens. We use an automated pipeline to extract leaf area and petiole width from 22 680 leaves, representing a phylogenetic informed sample of 1580 species of woody angiosperms. LMApred is estimated using a proxy equation that models the scaling relationship between petiole width and leaf mass. We assess potential sources of error in LMApred estimates and evaluate whether documented LMA-climate patterns are recovered using this dataset and phylogenetic comparative methods. Our LMApred dataset responds mainly to temperature and solar radiation and presents a positive correlation with latitude. The proxy equation, not the automated pipeline, is responsible for most of the error in LMApred estimates. Our pipeline underscores the power of combining herbarium digitization with new techniques for automated trait scoring. The increased size of datasets generated using this tool allows investigation of potential LMA-climate relationships with a geographically balanced sample while also utilizing comprehensive phylogenetic information.","PeriodicalId":214,"journal":{"name":"New Phytologist","volume":"3 1","pages":""},"PeriodicalIF":8.3000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated extraction of leaf mass per area from digitized herbarium specimens.\",\"authors\":\"Thais Vasconcelos,William N Weaver,Aly Baumgartner,Zoë Bugnaski,James Boyko\",\"doi\":\"10.1111/nph.70292\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The digitization of vast herbarium collections has made millions of plant specimen images freely available online, which can now be used to generate phenotypic datasets of unprecedented scope. Here, we assess the potential of computer vision tools to automate the extraction of predicted leaf mass per area (LMApred) from digitized herbarium specimens. We use an automated pipeline to extract leaf area and petiole width from 22 680 leaves, representing a phylogenetic informed sample of 1580 species of woody angiosperms. LMApred is estimated using a proxy equation that models the scaling relationship between petiole width and leaf mass. We assess potential sources of error in LMApred estimates and evaluate whether documented LMA-climate patterns are recovered using this dataset and phylogenetic comparative methods. Our LMApred dataset responds mainly to temperature and solar radiation and presents a positive correlation with latitude. The proxy equation, not the automated pipeline, is responsible for most of the error in LMApred estimates. Our pipeline underscores the power of combining herbarium digitization with new techniques for automated trait scoring. The increased size of datasets generated using this tool allows investigation of potential LMA-climate relationships with a geographically balanced sample while also utilizing comprehensive phylogenetic information.\",\"PeriodicalId\":214,\"journal\":{\"name\":\"New Phytologist\",\"volume\":\"3 1\",\"pages\":\"\"},\"PeriodicalIF\":8.3000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"New Phytologist\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1111/nph.70292\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PLANT SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"New Phytologist","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1111/nph.70292","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
Automated extraction of leaf mass per area from digitized herbarium specimens.
The digitization of vast herbarium collections has made millions of plant specimen images freely available online, which can now be used to generate phenotypic datasets of unprecedented scope. Here, we assess the potential of computer vision tools to automate the extraction of predicted leaf mass per area (LMApred) from digitized herbarium specimens. We use an automated pipeline to extract leaf area and petiole width from 22 680 leaves, representing a phylogenetic informed sample of 1580 species of woody angiosperms. LMApred is estimated using a proxy equation that models the scaling relationship between petiole width and leaf mass. We assess potential sources of error in LMApred estimates and evaluate whether documented LMA-climate patterns are recovered using this dataset and phylogenetic comparative methods. Our LMApred dataset responds mainly to temperature and solar radiation and presents a positive correlation with latitude. The proxy equation, not the automated pipeline, is responsible for most of the error in LMApred estimates. Our pipeline underscores the power of combining herbarium digitization with new techniques for automated trait scoring. The increased size of datasets generated using this tool allows investigation of potential LMA-climate relationships with a geographically balanced sample while also utilizing comprehensive phylogenetic information.
期刊介绍:
New Phytologist is an international electronic journal published 24 times a year. It is owned by the New Phytologist Foundation, a non-profit-making charitable organization dedicated to promoting plant science. The journal publishes excellent, novel, rigorous, and timely research and scholarship in plant science and its applications. The articles cover topics in five sections: Physiology & Development, Environment, Interaction, Evolution, and Transformative Plant Biotechnology. These sections encompass intracellular processes, global environmental change, and encourage cross-disciplinary approaches. The journal recognizes the use of techniques from molecular and cell biology, functional genomics, modeling, and system-based approaches in plant science. Abstracting and Indexing Information for New Phytologist includes Academic Search, AgBiotech News & Information, Agroforestry Abstracts, Biochemistry & Biophysics Citation Index, Botanical Pesticides, CAB Abstracts®, Environment Index, Global Health, and Plant Breeding Abstracts, and others.