Chengming Ou, Zhicheng Jia, Shiqiang Zhao, Shoujiang Sun, Ming Sun, Jingyu Liu, Manli Li, Shangang Jia, Peisheng Mao
{"title":"A novel approach integrating multispectral imaging and machine learning to identify seed maturity and vigor in smooth bromegrass.","authors":"Chengming Ou, Zhicheng Jia, Shiqiang Zhao, Shoujiang Sun, Ming Sun, Jingyu Liu, Manli Li, Shangang Jia, Peisheng Mao","doi":"10.1186/s13007-025-01359-8","DOIUrl":null,"url":null,"abstract":"<p><p>Smooth bromegrass (Bromus inermis) was adopted as experiment materials for identifying the seed maturity using a combination of multispectral imaging and machine learning. The trials were conducted to investigate the effects of three nitrogen application levels (0, 100 and 200 kg N ha<sup>- 1</sup>, defined as CK, N1 and N2 respectively) and two spikelet grain positions: superior grain (SG) at the basal position and inferior grain (IG) at the upper position, on smooth bromegrass seeds. The germination characteristics of the seeds revealed that the variations in nitrogen application and grain positions significantly influenced seeds vigor. The seed vigor of increased gradually with their maturity, reaching a high level at 30 and 36 days after anthesis. A stacking ensemble learning approach was employed to identify the seed maturity based on multispectral imaging and autofluorescence imaging. The results demonstrated that the Ensemble model outperformed Support Vector Machine, Bayesian, XGBoost and Random Forest across all evaluated metrics in different scenarios. The model accuracy in CK, N1 and N2 were 89%, 87% and 93%, respectively. Furthermore, the SHapley Additive exPlanations method was selected to interpret the Ensemble model, identifying important features such as 405, 430, 540, 630, 645, 690, 850, 880 and 970 nm. These features exhibited a significant correlation with fresh weight, shoot length and vigor index. These findings showed the high accuracy and generalizability of the Ensemble model for identifying the maturity and quality of smooth bromegrass seeds. Therefore, a new strategy would be offered for evaluating seed maturity and vigor level.</p>","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":"21 1","pages":"45"},"PeriodicalIF":4.7000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11938725/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Methods","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13007-025-01359-8","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Smooth bromegrass (Bromus inermis) was adopted as experiment materials for identifying the seed maturity using a combination of multispectral imaging and machine learning. The trials were conducted to investigate the effects of three nitrogen application levels (0, 100 and 200 kg N ha- 1, defined as CK, N1 and N2 respectively) and two spikelet grain positions: superior grain (SG) at the basal position and inferior grain (IG) at the upper position, on smooth bromegrass seeds. The germination characteristics of the seeds revealed that the variations in nitrogen application and grain positions significantly influenced seeds vigor. The seed vigor of increased gradually with their maturity, reaching a high level at 30 and 36 days after anthesis. A stacking ensemble learning approach was employed to identify the seed maturity based on multispectral imaging and autofluorescence imaging. The results demonstrated that the Ensemble model outperformed Support Vector Machine, Bayesian, XGBoost and Random Forest across all evaluated metrics in different scenarios. The model accuracy in CK, N1 and N2 were 89%, 87% and 93%, respectively. Furthermore, the SHapley Additive exPlanations method was selected to interpret the Ensemble model, identifying important features such as 405, 430, 540, 630, 645, 690, 850, 880 and 970 nm. These features exhibited a significant correlation with fresh weight, shoot length and vigor index. These findings showed the high accuracy and generalizability of the Ensemble model for identifying the maturity and quality of smooth bromegrass seeds. Therefore, a new strategy would be offered for evaluating seed maturity and vigor level.
采用多光谱成像与机器学习相结合的方法,以光滑雀麦(Bromus inermis)为实验材料进行种子成熟度鉴定。本试验研究了3个施氮水平(0、100和200 kg N ha- 1,分别定义为CK、N1和N2)和2个小穗粒位(基部优粒位和上部劣粒位)对雀麦种子光滑性的影响。种子的萌发特性表明,施氮量和粒位的变化对种子活力有显著影响。随着种子成熟,种子活力逐渐增强,在开花后30和36 d达到较高水平。采用基于多光谱成像和自荧光成像的叠加集成学习方法对种子成熟度进行识别。结果表明,集成模型在不同场景下的所有评估指标上都优于支持向量机、贝叶斯、XGBoost和随机森林。在CK、N1和N2条件下,模型准确率分别为89%、87%和93%。此外,选择SHapley加性解释方法来解释集合模型,识别出405、430、540、630、645、690、850、880和970 nm等重要特征。这些特征与鲜重、茎长和活力指数呈极显著相关。这些结果表明,该模型具有较高的准确性和通用性,可用于鉴定光滑雀稗种子的成熟度和质量。为评价种子成熟度和活力水平提供了一种新的策略。
期刊介绍:
Plant Methods is an open access, peer-reviewed, online journal for the plant research community that encompasses all aspects of technological innovation in the plant sciences.
There is no doubt that we have entered an exciting new era in plant biology. The completion of the Arabidopsis genome sequence, and the rapid progress being made in other plant genomics projects are providing unparalleled opportunities for progress in all areas of plant science. Nevertheless, enormous challenges lie ahead if we are to understand the function of every gene in the genome, and how the individual parts work together to make the whole organism. Achieving these goals will require an unprecedented collaborative effort, combining high-throughput, system-wide technologies with more focused approaches that integrate traditional disciplines such as cell biology, biochemistry and molecular genetics.
Technological innovation is probably the most important catalyst for progress in any scientific discipline. Plant Methods’ goal is to stimulate the development and adoption of new and improved techniques and research tools and, where appropriate, to promote consistency of methodologies for better integration of data from different laboratories.