Melissa Cristina de Carvalho Miranda, Alexandre Hild Aono, Talieisse Gomes Fagundes, Giovanni Michelan Arduini, José Baldin Pinheiro
{"title":"High-throughput phenotyping and machine learning techniques in soybean breeding: Exploring the potential of aerial imaging and vegetation indices","authors":"Melissa Cristina de Carvalho Miranda, Alexandre Hild Aono, Talieisse Gomes Fagundes, Giovanni Michelan Arduini, José Baldin Pinheiro","doi":"10.1002/agj2.70012","DOIUrl":null,"url":null,"abstract":"<p>Soybean (<i>Glycine max</i> (L.) Merr.) breeding programs face challenges in evaluating large progeny populations, which is labor- and resource-intensive. This study addresses these challenges using high-throughput phenotyping and machine learning (ML) models to predict phenotypic traits in soybeans. We developed and validated ML models using vegetation indices and canopy images from aerial imagery. A total of 275 soybean genotypes were characterized across two environments and management practices. A total of 11 classical traits were measured, and five vegetation indices were calculated from aerial images at different growth stages. ML algorithms, including support vector machine for regression, random forest (RF), multilayer perceptron (MLP), and adaptive boosting, were employed. Additionally, convolutional neural networks with transfer learning were used to extract features from the images. Significant correlations were found between agronomic traits, vegetation indices, and canopy characteristics. The high heritability of the red–green–blue vegetation index and green leaf index (mean broad-sense heritability of 0.56) compared to other RGB-based indices indicates their potential usefulness in genetic evaluations. Advanced ML techniques, particularly transfer learning with ResNet 50, enhanced the prediction of phenotypic traits such as days to the R7 growth stage (DR7) and plant height at maturation (PHM). The integration of ResNet 50 with RF achieved a prediction accuracy of 0.64 for DR7, while ResNet 50 with MLP reached an accuracy of 0.68 for PHM. These findings highlight the potential of these techniques to improve decision-making in soybean breeding. Lastly, principal component analysis identified genotypes with desirable trait combinations, advancing soybean development.</p>","PeriodicalId":7522,"journal":{"name":"Agronomy Journal","volume":"117 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2025-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/agj2.70012","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Agronomy Journal","FirstCategoryId":"97","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/agj2.70012","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
Abstract
Soybean (Glycine max (L.) Merr.) breeding programs face challenges in evaluating large progeny populations, which is labor- and resource-intensive. This study addresses these challenges using high-throughput phenotyping and machine learning (ML) models to predict phenotypic traits in soybeans. We developed and validated ML models using vegetation indices and canopy images from aerial imagery. A total of 275 soybean genotypes were characterized across two environments and management practices. A total of 11 classical traits were measured, and five vegetation indices were calculated from aerial images at different growth stages. ML algorithms, including support vector machine for regression, random forest (RF), multilayer perceptron (MLP), and adaptive boosting, were employed. Additionally, convolutional neural networks with transfer learning were used to extract features from the images. Significant correlations were found between agronomic traits, vegetation indices, and canopy characteristics. The high heritability of the red–green–blue vegetation index and green leaf index (mean broad-sense heritability of 0.56) compared to other RGB-based indices indicates their potential usefulness in genetic evaluations. Advanced ML techniques, particularly transfer learning with ResNet 50, enhanced the prediction of phenotypic traits such as days to the R7 growth stage (DR7) and plant height at maturation (PHM). The integration of ResNet 50 with RF achieved a prediction accuracy of 0.64 for DR7, while ResNet 50 with MLP reached an accuracy of 0.68 for PHM. These findings highlight the potential of these techniques to improve decision-making in soybean breeding. Lastly, principal component analysis identified genotypes with desirable trait combinations, advancing soybean development.
期刊介绍:
After critical review and approval by the editorial board, AJ publishes articles reporting research findings in soil–plant relationships; crop science; soil science; biometry; crop, soil, pasture, and range management; crop, forage, and pasture production and utilization; turfgrass; agroclimatology; agronomic models; integrated pest management; integrated agricultural systems; and various aspects of entomology, weed science, animal science, plant pathology, and agricultural economics as applied to production agriculture.
Notes are published about apparatus, observations, and experimental techniques. Observations usually are limited to studies and reports of unrepeatable phenomena or other unique circumstances. Review and interpretation papers are also published, subject to standard review. Contributions to the Forum section deal with current agronomic issues and questions in brief, thought-provoking form. Such papers are reviewed by the editor in consultation with the editorial board.