Yan Sun, Mengqi Li, Meiling Liu, Jingyi Zhang, Yingli Cao, Xue Ao
{"title":"基于田间表型特征的大豆高产出苗率统计计算方法。","authors":"Yan Sun, Mengqi Li, Meiling Liu, Jingyi Zhang, Yingli Cao, Xue Ao","doi":"10.1186/s13007-025-01356-x","DOIUrl":null,"url":null,"abstract":"<p><p>In the process of smart breeding, the rapid statistics of soybean emergence rate, as an important part of breeding screening, face challenges under environmental constraints, especially the selection and breeding of soybean varieties in dense environments. Due to the influence of environmental factors, the existing methods have shortcomings, such as low throughput, low efficiency, and insufficient precision. Therefore, an effective and precise statistical method is required. In this study, UAV (Unmanned Aerial Vehicle)-scale data combined with ground measurement data were used as the research object to explore the feasibility of improving the throughput, efficiency, and accuracy of breeding screening under intensive soybean planting. To this end, a set of technical solutions, including background removal, object detection, and accurate counting, were designed. Firstly, a combined background segmentation method based on contrast enhancement filtering combined with ultra-green eigenvalues and the Otsu algorithm was proposed to remove the complex background in remote sensing images and retain the morphological information of soybean seedlings. Secondly, the deep learning object detection model was used to infer and predict the processed images to label soybean seedlings. Then, a soybean seedling counting algorithm was constructed: by establishing a soybean seedling growth model, the idea of \"growth normalization\" was proposed, and the expansion-compression factor was defined to eliminate the influence of soybean seedling growth inconsistency on counting. After statistical and in-depth analysis of the growth and planting characteristics of soybean seedlings under overlapping conditions, the \"inter-seedling occlusion counting algorithm\" was proposed to solve the problem of overlapping counting between seedlings. In order to solve the problem of an overlapping bounding box, a soft strategy is specially designed to avoid the redundant values brought by it. Finally, according to the calculation results, the statistical thematic map of soybean emergence rate based on plot plots was displayed. After experiments, the proposed method can effectively count the number of soybean seedlings in the image, with an overall accuracy of 99.18% and an error rate of 0.82%. In addition, Yolov8n had the best recognition effect in the soybean seedling detection task, with a mAP (0.5-0.95) of 85.15%. The proposed background segmentation method increased the mAP (0.5-0.95) of the detection results by 4.06%. It has been demonstrated through experimental tests and verifications that solid support for the statistical work concerning the soybean emergence rate under the condition of intensive planting is provided by this method. This innovative method has played a facilitating role in accelerating the breeding process and has also provided some new ideas and reference directions for further exploration of efficient screening.</p>","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":"21 1","pages":"40"},"PeriodicalIF":4.7000,"publicationDate":"2025-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11931824/pdf/","citationCount":"0","resultStr":"{\"title\":\"A statistical method for high-throughput emergence rate calculation for soybean breeding plots based on field phenotypic characteristics.\",\"authors\":\"Yan Sun, Mengqi Li, Meiling Liu, Jingyi Zhang, Yingli Cao, Xue Ao\",\"doi\":\"10.1186/s13007-025-01356-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>In the process of smart breeding, the rapid statistics of soybean emergence rate, as an important part of breeding screening, face challenges under environmental constraints, especially the selection and breeding of soybean varieties in dense environments. Due to the influence of environmental factors, the existing methods have shortcomings, such as low throughput, low efficiency, and insufficient precision. Therefore, an effective and precise statistical method is required. In this study, UAV (Unmanned Aerial Vehicle)-scale data combined with ground measurement data were used as the research object to explore the feasibility of improving the throughput, efficiency, and accuracy of breeding screening under intensive soybean planting. To this end, a set of technical solutions, including background removal, object detection, and accurate counting, were designed. Firstly, a combined background segmentation method based on contrast enhancement filtering combined with ultra-green eigenvalues and the Otsu algorithm was proposed to remove the complex background in remote sensing images and retain the morphological information of soybean seedlings. Secondly, the deep learning object detection model was used to infer and predict the processed images to label soybean seedlings. Then, a soybean seedling counting algorithm was constructed: by establishing a soybean seedling growth model, the idea of \\\"growth normalization\\\" was proposed, and the expansion-compression factor was defined to eliminate the influence of soybean seedling growth inconsistency on counting. After statistical and in-depth analysis of the growth and planting characteristics of soybean seedlings under overlapping conditions, the \\\"inter-seedling occlusion counting algorithm\\\" was proposed to solve the problem of overlapping counting between seedlings. In order to solve the problem of an overlapping bounding box, a soft strategy is specially designed to avoid the redundant values brought by it. Finally, according to the calculation results, the statistical thematic map of soybean emergence rate based on plot plots was displayed. After experiments, the proposed method can effectively count the number of soybean seedlings in the image, with an overall accuracy of 99.18% and an error rate of 0.82%. In addition, Yolov8n had the best recognition effect in the soybean seedling detection task, with a mAP (0.5-0.95) of 85.15%. The proposed background segmentation method increased the mAP (0.5-0.95) of the detection results by 4.06%. It has been demonstrated through experimental tests and verifications that solid support for the statistical work concerning the soybean emergence rate under the condition of intensive planting is provided by this method. This innovative method has played a facilitating role in accelerating the breeding process and has also provided some new ideas and reference directions for further exploration of efficient screening.</p>\",\"PeriodicalId\":20100,\"journal\":{\"name\":\"Plant Methods\",\"volume\":\"21 1\",\"pages\":\"40\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2025-03-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11931824/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Plant Methods\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s13007-025-01356-x\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Methods","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13007-025-01356-x","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
A statistical method for high-throughput emergence rate calculation for soybean breeding plots based on field phenotypic characteristics.
In the process of smart breeding, the rapid statistics of soybean emergence rate, as an important part of breeding screening, face challenges under environmental constraints, especially the selection and breeding of soybean varieties in dense environments. Due to the influence of environmental factors, the existing methods have shortcomings, such as low throughput, low efficiency, and insufficient precision. Therefore, an effective and precise statistical method is required. In this study, UAV (Unmanned Aerial Vehicle)-scale data combined with ground measurement data were used as the research object to explore the feasibility of improving the throughput, efficiency, and accuracy of breeding screening under intensive soybean planting. To this end, a set of technical solutions, including background removal, object detection, and accurate counting, were designed. Firstly, a combined background segmentation method based on contrast enhancement filtering combined with ultra-green eigenvalues and the Otsu algorithm was proposed to remove the complex background in remote sensing images and retain the morphological information of soybean seedlings. Secondly, the deep learning object detection model was used to infer and predict the processed images to label soybean seedlings. Then, a soybean seedling counting algorithm was constructed: by establishing a soybean seedling growth model, the idea of "growth normalization" was proposed, and the expansion-compression factor was defined to eliminate the influence of soybean seedling growth inconsistency on counting. After statistical and in-depth analysis of the growth and planting characteristics of soybean seedlings under overlapping conditions, the "inter-seedling occlusion counting algorithm" was proposed to solve the problem of overlapping counting between seedlings. In order to solve the problem of an overlapping bounding box, a soft strategy is specially designed to avoid the redundant values brought by it. Finally, according to the calculation results, the statistical thematic map of soybean emergence rate based on plot plots was displayed. After experiments, the proposed method can effectively count the number of soybean seedlings in the image, with an overall accuracy of 99.18% and an error rate of 0.82%. In addition, Yolov8n had the best recognition effect in the soybean seedling detection task, with a mAP (0.5-0.95) of 85.15%. The proposed background segmentation method increased the mAP (0.5-0.95) of the detection results by 4.06%. It has been demonstrated through experimental tests and verifications that solid support for the statistical work concerning the soybean emergence rate under the condition of intensive planting is provided by this method. This innovative method has played a facilitating role in accelerating the breeding process and has also provided some new ideas and reference directions for further exploration of efficient screening.
期刊介绍:
Plant Methods is an open access, peer-reviewed, online journal for the plant research community that encompasses all aspects of technological innovation in the plant sciences.
There is no doubt that we have entered an exciting new era in plant biology. The completion of the Arabidopsis genome sequence, and the rapid progress being made in other plant genomics projects are providing unparalleled opportunities for progress in all areas of plant science. Nevertheless, enormous challenges lie ahead if we are to understand the function of every gene in the genome, and how the individual parts work together to make the whole organism. Achieving these goals will require an unprecedented collaborative effort, combining high-throughput, system-wide technologies with more focused approaches that integrate traditional disciplines such as cell biology, biochemistry and molecular genetics.
Technological innovation is probably the most important catalyst for progress in any scientific discipline. Plant Methods’ goal is to stimulate the development and adoption of new and improved techniques and research tools and, where appropriate, to promote consistency of methodologies for better integration of data from different laboratories.