Ludmilla Ferreira Justino , Alexandre Bryan Heinemann , David Henriques da Matta , Luís Fernando Stone , Paulo Augusto de Oliveira Gonçalves , Silvando Carlos da Silva
{"title":"利用机器学习技术确定巴西普通豆类产区的特征","authors":"Ludmilla Ferreira Justino , Alexandre Bryan Heinemann , David Henriques da Matta , Luís Fernando Stone , Paulo Augusto de Oliveira Gonçalves , Silvando Carlos da Silva","doi":"10.1016/j.agsy.2024.104237","DOIUrl":null,"url":null,"abstract":"<div><h3>PROBLEM</h3><div>Understanding the interactions between genotype, environment, and management is crucial for guiding the development of new cultivars and defining strategies to maximize yield under specific environmental conditions.</div></div><div><h3>OBJECTIVE</h3><div>This study aimed to classify and characterize homogeneous production regions for common beans in Brazil by leveraging simulated yield data and machine learning techniques. The goal was to identify the environmental factors that define these homogeneous regions and to develop a spatiotemporal sowing calendar for rainfed (wet and dry seasons) and irrigated (winter season) production.</div></div><div><h3>METHODS</h3><div>The CSM-CROPGRO-Dry Bean model was used to simulate the yield of common beans in Brazilian municipalities during the wet (rainfed) season (sowing between August and December - 275 municipalities), dry (rainfed) season (sowing between January and April - 251 municipalities) and winter (irrigated) season (sowing between April and July - 59 municipalities), utilizing soil data, daily climate data (1980 to 2016), management information (sowing date and irrigation/rainfed), and genetic coefficients to reflect the performance of the BRS Estilo cultivar related to phenology, growth and yield components. To create homogeneous environmental groups and associate them with specific environmental features, we applied machine learning techniques, including K-means clustering and decision tree analysis.</div></div><div><h3>RESULTS</h3><div>According to the results, we identified three distinct homogeneous regions — high, medium, and low yields — for each cultivation season (wet, dry, and winter). During the wet season, regions with yields between 2326 and 3500 kg ha<sup>−1</sup> were classified as high-yield, those between 1404 and 2325 kg ha<sup>−1</sup> as medium-yield, and those between 500 and 1403 kg ha<sup>−1</sup> as low-yield. In the dry season, high-yield regions had yields ranging from 2492 to 3500 kg ha<sup>−1</sup>, medium-yield regions from 1484 to 2491 kg ha<sup>−1</sup>, and low-yield regions from 500 to 1483 kg ha<sup>−1</sup>. For the winter season, high-yield regions achieved yields between 2972 and 3500 kg ha<sup>−1</sup>, medium-yield regions between 2252 and 2971 kg ha<sup>−1</sup>, and low-yield regions between 634 and 2251 kg ha<sup>−1</sup>. For rainfed seasons (wet and dry), the water stress (WSPD) had a greater impact on yield than air temperature and global solar radiation. While in the winter season, air temperature was the most relevant factor. Overall, in the wet season, delayed sowing contributed to increased yield, especially in the state of Paraná. In the dry season, delayed sowing caused a reduction in yield, particularly in the Midwest and Southeast regions. In the winter season, yield varied less significantly between sowing dates, except in the state of Mato Grosso, where harmful increases in air temperature were observed in the later months.</div></div><div><h3>CONCLUSIONS</h3><div>The integration of crop simulation models with machine learning tools is valuable for defining and characterizing homogeneous regions for common bean production. This approach has identified three distinct yield regions—high, medium, and low—for each crop season (wet, dry, and winter). By distinguishing these regions, this methodology supports breeding programs in developing cultivars optimized for specific environments and provides insights into how environmental factors influence crop performance. Additionally, it helps optimize sowing dates to align with favorable conditions, particularly during the wet and dry seasons, thereby contributing to reduced yield losses.</div></div><div><h3>IMPLICATIONS</h3><div>The classification and characterization of homogeneous production regions helps to better understand the genotype (G) x environment (E) x management (M) interactions and to adjust sowing dates to more favorable conditions, especially during the wet and dry seasons, contributing to reducing yield losses. Despite the limitations of crop modeling—such as not accounting for biotic factors and waterlogging—addressing water and air temperature stress presents an even greater challenge. In this context, crop modeling plays a crucial role in identifying effective adaptation strategies.</div></div>","PeriodicalId":7730,"journal":{"name":"Agricultural Systems","volume":"224 ","pages":"Article 104237"},"PeriodicalIF":6.1000,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Characterization of common bean production regions in Brazil using machine learning techniques\",\"authors\":\"Ludmilla Ferreira Justino , Alexandre Bryan Heinemann , David Henriques da Matta , Luís Fernando Stone , Paulo Augusto de Oliveira Gonçalves , Silvando Carlos da Silva\",\"doi\":\"10.1016/j.agsy.2024.104237\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>PROBLEM</h3><div>Understanding the interactions between genotype, environment, and management is crucial for guiding the development of new cultivars and defining strategies to maximize yield under specific environmental conditions.</div></div><div><h3>OBJECTIVE</h3><div>This study aimed to classify and characterize homogeneous production regions for common beans in Brazil by leveraging simulated yield data and machine learning techniques. The goal was to identify the environmental factors that define these homogeneous regions and to develop a spatiotemporal sowing calendar for rainfed (wet and dry seasons) and irrigated (winter season) production.</div></div><div><h3>METHODS</h3><div>The CSM-CROPGRO-Dry Bean model was used to simulate the yield of common beans in Brazilian municipalities during the wet (rainfed) season (sowing between August and December - 275 municipalities), dry (rainfed) season (sowing between January and April - 251 municipalities) and winter (irrigated) season (sowing between April and July - 59 municipalities), utilizing soil data, daily climate data (1980 to 2016), management information (sowing date and irrigation/rainfed), and genetic coefficients to reflect the performance of the BRS Estilo cultivar related to phenology, growth and yield components. To create homogeneous environmental groups and associate them with specific environmental features, we applied machine learning techniques, including K-means clustering and decision tree analysis.</div></div><div><h3>RESULTS</h3><div>According to the results, we identified three distinct homogeneous regions — high, medium, and low yields — for each cultivation season (wet, dry, and winter). During the wet season, regions with yields between 2326 and 3500 kg ha<sup>−1</sup> were classified as high-yield, those between 1404 and 2325 kg ha<sup>−1</sup> as medium-yield, and those between 500 and 1403 kg ha<sup>−1</sup> as low-yield. In the dry season, high-yield regions had yields ranging from 2492 to 3500 kg ha<sup>−1</sup>, medium-yield regions from 1484 to 2491 kg ha<sup>−1</sup>, and low-yield regions from 500 to 1483 kg ha<sup>−1</sup>. For the winter season, high-yield regions achieved yields between 2972 and 3500 kg ha<sup>−1</sup>, medium-yield regions between 2252 and 2971 kg ha<sup>−1</sup>, and low-yield regions between 634 and 2251 kg ha<sup>−1</sup>. For rainfed seasons (wet and dry), the water stress (WSPD) had a greater impact on yield than air temperature and global solar radiation. While in the winter season, air temperature was the most relevant factor. Overall, in the wet season, delayed sowing contributed to increased yield, especially in the state of Paraná. In the dry season, delayed sowing caused a reduction in yield, particularly in the Midwest and Southeast regions. In the winter season, yield varied less significantly between sowing dates, except in the state of Mato Grosso, where harmful increases in air temperature were observed in the later months.</div></div><div><h3>CONCLUSIONS</h3><div>The integration of crop simulation models with machine learning tools is valuable for defining and characterizing homogeneous regions for common bean production. This approach has identified three distinct yield regions—high, medium, and low—for each crop season (wet, dry, and winter). By distinguishing these regions, this methodology supports breeding programs in developing cultivars optimized for specific environments and provides insights into how environmental factors influence crop performance. Additionally, it helps optimize sowing dates to align with favorable conditions, particularly during the wet and dry seasons, thereby contributing to reduced yield losses.</div></div><div><h3>IMPLICATIONS</h3><div>The classification and characterization of homogeneous production regions helps to better understand the genotype (G) x environment (E) x management (M) interactions and to adjust sowing dates to more favorable conditions, especially during the wet and dry seasons, contributing to reducing yield losses. Despite the limitations of crop modeling—such as not accounting for biotic factors and waterlogging—addressing water and air temperature stress presents an even greater challenge. In this context, crop modeling plays a crucial role in identifying effective adaptation strategies.</div></div>\",\"PeriodicalId\":7730,\"journal\":{\"name\":\"Agricultural Systems\",\"volume\":\"224 \",\"pages\":\"Article 104237\"},\"PeriodicalIF\":6.1000,\"publicationDate\":\"2024-12-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Agricultural Systems\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0308521X24003871\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Agricultural Systems","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0308521X24003871","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
Characterization of common bean production regions in Brazil using machine learning techniques
PROBLEM
Understanding the interactions between genotype, environment, and management is crucial for guiding the development of new cultivars and defining strategies to maximize yield under specific environmental conditions.
OBJECTIVE
This study aimed to classify and characterize homogeneous production regions for common beans in Brazil by leveraging simulated yield data and machine learning techniques. The goal was to identify the environmental factors that define these homogeneous regions and to develop a spatiotemporal sowing calendar for rainfed (wet and dry seasons) and irrigated (winter season) production.
METHODS
The CSM-CROPGRO-Dry Bean model was used to simulate the yield of common beans in Brazilian municipalities during the wet (rainfed) season (sowing between August and December - 275 municipalities), dry (rainfed) season (sowing between January and April - 251 municipalities) and winter (irrigated) season (sowing between April and July - 59 municipalities), utilizing soil data, daily climate data (1980 to 2016), management information (sowing date and irrigation/rainfed), and genetic coefficients to reflect the performance of the BRS Estilo cultivar related to phenology, growth and yield components. To create homogeneous environmental groups and associate them with specific environmental features, we applied machine learning techniques, including K-means clustering and decision tree analysis.
RESULTS
According to the results, we identified three distinct homogeneous regions — high, medium, and low yields — for each cultivation season (wet, dry, and winter). During the wet season, regions with yields between 2326 and 3500 kg ha−1 were classified as high-yield, those between 1404 and 2325 kg ha−1 as medium-yield, and those between 500 and 1403 kg ha−1 as low-yield. In the dry season, high-yield regions had yields ranging from 2492 to 3500 kg ha−1, medium-yield regions from 1484 to 2491 kg ha−1, and low-yield regions from 500 to 1483 kg ha−1. For the winter season, high-yield regions achieved yields between 2972 and 3500 kg ha−1, medium-yield regions between 2252 and 2971 kg ha−1, and low-yield regions between 634 and 2251 kg ha−1. For rainfed seasons (wet and dry), the water stress (WSPD) had a greater impact on yield than air temperature and global solar radiation. While in the winter season, air temperature was the most relevant factor. Overall, in the wet season, delayed sowing contributed to increased yield, especially in the state of Paraná. In the dry season, delayed sowing caused a reduction in yield, particularly in the Midwest and Southeast regions. In the winter season, yield varied less significantly between sowing dates, except in the state of Mato Grosso, where harmful increases in air temperature were observed in the later months.
CONCLUSIONS
The integration of crop simulation models with machine learning tools is valuable for defining and characterizing homogeneous regions for common bean production. This approach has identified three distinct yield regions—high, medium, and low—for each crop season (wet, dry, and winter). By distinguishing these regions, this methodology supports breeding programs in developing cultivars optimized for specific environments and provides insights into how environmental factors influence crop performance. Additionally, it helps optimize sowing dates to align with favorable conditions, particularly during the wet and dry seasons, thereby contributing to reduced yield losses.
IMPLICATIONS
The classification and characterization of homogeneous production regions helps to better understand the genotype (G) x environment (E) x management (M) interactions and to adjust sowing dates to more favorable conditions, especially during the wet and dry seasons, contributing to reducing yield losses. Despite the limitations of crop modeling—such as not accounting for biotic factors and waterlogging—addressing water and air temperature stress presents an even greater challenge. In this context, crop modeling plays a crucial role in identifying effective adaptation strategies.
期刊介绍:
Agricultural Systems is an international journal that deals with interactions - among the components of agricultural systems, among hierarchical levels of agricultural systems, between agricultural and other land use systems, and between agricultural systems and their natural, social and economic environments.
The scope includes the development and application of systems analysis methodologies in the following areas:
Systems approaches in the sustainable intensification of agriculture; pathways for sustainable intensification; crop-livestock integration; farm-level resource allocation; quantification of benefits and trade-offs at farm to landscape levels; integrative, participatory and dynamic modelling approaches for qualitative and quantitative assessments of agricultural systems and decision making;
The interactions between agricultural and non-agricultural landscapes; the multiple services of agricultural systems; food security and the environment;
Global change and adaptation science; transformational adaptations as driven by changes in climate, policy, values and attitudes influencing the design of farming systems;
Development and application of farming systems design tools and methods for impact, scenario and case study analysis; managing the complexities of dynamic agricultural systems; innovation systems and multi stakeholder arrangements that support or promote change and (or) inform policy decisions.