农业产量预测中的知识信息混合机器学习

IF 7.7 1区农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY

Computers and Electronics in Agriculture Pub Date : 2024-11-09 DOI:10.1016/j.compag.2024.109606

Malte von Bloh , David Lobell , Senthold Asseng

{"title":"农业产量预测中的知识信息混合机器学习","authors":"Malte von Bloh , David Lobell , Senthold Asseng","doi":"10.1016/j.compag.2024.109606","DOIUrl":null,"url":null,"abstract":"<div><div>Research on yield predictions is dominated by two approaches: machine learning and process-based models. Machine learning has shown impressive results in capturing complex relationships but is often limited by data availability in agriculture. Conversely, process-based models, with over 60 years of research history, simulate crop growth processes using biophysical equations. Here, we present a method to transfer domain knowledge from the Decision Support System for Agrotechnology Transfer framework (DSSAT) using the Nwheat crop simulation process-model into neural networks and random forest for predicting wheat yield at field scale. Expanding the feature and distribution space involved simulating crop parameters and synthetic samples through the utilization of observed and historical weather recordings, as well as future climate projections. We demonstrated that neural networks can learn both general crop growth and yield processes and then effectively adapt to regional, field-specific growth patterns using synthetic and high-resolution field data. This approach boosts overall performance and reduces model error by 8 % compared to a purely data-centric model without process-knowledge transfer and solely trained on observed field data and features. Synthetic samples generated from warmer conditions were the greatest driver for improvements and we showed that the climate scenario for data generation is more important than the actual synthetic data set size. The proposed method shows the potential of combining process-based and machine-learning models, highlighting the potential to leverage the strengths of both methods in a collaborative manner.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"227 ","pages":"Article 109606"},"PeriodicalIF":7.7000,"publicationDate":"2024-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Knowledge informed hybrid machine learning in agricultural yield prediction\",\"authors\":\"Malte von Bloh , David Lobell , Senthold Asseng\",\"doi\":\"10.1016/j.compag.2024.109606\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Research on yield predictions is dominated by two approaches: machine learning and process-based models. Machine learning has shown impressive results in capturing complex relationships but is often limited by data availability in agriculture. Conversely, process-based models, with over 60 years of research history, simulate crop growth processes using biophysical equations. Here, we present a method to transfer domain knowledge from the Decision Support System for Agrotechnology Transfer framework (DSSAT) using the Nwheat crop simulation process-model into neural networks and random forest for predicting wheat yield at field scale. Expanding the feature and distribution space involved simulating crop parameters and synthetic samples through the utilization of observed and historical weather recordings, as well as future climate projections. We demonstrated that neural networks can learn both general crop growth and yield processes and then effectively adapt to regional, field-specific growth patterns using synthetic and high-resolution field data. This approach boosts overall performance and reduces model error by 8 % compared to a purely data-centric model without process-knowledge transfer and solely trained on observed field data and features. Synthetic samples generated from warmer conditions were the greatest driver for improvements and we showed that the climate scenario for data generation is more important than the actual synthetic data set size. The proposed method shows the potential of combining process-based and machine-learning models, highlighting the potential to leverage the strengths of both methods in a collaborative manner.</div></div>\",\"PeriodicalId\":50627,\"journal\":{\"name\":\"Computers and Electronics in Agriculture\",\"volume\":\"227 \",\"pages\":\"Article 109606\"},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2024-11-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers and Electronics in Agriculture\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0168169924009979\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURE, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169924009979","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

摘要

产量预测研究主要采用两种方法：机器学习和基于过程的模型。机器学习在捕捉复杂关系方面取得了令人瞩目的成果，但往往受到农业数据可用性的限制。相反，基于过程的模型已有 60 多年的研究历史，它使用生物物理方程模拟作物生长过程。在此，我们介绍一种方法，利用 Nwheat 农作物模拟过程模型，将农业技术转让决策支持系统框架（DSSAT）中的领域知识转移到神经网络和随机森林中，用于预测田间小麦产量。通过利用观测和历史天气记录以及未来气候预测，扩展特征和分布空间涉及模拟作物参数和合成样本。我们证明，神经网络可以学习一般的作物生长和产量过程，然后利用合成和高分辨率田间数据有效地适应区域性、田间特定的生长模式。与纯粹以数据为中心、不进行过程知识转移、只根据观测到的田间数据和特征进行训练的模型相比，这种方法提高了整体性能，并将模型误差降低了 8%。从较暖条件下生成的合成样本是提高性能的最大驱动力，而且我们发现，生成数据的气候情景比实际合成数据集的大小更为重要。所提出的方法展示了将基于过程的模型与机器学习模型相结合的潜力，突出了以协作方式利用两种方法优势的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Knowledge informed hybrid machine learning in agricultural yield prediction

Research on yield predictions is dominated by two approaches: machine learning and process-based models. Machine learning has shown impressive results in capturing complex relationships but is often limited by data availability in agriculture. Conversely, process-based models, with over 60 years of research history, simulate crop growth processes using biophysical equations. Here, we present a method to transfer domain knowledge from the Decision Support System for Agrotechnology Transfer framework (DSSAT) using the Nwheat crop simulation process-model into neural networks and random forest for predicting wheat yield at field scale. Expanding the feature and distribution space involved simulating crop parameters and synthetic samples through the utilization of observed and historical weather recordings, as well as future climate projections. We demonstrated that neural networks can learn both general crop growth and yield processes and then effectively adapt to regional, field-specific growth patterns using synthetic and high-resolution field data. This approach boosts overall performance and reduces model error by 8 % compared to a purely data-centric model without process-knowledge transfer and solely trained on observed field data and features. Synthetic samples generated from warmer conditions were the greatest driver for improvements and we showed that the climate scenario for data generation is more important than the actual synthetic data set size. The proposed method shows the potential of combining process-based and machine-learning models, highlighting the potential to leverage the strengths of both methods in a collaborative manner.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computers and Electronics in Agriculture 工程技术-计算机：跨学科应用

CiteScore

15.30

自引率

14.50%

发文量

800

审稿时长

62 days

期刊介绍： Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.