Pengpeng Zhang , Bing Lu , Jiali Shang , Changwei Tan , Qihan Xu , Lei Shi , Shujian Jin , Xingyu Wang , Yunfei Jiang , Yadong Yang , Huadong Zang , Junyong Ge , Zhaohai Zeng
{"title":"TKSF-KAN: Transformer-enhanced oat yield modeling and transferability across major oat-producing regions in China using UAV multisource data","authors":"Pengpeng Zhang , Bing Lu , Jiali Shang , Changwei Tan , Qihan Xu , Lei Shi , Shujian Jin , Xingyu Wang , Yunfei Jiang , Yadong Yang , Huadong Zang , Junyong Ge , Zhaohai Zeng","doi":"10.1016/j.isprsjprs.2025.04.004","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate and efficient estimation of crop yield is crucial for enhancing crop variety testing, optimizing cultivation practices, and supporting effective crop management to ensure sustainable production. However, remote sensing-based yield models often face limitations due to geographical variability and diverse cultivation techniques, affecting both their model accuracy and transferability. This study utilized multiple features, including vegetation indices (VI), color indices (CI), texture features (TF), structural indices (SI), and canopy thermal information (TIR), obtained from RGB, multispectral, and thermal infrared sensors of unmanned aerial vehicles (UAV), to create six scenarios for oat yield estimation across major oat-producing regions in China. We developed a novel deep learning-based architecture, TKSF-KAN, which combines Transformer and Kolmogorov–Arnold Network (KAN) to fuse multimodal data across key growth stages, and benchmarked its performance against stacking ensemble learning (SEL) and standalone Transformer model. While SEL demonstrated the highest accuracy in single-modal scenarios, TKSF-KAN outperformed SEL in multimodal settings (R<sup>2</sup> = 0.76–0.81). Particularly, TKSF-KAN, with integrated VI, CI, TF, SI, and TIR inputs improved R<sup>2</sup> by 53.77 % compared with single-modal data sources. By combining Adaptive Batch Normalization (AdaBN) with fine-tuning strategies, transfer performance improved as the proportion of fine-tuned data increased, reaching a peak R<sup>2</sup> of 0.83 at one study site. In contrast, transferability was more influenced by cultivation practices at another site, with a maximum R<sup>2</sup> of 0.78. This study presents an innovative framework that seamlessly integrates agricultural practices with remote sensing and transfer learning methodologies, offering a more robust and scalable solution for yield prediction and advancing precision of agricultural management.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"224 ","pages":"Pages 166-186"},"PeriodicalIF":10.6000,"publicationDate":"2025-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISPRS Journal of Photogrammetry and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0924271625001406","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate and efficient estimation of crop yield is crucial for enhancing crop variety testing, optimizing cultivation practices, and supporting effective crop management to ensure sustainable production. However, remote sensing-based yield models often face limitations due to geographical variability and diverse cultivation techniques, affecting both their model accuracy and transferability. This study utilized multiple features, including vegetation indices (VI), color indices (CI), texture features (TF), structural indices (SI), and canopy thermal information (TIR), obtained from RGB, multispectral, and thermal infrared sensors of unmanned aerial vehicles (UAV), to create six scenarios for oat yield estimation across major oat-producing regions in China. We developed a novel deep learning-based architecture, TKSF-KAN, which combines Transformer and Kolmogorov–Arnold Network (KAN) to fuse multimodal data across key growth stages, and benchmarked its performance against stacking ensemble learning (SEL) and standalone Transformer model. While SEL demonstrated the highest accuracy in single-modal scenarios, TKSF-KAN outperformed SEL in multimodal settings (R2 = 0.76–0.81). Particularly, TKSF-KAN, with integrated VI, CI, TF, SI, and TIR inputs improved R2 by 53.77 % compared with single-modal data sources. By combining Adaptive Batch Normalization (AdaBN) with fine-tuning strategies, transfer performance improved as the proportion of fine-tuned data increased, reaching a peak R2 of 0.83 at one study site. In contrast, transferability was more influenced by cultivation practices at another site, with a maximum R2 of 0.78. This study presents an innovative framework that seamlessly integrates agricultural practices with remote sensing and transfer learning methodologies, offering a more robust and scalable solution for yield prediction and advancing precision of agricultural management.
期刊介绍:
The ISPRS Journal of Photogrammetry and Remote Sensing (P&RS) serves as the official journal of the International Society for Photogrammetry and Remote Sensing (ISPRS). It acts as a platform for scientists and professionals worldwide who are involved in various disciplines that utilize photogrammetry, remote sensing, spatial information systems, computer vision, and related fields. The journal aims to facilitate communication and dissemination of advancements in these disciplines, while also acting as a comprehensive source of reference and archive.
P&RS endeavors to publish high-quality, peer-reviewed research papers that are preferably original and have not been published before. These papers can cover scientific/research, technological development, or application/practical aspects. Additionally, the journal welcomes papers that are based on presentations from ISPRS meetings, as long as they are considered significant contributions to the aforementioned fields.
In particular, P&RS encourages the submission of papers that are of broad scientific interest, showcase innovative applications (especially in emerging fields), have an interdisciplinary focus, discuss topics that have received limited attention in P&RS or related journals, or explore new directions in scientific or professional realms. It is preferred that theoretical papers include practical applications, while papers focusing on systems and applications should include a theoretical background.