{"title":"Estimating chlorophyll content in tea leaves using spectral reflectance and deep learning methods","authors":"Yuta Tsuchiya , Yuhei Hirono , Rei Sonobe","doi":"10.1016/j.ecoinf.2025.103399","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate estimation of chlorophyll content in tea leaves is essential for evaluating plant health, managing fertilization, and optimizing harvest timing in precision agriculture. This study investigates the use of hyperspectral reflectance data (400–850 nm, 5 nm intervals; 91 bands) to estimate chlorophyll content in tea leaves (<em>Camellia sinensis</em>) using three deep learning models: a one-dimensional convolutional neural network (1D–CNN) tailored for spectral regression, a vision transformer (ViT) adapted for one-dimensional inputs, and a self-supervised learning (SSL) model with regression. The key innovation of this study is the introduction of a self-supervised learning framework specifically adapted for spectral data, in which an autoencoder is first trained on unlabeled spectra to learn compact and noise-tolerant representations. These pretrained features are then used in a downstream regression task to predict chlorophyll content, allowing effective use of limited labeled data. To our knowledge, this is the first application of SSL in chlorophyll estimation using high–resolution leaf–level spectral measurements. Among the three models, the SSL approach achieved the highest accuracy, with a root mean square error (RMSE) of 3.33 μg/cm<sup>2</sup>, outperforming both the 1D–CNN (5.05 μg/cm<sup>2</sup>) and ViT (4.28 μg/cm<sup>2</sup>). These findings demonstrate that SSL is particularly effective for capturing subtle spectral patterns and improving prediction performance, especially when labeled data are scarce. This study highlights the potential of combining hyperspectral sensing with advanced representation learning to non–destructively monitor chlorophyll dynamics in tea cultivation, supporting more sustainable and data–driven agricultural practices.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"91 ","pages":"Article 103399"},"PeriodicalIF":7.3000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S157495412500408X","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate estimation of chlorophyll content in tea leaves is essential for evaluating plant health, managing fertilization, and optimizing harvest timing in precision agriculture. This study investigates the use of hyperspectral reflectance data (400–850 nm, 5 nm intervals; 91 bands) to estimate chlorophyll content in tea leaves (Camellia sinensis) using three deep learning models: a one-dimensional convolutional neural network (1D–CNN) tailored for spectral regression, a vision transformer (ViT) adapted for one-dimensional inputs, and a self-supervised learning (SSL) model with regression. The key innovation of this study is the introduction of a self-supervised learning framework specifically adapted for spectral data, in which an autoencoder is first trained on unlabeled spectra to learn compact and noise-tolerant representations. These pretrained features are then used in a downstream regression task to predict chlorophyll content, allowing effective use of limited labeled data. To our knowledge, this is the first application of SSL in chlorophyll estimation using high–resolution leaf–level spectral measurements. Among the three models, the SSL approach achieved the highest accuracy, with a root mean square error (RMSE) of 3.33 μg/cm2, outperforming both the 1D–CNN (5.05 μg/cm2) and ViT (4.28 μg/cm2). These findings demonstrate that SSL is particularly effective for capturing subtle spectral patterns and improving prediction performance, especially when labeled data are scarce. This study highlights the potential of combining hyperspectral sensing with advanced representation learning to non–destructively monitor chlorophyll dynamics in tea cultivation, supporting more sustainable and data–driven agricultural practices.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.