{"title":"Advancing predictive modeling in conventional solar stills: a deep learning approach leveraging data augmentation and convolutional neural networks","authors":"Hashim H. Migaybil , Bhushan Gopaluni","doi":"10.1016/j.enconman.2025.120565","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate forecasting of freshwater productivity from conventional single-slope solar stills is crucial for enhancing operational efficiency and minimizing capital costs. A persistent challenge in this domain is the scarcity of experimental data, which limits the training of reliable predictive models. This study proposes a data-efficient forecasting framework that integrates a one-dimensional convolutional neural network (CNN-1D) with time-series data augmentation. Gaussian noise sampled from <span><math><mi>N</mi></math></span>(0, 0.01<sup>2</sup>) was applied exclusively to the training set, generating six augmented samples per instance. Both the augmentation factor (six) and the look-back window (seven days) were selected through systematic optimization, ensuring preservation of temporal dependencies. The CNN-1D architecture comprised three convolutional layers with 128 filters, ReLU activations, a flattening stage, and a dense regression output layer. Hyperparameters—including learning rate, batch size, kernel size, and regularization strength—were fine-tuned using Tree-structured Parzen Estimator (TPE) optimization with a maximum of 50 trials, where the best-performing configuration achieved the lowest loss. Model training employed a feed-forward backpropagation algorithm with 365 daily observations to predict freshwater yield (P<sub>std</sub>, L/day). Benchmarking against an optimized support vector regression (SVR) model with a radial basis function kernel revealed that the augmented CNN-1D achieved superior performance (RMSE = 0.04, MAE = 0.03, OIMP = 0.97), consistently outperforming both the baseline CNN-1D and the optimized SVR. Residual analyses confirmed its robustness, minimal bias, and strong generalization across unseen data. These findings demonstrate that combining augmentation with hierarchical feature extraction enables a scalable and computationally efficient predictive tool for solar still performance, offering significant potential for sustainable freshwater management in arid and data-constrained regions.</div></div>","PeriodicalId":11664,"journal":{"name":"Energy Conversion and Management","volume":"346 ","pages":"Article 120565"},"PeriodicalIF":10.9000,"publicationDate":"2025-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy Conversion and Management","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0196890425010891","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate forecasting of freshwater productivity from conventional single-slope solar stills is crucial for enhancing operational efficiency and minimizing capital costs. A persistent challenge in this domain is the scarcity of experimental data, which limits the training of reliable predictive models. This study proposes a data-efficient forecasting framework that integrates a one-dimensional convolutional neural network (CNN-1D) with time-series data augmentation. Gaussian noise sampled from (0, 0.012) was applied exclusively to the training set, generating six augmented samples per instance. Both the augmentation factor (six) and the look-back window (seven days) were selected through systematic optimization, ensuring preservation of temporal dependencies. The CNN-1D architecture comprised three convolutional layers with 128 filters, ReLU activations, a flattening stage, and a dense regression output layer. Hyperparameters—including learning rate, batch size, kernel size, and regularization strength—were fine-tuned using Tree-structured Parzen Estimator (TPE) optimization with a maximum of 50 trials, where the best-performing configuration achieved the lowest loss. Model training employed a feed-forward backpropagation algorithm with 365 daily observations to predict freshwater yield (Pstd, L/day). Benchmarking against an optimized support vector regression (SVR) model with a radial basis function kernel revealed that the augmented CNN-1D achieved superior performance (RMSE = 0.04, MAE = 0.03, OIMP = 0.97), consistently outperforming both the baseline CNN-1D and the optimized SVR. Residual analyses confirmed its robustness, minimal bias, and strong generalization across unseen data. These findings demonstrate that combining augmentation with hierarchical feature extraction enables a scalable and computationally efficient predictive tool for solar still performance, offering significant potential for sustainable freshwater management in arid and data-constrained regions.
期刊介绍:
The journal Energy Conversion and Management provides a forum for publishing original contributions and comprehensive technical review articles of interdisciplinary and original research on all important energy topics.
The topics considered include energy generation, utilization, conversion, storage, transmission, conservation, management and sustainability. These topics typically involve various types of energy such as mechanical, thermal, nuclear, chemical, electromagnetic, magnetic and electric. These energy types cover all known energy resources, including renewable resources (e.g., solar, bio, hydro, wind, geothermal and ocean energy), fossil fuels and nuclear resources.