Harnessing Variational Autoencoders and self-organising maps for groundwater contamination assessment in peri-urban Ghana

IF 2.2 4区 地球科学 Q2 GEOSCIENCES, MULTIDISCIPLINARY
Portia Annabelle Opoku , Raymond Webrah Kazapoe , Noah Kwaku Baah , Abass Gibrilla , Geophrey K. Anornu , Nana Kobea Bonso
{"title":"Harnessing Variational Autoencoders and self-organising maps for groundwater contamination assessment in peri-urban Ghana","authors":"Portia Annabelle Opoku ,&nbsp;Raymond Webrah Kazapoe ,&nbsp;Noah Kwaku Baah ,&nbsp;Abass Gibrilla ,&nbsp;Geophrey K. Anornu ,&nbsp;Nana Kobea Bonso","doi":"10.1016/j.jafrearsci.2025.105866","DOIUrl":null,"url":null,"abstract":"<div><div>Although advanced machine learning models have demonstrated considerable potential for environmental monitoring, their application to assessing groundwater contamination in Ghana's peri-urban areas remains inadequately explored and poorly understood. To bridge this gap, this study aimed to apply advanced non-linear machine learning techniques, specifically Variational Autoencoders (VAEs) and Self-Organising Maps (SOMs), to analyse groundwater contaminants in south-eastern Ghana. The study examines intricate relationships and patterns among various pollutants to provide a comprehensive evaluation of groundwater quality. All the physicochemical parameters evaluated fell within the WHO guideline values. The VAE and SOM analyses confirm dual-source controls on groundwater chemistry in the Birimian terrains, involving both natural geogenic inputs from silicate and mafic lithologies and anthropogenic impacts from settlements. Inverse loadings across latent dimensions captured spatial heterogeneity, separating lithology-driven variables (e.g., Na<sup>+</sup>, Ca<sup>2+</sup>, EC) from pollution markers (e.g., NO<sub>3</sub><sup>−</sup>, Cl<sup>−</sup>). SOM clustering further distinguished zones of minimal human influence from areas with localised contamination, such as Pb hotspots and elevated EC and salinity linked to mineralisation or saline intrusion. Scattered peaks in F<sup>−</sup> and Cl<sup>−</sup> suggested episodic anthropogenic inputs. The results reveal notable disparities in machine learning model performance based on target variable features; the Nitrate Pollution Index (NPI) yielded a Test R<sup>2</sup> of 0.983, indicating superior predictive accuracy. Conversely, challenges with the Fluoride Pollution Index (FPI) and Pollution Index of Groundwater (PIG) exposed limitations due to unmeasured geological factors and low variability. We propose a data-driven, scalable diagnostic tool for monitoring water quality that can be integrated into national frameworks. This tool has implications for Sub-Saharan Africa and other regions similarly affected.</div></div>","PeriodicalId":14874,"journal":{"name":"Journal of African Earth Sciences","volume":"233 ","pages":"Article 105866"},"PeriodicalIF":2.2000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of African Earth Sciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1464343X25003334","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Although advanced machine learning models have demonstrated considerable potential for environmental monitoring, their application to assessing groundwater contamination in Ghana's peri-urban areas remains inadequately explored and poorly understood. To bridge this gap, this study aimed to apply advanced non-linear machine learning techniques, specifically Variational Autoencoders (VAEs) and Self-Organising Maps (SOMs), to analyse groundwater contaminants in south-eastern Ghana. The study examines intricate relationships and patterns among various pollutants to provide a comprehensive evaluation of groundwater quality. All the physicochemical parameters evaluated fell within the WHO guideline values. The VAE and SOM analyses confirm dual-source controls on groundwater chemistry in the Birimian terrains, involving both natural geogenic inputs from silicate and mafic lithologies and anthropogenic impacts from settlements. Inverse loadings across latent dimensions captured spatial heterogeneity, separating lithology-driven variables (e.g., Na+, Ca2+, EC) from pollution markers (e.g., NO3, Cl). SOM clustering further distinguished zones of minimal human influence from areas with localised contamination, such as Pb hotspots and elevated EC and salinity linked to mineralisation or saline intrusion. Scattered peaks in F and Cl suggested episodic anthropogenic inputs. The results reveal notable disparities in machine learning model performance based on target variable features; the Nitrate Pollution Index (NPI) yielded a Test R2 of 0.983, indicating superior predictive accuracy. Conversely, challenges with the Fluoride Pollution Index (FPI) and Pollution Index of Groundwater (PIG) exposed limitations due to unmeasured geological factors and low variability. We propose a data-driven, scalable diagnostic tool for monitoring water quality that can be integrated into national frameworks. This tool has implications for Sub-Saharan Africa and other regions similarly affected.
利用变分自动编码器和自组织地图在加纳城郊地下水污染评估
尽管先进的机器学习模型已经证明了环境监测的巨大潜力,但它们在评估加纳城郊地区地下水污染方面的应用仍然没有得到充分的探索和了解。为了弥补这一差距,本研究旨在应用先进的非线性机器学习技术,特别是变分自编码器(VAEs)和自组织地图(SOMs),来分析加纳东南部的地下水污染物。该研究考察了各种污染物之间复杂的关系和模式,以提供地下水质量的综合评价。评估的所有理化参数均在世卫组织指导值范围内。VAE和SOM分析证实了Birimian地区地下水化学的双重来源控制,包括硅酸盐和基性岩性的自然地质输入和住区的人为影响。潜在维度上的逆负荷捕获了空间异质性,将岩性驱动的变量(如Na+、Ca2+、EC)与污染标记(如NO3−、Cl−)分离。SOM聚类进一步区分了人类影响最小的区域和局部污染区域,如铅热点和与矿化或盐水入侵相关的EC和盐度升高。F -和Cl -的分散峰值提示偶发性人为输入。结果表明,基于目标变量特征的机器学习模型性能存在显著差异;硝酸盐污染指数(NPI)的检验R2为0.983,表明其预测精度较高。相反,氟化物污染指数(FPI)和地下水污染指数(PIG)的挑战暴露出由于未测量的地质因素和低变异性而存在的局限性。我们提出了一种数据驱动的、可扩展的水质监测诊断工具,可将其纳入国家框架。这一工具对撒哈拉以南非洲和其他受到类似影响的地区也有影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of African Earth Sciences
Journal of African Earth Sciences 地学-地球科学综合
CiteScore
4.70
自引率
4.30%
发文量
240
审稿时长
12 months
期刊介绍: The Journal of African Earth Sciences sees itself as the prime geological journal for all aspects of the Earth Sciences about the African plate. Papers dealing with peripheral areas are welcome if they demonstrate a tight link with Africa. The Journal publishes high quality, peer-reviewed scientific papers. It is devoted primarily to research papers but short communications relating to new developments of broad interest, reviews and book reviews will also be considered. Papers must have international appeal and should present work of more regional than local significance and dealing with well identified and justified scientific questions. Specialised technical papers, analytical or exploration reports must be avoided. Papers on applied geology should preferably be linked to such core disciplines and must be addressed to a more general geoscientific audience.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信