{"title":"Vegetation optical depth as a key predictor for fire risk escalation","authors":"Dinuka Kankanige, Yi Y. Liu, Ashish Sharma","doi":"10.1016/j.ecoinf.2025.103050","DOIUrl":"10.1016/j.ecoinf.2025.103050","url":null,"abstract":"<div><div>Excluding direct consideration of vegetation dynamics reduces the accuracy in fire risk estimation. Satellite retrievals of vegetation dynamics can enhance the fire risk prediction when used as indicators of fuel water status and fuel load. However, the fire risk-vegetation relationship carries complexities as different mechanisms dominate during fire risk escalation and decline, with vegetation responding differently to each process. This study investigates whether vegetation parameters can be utilized in fire risk prediction in the absence of fire weather information, and how they can be utilized to effectively reflect on the fire risk increment from a minimum point, which is the concern in bushfire occurrence. Using the McArthur Forest Fire Danger Index (FFDI) as a measure of fire danger, a clear association with the satellite-observed vegetation optical depth (VOD) was noted for segments illustrating risk increment. An application over Australia showed clear improvements when incorporating VOD into a predictive model as compared to the use of fire risk persistence alone. On average, the VOD-induced predictive model exhibited better performance than the persistence model when evaluated over a 12-month lead span. The former model showed higher Nash-Sutcliffe efficiency (NSE) in 55.6% of pixels that indicated VOD causes FFDI. The latter performed better only in 18.2% of those pixels. Across the entire spatial domain, from the first to the ninth lead month, the VOD-induced model showed higher mean NSE (0.65 ± 0.23 to 0.52 ± 0.34) and lower or nearly equal mean root mean square error (RMSE) (4.6 ± 3.7 to 7.9 ± 5.4) than the persistence model. Our study provides insights on fire risk escalation in fire-prone regions in the absence of fire weather data. With further improvements, the proposed method can serve as a foundation for developing a novel forecast index solely based on time series data of fire risk and vegetation dynamics.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"86 ","pages":"Article 103050"},"PeriodicalIF":5.8,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143102467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Using 360-degree panoramic technology to explore the mechanisms underlying the influence of landscape features on visual landscape quality in traditional villages","authors":"Huaheng Shen , Nor Fadzila Aziz , Xinyi Lv","doi":"10.1016/j.ecoinf.2025.103036","DOIUrl":"10.1016/j.ecoinf.2025.103036","url":null,"abstract":"<div><div>Traditional villages, as integral components of cultures worldwide, preserve rich and irreplaceable tangible and intangible cultural heritage. However, due to China's rapid urbanization and the burgeoning rural tourism industry, the original landscape of traditional villages is being damaged. Therefore, their visual landscape quality must be scientifically evaluated and protected. This study explores the impact of landscape features on visual landscape quality. It considers Zhaoxing Dong and Basha Miao villages as case studies, utilizing 360-degree panoramic technology, a combination of scenic beauty estimation and semantic differential methods, and expert focus group discussions. The analyses revealed that eight landscape feature indicators significantly affect the visual landscape quality of traditional villages. Among these, landform diversity, waterscape, architectural style uniformity, historical sense of paved roads, folk activity landscape, and environmental cleanliness have a significant positive impact on visual landscape quality, whereas plant diversity and color richness exhibit complex bidirectional effects in enhancing visual landscape quality. Experts discussed the mechanisms underlying these influencing factors and proposed seven specific strategies and recommendations for protecting and enhancing the visual landscape quality of traditional villages: enhancing ecological aspects, preserving cultural heritage, improving infrastructure and environmental protection, encouraging community participation, developing relevant policies and regulations, promoting sustainable tourism development, and encouraging cross-sector cooperation with funding and technical support to ensure sustainable development.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"86 ","pages":"Article 103036"},"PeriodicalIF":5.8,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143102489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Biao Zhang , Zhichao Wang , Tiantian Ma , Zhihao Wang , Hao Li , Wenxu Ji , Mingyang He , Ao Jiao , Zhongke Feng
{"title":"Correcting forest aboveground biomass biases by incorporating independent canopy height retrieval with conventional machine learning models using GEDI and ICESat-2 data","authors":"Biao Zhang , Zhichao Wang , Tiantian Ma , Zhihao Wang , Hao Li , Wenxu Ji , Mingyang He , Ao Jiao , Zhongke Feng","doi":"10.1016/j.ecoinf.2025.103045","DOIUrl":"10.1016/j.ecoinf.2025.103045","url":null,"abstract":"<div><div>Spaceborne LiDAR satellites, including GEDI and ICESat-2, have shown significant potential in estimating aboveground biomass (AGB) using machine learning (ML) methods. In contrast to advances focused on the refinement of ML algorithms, this study aims to enhance AGB estimation accuracy by integrating an additional Canopy Height (CH) information. To obtain CH data, this study utilized three spaceborne LiDAR datasets: ICESat-2 ATL08, ICESat-2 ATL03/ATL08 fusion data, and GEDI-L2A. Random Forest (RF) and Monte Carlo-based uncertainty analysis were employed to evaluate the most suitable spaceborne LiDAR dataset for CH estimation. The accuracy of CH features in AGB estimation was then compared using both Linear Regression (LR) and RF models. The spectral saturation point was computed using a semi-variance function, and the contribution of CH features to AGB estimates was quantified across different gradients, especially when AGB neared or surpassed the saturation point. The findings demonstrate that the ATL03/08 fusion dataset surpasses the other datasets in terms of CH estimation accuracy and uncertainty, delivering enhanced precision and stability. Incorporating CH features notably improved AGB model performance, as evidenced by R<sup>2</sup> increases of 13.89 % and 10.34 % in the LR and RF models, respectively. The correction of AGB estimates across various gradients with CH features demonstrated a nonlinear pattern, initially increasing, then decreasing, and subsequently rebounding. Notable inflection points were identified at 26 Mg/ha and 123 Mg/ha, marking significant transitions in the correction trend. Both positive and negative bias corrections were observed during the correction process, with their proportions varying according to AGB values. When AGB approached or exceeded the spectral saturation point, the ability of CH features to improve positive bias correction was markedly enhanced, resulting in a greater proportion of positively corrected pixels and more significant correction values. The results of this study provide new insights into the role of CH features in AGB estimation, offering important implications for enhancing biomass mapping accuracy in forest ecosystems.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"86 ","pages":"Article 103045"},"PeriodicalIF":5.8,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143102128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Prediction of fish (Coilia nasus) catch using spatiotemporal environmental variables and random forest model in a highly turbid macrotidal estuary","authors":"Vishal Singh Rawat, Gubash Azhikodan, Katsuhide Yokoyama","doi":"10.1016/j.ecoinf.2025.103048","DOIUrl":"10.1016/j.ecoinf.2025.103048","url":null,"abstract":"<div><div>Fish populations in estuaries are declining due to the changes in environmental conditions and fishing pressures. The estuarine fish behaviour is highly variable, influenced by both upstream fluvial and downstream tidal conditions. This study aims to predict the catch per unit effort (CPUE) of the Japanese Grenadier Anchovy (<em>Coilia nasus</em>) in the Chikugo River estuary by analyzing an extensive dataset of hourly fish catches and environmental variables through Random Forest (RF) models. The fish catch data for <em>C. nasus</em>, collected at 14.6–16 km upstream from the river mouth during the spawning season of every year from 2009 to 2020 using traditional fishing methods, was used. Along with these catch records, hydro-environmental variables such as salinity, turbidity, and temperature were monitored during the same period. The longitudinal variation of these environmental variables along the estuary (0–16 km) was measured during a fortnightly tidal cycle in September 2010. A total of 32 models (M1-M32) were developed to identify the optimal set of environmental variables influencing CPUE. The analysis highlights the significant impact of variables such as salinity, suspended sediment concentration (SSC), temperature, river discharge, and mean tidal range on CPUE. The results revealed that model M19, which incorporated salinity, SSC, and discharge, achieved the highest predictive accuracy (R<sup>2</sup> = 0.89) and closely matched actual field conditions. Further, the results agree with previous research, as spatial distribution plots showed a preference for mature <em>C. nasus</em> habitats 15–16 km upstream from the river mouth. Additionally, the study found that temperature had a negligible effect on short-term CPUE predictions, likely due to its pronounced seasonal variability, suggesting that temperature may not be a critical factor for short-term CPUE predictions. This study highlights the significance of utilizing environmental variables to predict CPUE, emphasizing their role in understanding fish catch dynamics across spatiotemporal variations. The findings provide valuable insights for fisheries management, particularly in optimizing fishing zones based on environmental conditions to improve catch efficiency.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"86 ","pages":"Article 103048"},"PeriodicalIF":5.8,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143102220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Carlos Peco-Costas, Carolina Acuña-Alonso, Mario García-Ontiyuelo, Xana Álvarez
{"title":"Assessing ecological connectivity in the Serra do Cando and Serra do Candán area of Galicia: A multitemporal classification and least-cost path modelling approach","authors":"Carlos Peco-Costas, Carolina Acuña-Alonso, Mario García-Ontiyuelo, Xana Álvarez","doi":"10.1016/j.ecoinf.2025.103049","DOIUrl":"10.1016/j.ecoinf.2025.103049","url":null,"abstract":"<div><div>Ecological connectivity is essential for mitigating the anthropogenic impact caused by urbanization, infrastructure, and the production of goods on natural habitats and their fragmentation. This study assesses the state of ecological connectivity between hardwoods habitats in different years for a Natura 2000 area in Galicia, in northwestern Spain, the Serra do Cando and Candán. A supervised land cover classification was performed using two different machine learning algorithms, an Artificial Neural Network (ANN) and Random Forest (RF), and Sentinel-2 images from 2015 and 2022. A possible future land use scenario for the year 2029 was generated with Modules for Land Use Change Evaluation (MOLUSCE) plugin for QGIS from a Multilayer Perceptron ANN. Land use information was used to construct resistance surfaces on which ecological corridors were modelled as least-cost paths between habitat patches. The equivalent connected area (ECA) was calculated to quantify the level of connectivity and compare different time periods. Classifications achieved an accuracy of 91 % in RF and 88 % in ANN for the year 2015, and 92 % and 91 % respectively in 2022. The results for the year 2029 show a decrease in areas under crops and grassland according to RF and conifers in the case of ANN. The highest ECA values were reached in 2022 with 864 ha according to the RF-based methodology and 757 ha according to ANN. The area of hardwoods patches was the fundamental parameter that affects ECA. Combining remote sensing techniques with the least-cost paths method, including a simulation of future land use changes, has made it possible to compare the degree of ecological connectivity in different scenarios. This methodology shows the effects of land-cover changes and provides a tool to support decision making in land use planning.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"86 ","pages":"Article 103049"},"PeriodicalIF":5.8,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143102469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Qingqiang Ren , Yuying Zhang , Jie Yin , Dongyan Han , Min Liu , Yong Chen
{"title":"An evaluation of vulnerability settings in Ecopath with Ecosim on ecosystem hindcast and forecast skills","authors":"Qingqiang Ren , Yuying Zhang , Jie Yin , Dongyan Han , Min Liu , Yong Chen","doi":"10.1016/j.ecoinf.2025.103040","DOIUrl":"10.1016/j.ecoinf.2025.103040","url":null,"abstract":"<div><div>Ecological model fitting is a critical step in ensuring that models can reflect historical ecosystem dynamics, allowing for an improved understanding of ecological processes and potentially enhancing the reliability of future projections, despite inherent uncertainties. Vulnerability parameters (<em>v</em>), reflecting the predator-prey relationship, play a crucial role in the Ecopath with Ecosim (EwE) model fitting. However, many EwE applications have bypassed tuning the vulnerability parameters due to a lack of historical data, limiting the impacts of vulnerability-unfitted (<em>v</em>-unfitted) models on evaluating management strategies. In this study, we used model skill metrics, including bias, error, and reliability, to evaluate the hindcast and forecast skills of the <em>v</em>-unfitted models with multiple vulnerability settings. The prediction from vulnerability-fitted (<em>v</em>-fitted) model was found to have the best fitness and most accurately replicate historical ecosystem dynamics when compared to observed data. In addition, the <em>v</em>-unfitted model with trophic-level-related vulnerability setting (<em>vTL</em>) exhibited relatively better hindcast ability among the alternative <em>v</em> settings compared with <em>v</em>-fitted model. In terms of forecast skill under both reduced and increased fishing effort scenarios, only the depletion-related vulnerability setting (<em>vB</em>) was found to be robust for <em>v</em>-unfitted models comparing to <em>v</em>-fitted model predictions. We highlight the importance of examining various vulnerability settings, and providing a reference for the application of unfitted models in informing ecosystem-based fisheries management. Our results also reaffirm the critical role of time-series data in applying EwE models.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"86 ","pages":"Article 103040"},"PeriodicalIF":5.8,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143102082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Guy A. Fotso Kamga , Yacine Bouroubi , Mickaël Germain , Georges Martin , Laurent Bitjoka
{"title":"Beekeeping suitability prediction based on an adaptive neuro-fuzzy inference system and apiary level data","authors":"Guy A. Fotso Kamga , Yacine Bouroubi , Mickaël Germain , Georges Martin , Laurent Bitjoka","doi":"10.1016/j.ecoinf.2025.103015","DOIUrl":"10.1016/j.ecoinf.2025.103015","url":null,"abstract":"<div><div>The study employs a predictive modelling approach using a fuzzy inference system to assess the beekeeping potential of a geographic area. Specifically, an adaptive neuro-fuzzy inference system with subtractive clustering (ANFIS-SC) was utilized, incorporating six input variables that influence <em>Apis mellifera</em> health and productivity, and field data as the output variable reflecting the state of a colony. The results demonstrate the model’s effectiveness in predicting the suitability of areas for beekeeping. Sensitivity analysis highlighted the significant effects of relative humidity on the model’s output. The research underscores the importance of data quality, particularly in determining the local land cover quality index (LLCQI), on the outcomes. This study highlights the role of data science in enhancing precision in beekeeping and proposes its integration into management practices to support honey bee health.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"86 ","pages":"Article 103015"},"PeriodicalIF":5.8,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143102485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A comprehensive evaluation of deep learning approaches for ground-level ozone prediction across different regions","authors":"Guanjun Lin , Hang Zhao , Yufeng Chi","doi":"10.1016/j.ecoinf.2025.103024","DOIUrl":"10.1016/j.ecoinf.2025.103024","url":null,"abstract":"<div><div>Recently, the issue of near-surface ozone pollution has become a growing concern. To effectively manage and control ozone pollution, emerging deep learning (DL) techniques have been applied for future ozone concentration trend prediction, generating promising outcomes. However, existing studies employ various DL models and rely on diverse datasets to predict ozone concentrations. This leads to a lack of comprehensive evaluations of how the architecture and depth of different DL models influence the predictive accuracy of ozone concentration trends when assessed using a unified dataset. This lack of uniformity in evaluations creates a gap in our understanding of the influence of different neural network architectures and depths on ozone concentration predictions. In this work, we aim to address this research gap by conducting a systematic performance evaluation that benchmarks six prominent DL architectures, each with varying depths, to evaluate their effectiveness for predicting ozone concentrations across diverse geographical regions. Our findings indicate that the best-performing DL model in the nationwide prediction task is the one-layer bidirectional long short-term memory (Bi-LSTM) model, which achieves an <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.66, an RMSE of 15.32<span><math><mrow><mi>μ</mi><mi>g</mi><mi>⋅</mi><msup><mrow><mi>m</mi></mrow><mrow><mo>−</mo><mn>3</mn></mrow></msup></mrow></math></span>, and an MAE of 11.51<span><math><mrow><mi>μ</mi><mi>g</mi><mi>⋅</mi><msup><mrow><mi>m</mi></mrow><mrow><mo>−</mo><mn>3</mn></mrow></msup></mrow></math></span>. In contrast, the poorest-performing model in the same prediction task is the one-block transformer-based model, with an <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.57, an RMSE of 17.34<span><math><mrow><mi>μ</mi><mi>g</mi><mi>⋅</mi><msup><mrow><mi>m</mi></mrow><mrow><mo>−</mo><mn>3</mn></mrow></msup></mrow></math></span>, and an MAE of 13.3<span><math><mrow><mi>μ</mi><mi>g</mi><mi>⋅</mi><msup><mrow><mi>m</mi></mrow><mrow><mo>−</mo><mn>3</mn></mrow></msup></mrow></math></span>. Furthermore, fully connected networks (FCNs) demonstrate robust and efficient predictive performance across both nationwide and regional prediction tasks. Notably, our study reveals that no single DL model consistently performs well across all prediction tasks, emphasizing the need for tailored approaches that cater to the specific attributes of each region. Additionally, we observe that DL models with more than two hidden layers frequently suffer from overfitting. Particularly for the Bi-LSTM architecture, as the number of hidden layers increases from 1 to 7, we observe a 12% reduction in <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> performance. Our analysis also identifies the most influential meteorological factors among the top-performing DL models, offering insigh","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"86 ","pages":"Article 103024"},"PeriodicalIF":5.8,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143101990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Cameron Trotter, Huw J. Griffiths, Rowan J. Whittle
{"title":"Surveying the deep: A review of computer vision in the benthos","authors":"Cameron Trotter, Huw J. Griffiths, Rowan J. Whittle","doi":"10.1016/j.ecoinf.2024.102989","DOIUrl":"10.1016/j.ecoinf.2024.102989","url":null,"abstract":"<div><div>The analysis of image data for benthic biodiversity monitoring is now commonplace within the domain of marine ecology. Whilst advances in imaging technologies have allowed for the collection of vast quantities of data, the curation of this has traditionally been performed manually, resulting in a bottleneck whereby data is collected faster than it can be processed. Recent years have seen marine ecologists turn to the domain of computer vision to help automate this curation process. However, as the knowledge required to build such systems spans both domains, there is a high barrier to entry. To help reduce this barrier, this paper aims to provide an introduction to computer vision-based benthic biodiversity monitoring via a comprehensive literature review. To aid ecologists, key computer vision concepts are described and example use-cases highlighted. The major challenges inherent to benthic imagery for computer vision systems are explored, alongside a discussion of how current systems attempt to mitigate against these. To aid computer scientists wishing to enter the domain, an exploration of currently available open-source benthic datasets is also provided. Recommendations for future research are explored, including a move towards human-centric techniques, committing to ablation studies, reaching community agreement on open-source benchmarking datasets, and an increased use of innovative methods to allow for improved answering of key benthic ecology questions.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"86 ","pages":"Article 102989"},"PeriodicalIF":5.8,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143101993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Alef Iury Siqueira Ferreira , Nádia Felix Felipe da Silva , Fernanda Neiva Mesquita , Thierson Couto Rosa , Stephen L. Buchmann , José Neiva Mesquita-Neto
{"title":"Transformer Models improve the acoustic recognition of buzz-pollinating bee species","authors":"Alef Iury Siqueira Ferreira , Nádia Felix Felipe da Silva , Fernanda Neiva Mesquita , Thierson Couto Rosa , Stephen L. Buchmann , José Neiva Mesquita-Neto","doi":"10.1016/j.ecoinf.2025.103010","DOIUrl":"10.1016/j.ecoinf.2025.103010","url":null,"abstract":"<div><div>Buzz-pollinated crops, such as tomatoes, potatoes, kiwifruit, and blueberries, are among the highest-yielding agricultural products. The flowers of these cultivated plants are characterized by having a specialized flower morphology with poricidal anthers that require vibration to achieve a full seed set. At least 446 bee species, in 82 genera, use floral sonication (buzz pollination) to collect pollen grains as food. Identifying and classifying these diverse often look-alike bee species poses a challenge for taxonomists. Automated classification systems, based upon audible bee floral buzzes, have been investigated to meet this need. Recently, convolutional neural network (CNN) models have demonstrated superior performance in recognizing and distinguishing bee-buzzing sounds compared to classical Machine-Learning (ML) classifiers. Nonetheless, the performance of CNNs remains unsatisfactory and can be improved. Therefore, we applied a novel transformer-based neural network architecture for the task of acoustic recognition of blueberry-pollinating bee species. We further compared the performance of the Audio Spectrogram Transformer (AST) model and its variants, including Self-Supervised AST (SSAST) and Masked Autoencoding AST (MAE-AST), to that of strong baseline CNN models based on previous work, at the task of bee species recognition. We also employed data augmentation techniques and evaluated these models with a data set of bee sounds recorded during visits to blueberry flowers in Chile (518 audio samples of 15 bee species). Our results revealed that Transformer-based Neural Networks combined with pre-training and data augmentation outperformed CNN models (maximum F1-score: 64.5% ± 2; Accuracy: 82.2% ± 0.8). These innovative attention-based neural network architectures have demonstrated exceptional performance in assigning bee buzzing sounds to their respective taxonomic categories, outperforming prior deep learning models. However, transformer approaches face challenges related to small dataset size and class imbalance, similar to CNNs and classical ML algorithms. Combining pre-training with data augmentation is crucial to increase the diversity and robustness of training data sets for the acoustic recognition of bee species. We document the potential of transformer architectures to improve the performance of audible bee species identification, offering promising new avenues for bioacoustic research and pollination ecology.</div></div>","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"86 ","pages":"Article 103010"},"PeriodicalIF":5.8,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143101994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}