Martin Marzidovšek , Janja Francé , Vid Podpečan , Stanka Vadnjal , Jožica Dolenc , Patricija Mozetič
{"title":"利用长期监测数据预测亚得里亚海腹泻性贝类中毒事件的可解释机器学习","authors":"Martin Marzidovšek , Janja Francé , Vid Podpečan , Stanka Vadnjal , Jožica Dolenc , Patricija Mozetič","doi":"10.1016/j.hal.2024.102728","DOIUrl":null,"url":null,"abstract":"<div><div>In this study, explainable machine learning techniques are applied to predict the toxicity of mussels in the Gulf of Trieste (Adriatic Sea) caused by harmful algal blooms. By analysing a newly created 28-year dataset containing records of toxic phytoplankton in mussel farming areas and diarrhetic shellfish toxins in mussels (<em>Mytilus galloprovincialis</em>), we train and evaluate the performance of machine learning (ML) models to accurately predict diarrhetic shellfish poisoning (DSP) events. Based on the F1 score, the random forest model provided the best prediction of toxicity results at which the harvesting of mussels is stopped according to EU regulations. Explainability methods such as permutation importance and Shapley Additive Explanations (SHAP) identified key species (<em>Dinophysis fortii</em> and <em>D. caudata</em>) and environmental factors (salinity, river discharge and precipitation) as the best predictors of DSP toxins above regulatory limits. These findings are important for improving early warning systems, which until now were based solely on empirically defined alert abundances of DSP species. They provide experts, aquaculture practitioners, and authorities with additional information to make informed risk management decisions.</div></div>","PeriodicalId":12897,"journal":{"name":"Harmful Algae","volume":"139 ","pages":"Article 102728"},"PeriodicalIF":5.5000,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Explainable machine learning for predicting diarrhetic shellfish poisoning events in the Adriatic Sea using long-term monitoring data\",\"authors\":\"Martin Marzidovšek , Janja Francé , Vid Podpečan , Stanka Vadnjal , Jožica Dolenc , Patricija Mozetič\",\"doi\":\"10.1016/j.hal.2024.102728\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In this study, explainable machine learning techniques are applied to predict the toxicity of mussels in the Gulf of Trieste (Adriatic Sea) caused by harmful algal blooms. By analysing a newly created 28-year dataset containing records of toxic phytoplankton in mussel farming areas and diarrhetic shellfish toxins in mussels (<em>Mytilus galloprovincialis</em>), we train and evaluate the performance of machine learning (ML) models to accurately predict diarrhetic shellfish poisoning (DSP) events. Based on the F1 score, the random forest model provided the best prediction of toxicity results at which the harvesting of mussels is stopped according to EU regulations. Explainability methods such as permutation importance and Shapley Additive Explanations (SHAP) identified key species (<em>Dinophysis fortii</em> and <em>D. caudata</em>) and environmental factors (salinity, river discharge and precipitation) as the best predictors of DSP toxins above regulatory limits. These findings are important for improving early warning systems, which until now were based solely on empirically defined alert abundances of DSP species. They provide experts, aquaculture practitioners, and authorities with additional information to make informed risk management decisions.</div></div>\",\"PeriodicalId\":12897,\"journal\":{\"name\":\"Harmful Algae\",\"volume\":\"139 \",\"pages\":\"Article 102728\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2024-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Harmful Algae\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568988324001616\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MARINE & FRESHWATER BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Harmful Algae","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568988324001616","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MARINE & FRESHWATER BIOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
本研究将可解释的机器学习技术应用于预测的里雅斯特湾(亚得里亚海)因有害藻类大量繁殖而导致的贻贝毒性。通过分析新创建的 28 年数据集,其中包含贻贝养殖区有毒浮游植物和贻贝(Mytilus galloprovincialis)腹泻性贝类毒素的记录,我们训练并评估了机器学习(ML)模型的性能,以准确预测腹泻性贝类中毒(DSP)事件。根据 F1 分数,随机森林模型能最好地预测毒性结果,根据欧盟法规,在该结果出现时应停止收获贻贝。可解释性方法(如置换重要性和夏普利加法解释(SHAP))确定了关键物种(Dinophysis fortii 和 D. caudata)和环境因素(盐度、河流排水量和降水量)是预测 DSP 毒素超过法规限值的最佳指标。这些发现对改进早期预警系统非常重要,因为到目前为止,早期预警系统仅基于经验定义的 DSP 物种警戒丰度。它们为专家、水产养殖从业人员和当局提供了更多信息,以便做出明智的风险管理决定。
Explainable machine learning for predicting diarrhetic shellfish poisoning events in the Adriatic Sea using long-term monitoring data
In this study, explainable machine learning techniques are applied to predict the toxicity of mussels in the Gulf of Trieste (Adriatic Sea) caused by harmful algal blooms. By analysing a newly created 28-year dataset containing records of toxic phytoplankton in mussel farming areas and diarrhetic shellfish toxins in mussels (Mytilus galloprovincialis), we train and evaluate the performance of machine learning (ML) models to accurately predict diarrhetic shellfish poisoning (DSP) events. Based on the F1 score, the random forest model provided the best prediction of toxicity results at which the harvesting of mussels is stopped according to EU regulations. Explainability methods such as permutation importance and Shapley Additive Explanations (SHAP) identified key species (Dinophysis fortii and D. caudata) and environmental factors (salinity, river discharge and precipitation) as the best predictors of DSP toxins above regulatory limits. These findings are important for improving early warning systems, which until now were based solely on empirically defined alert abundances of DSP species. They provide experts, aquaculture practitioners, and authorities with additional information to make informed risk management decisions.
期刊介绍:
This journal provides a forum to promote knowledge of harmful microalgae and macroalgae, including cyanobacteria, as well as monitoring, management and control of these organisms.