{"title":"Enhancing particulate matter risk assessment with novel machine learning-driven toxicity threshold prediction","authors":"","doi":"10.1016/j.engappai.2024.109531","DOIUrl":null,"url":null,"abstract":"<div><div>Airborne particulate matter (PM) poses significant health risks, necessitating accurate toxicity threshold determination for effective risk assessment. This study introduces a novel machine-learning (ML) approach to predict PM toxicity thresholds and identify the key physico-chemical and exposure characteristics. Five machine learning algorithms — logistic regression, support vector classifier, decision tree, random forest, and extreme gradient boosting — were employed to develop predictive models using a comprehensive dataset from existing studies. We developed models using the initial dataset and a class weight approach to address data imbalance. For the imbalanced data, the Random Forest classifier outperformed others with 87% accuracy, 81% recall, and the fewest false negatives (23). In the class weight approach, the Support Vector Classifier minimized false negatives (21), while the Random Forest model achieved superior overall performance with 86% accuracy, 80% recall, and an F1-score of 82%. Furthermore, eXplainable Artificial Intelligence (XAI) techniques, specifically SHAP (SHapley Additive exPlanations) values, were utilized to quantify feature contributions to predictions, offering insights beyond traditional laboratory approaches. This study represents the first application of machine learning for predicting PM toxicity thresholds, providing a robust tool for health risk assessment. The proposed methodology offers a time- and cost-effective alternative to classical laboratory tests, potentially revolutionizing PM toxicity threshold determination in scientific and epidemiological research. This innovative approach has significant implications for shaping regulatory policies and designing targeted interventions to mitigate health risks associated with airborne PM.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624016890","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Airborne particulate matter (PM) poses significant health risks, necessitating accurate toxicity threshold determination for effective risk assessment. This study introduces a novel machine-learning (ML) approach to predict PM toxicity thresholds and identify the key physico-chemical and exposure characteristics. Five machine learning algorithms — logistic regression, support vector classifier, decision tree, random forest, and extreme gradient boosting — were employed to develop predictive models using a comprehensive dataset from existing studies. We developed models using the initial dataset and a class weight approach to address data imbalance. For the imbalanced data, the Random Forest classifier outperformed others with 87% accuracy, 81% recall, and the fewest false negatives (23). In the class weight approach, the Support Vector Classifier minimized false negatives (21), while the Random Forest model achieved superior overall performance with 86% accuracy, 80% recall, and an F1-score of 82%. Furthermore, eXplainable Artificial Intelligence (XAI) techniques, specifically SHAP (SHapley Additive exPlanations) values, were utilized to quantify feature contributions to predictions, offering insights beyond traditional laboratory approaches. This study represents the first application of machine learning for predicting PM toxicity thresholds, providing a robust tool for health risk assessment. The proposed methodology offers a time- and cost-effective alternative to classical laboratory tests, potentially revolutionizing PM toxicity threshold determination in scientific and epidemiological research. This innovative approach has significant implications for shaping regulatory policies and designing targeted interventions to mitigate health risks associated with airborne PM.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.