Regression tools for chemical release modeling: An additive manufacturing case study.

IF 1.5 4区 医学 Q4 ENVIRONMENTAL SCIENCES
David E Meyer, Raymond L Smith, Elizabeth Lanphear, Sudhakar Takkellapati, John D Chea, Gerardo J Ruiz-Mercado, Michael A Gonzalez, William M Barrett
{"title":"Regression tools for chemical release modeling: An additive manufacturing case study.","authors":"David E Meyer, Raymond L Smith, Elizabeth Lanphear, Sudhakar Takkellapati, John D Chea, Gerardo J Ruiz-Mercado, Michael A Gonzalez, William M Barrett","doi":"10.1080/15459624.2024.2447320","DOIUrl":null,"url":null,"abstract":"<p><p>Chemical release data are essential for performing chemical risk assessments to understand the potential exposures arising from industrial processes. Often, these data are unknown or unavailable and must be estimated. A case study of volatile organic compound releases during extrusion-based additive manufacturing is used here to explore the viability of various regression methods for predicting chemical releases to inform chemical assessments. The methods assessed in this work include linear Least Squares, Least Absolute Shrinkage and Selection Operator (LASSO) and Ridge regression, classification and regression tree, random forest model, and neural network analysis. Secondary data describing polymeric extrusion in multiple applications are curated and assembled in a dataset to support regression modeling using default parameters for the various approaches. The potential to add noise to the dataset and improve regression is evaluated using synthetic data generation. Evaluation of model performance for a common test set found all methods were able to achieve predictions within 10%-error for up to 98% of the test sample population. The degree to which this level of performance was maintained when varying the number and type of features for regression was dependent on the model type. Linear methods and neural network analysis predicted the most test samples within 10%-error for smaller numbers of features while tree-based approaches could accommodate a larger number of features. The number and type of features can be important if the desire is to make chemical-specific release predictions. The inclusion of release data from related processes generally improved test set predictions across all models while the use of synthetic data as implemented here resulted in smaller increases in test sample predictions within 10%-error. Future work should focus on improving access to primary data and optimizing models to achieve maximum predictive performance of environmental releases to support chemical risk assessment.</p>","PeriodicalId":16599,"journal":{"name":"Journal of Occupational and Environmental Hygiene","volume":" ","pages":"1-11"},"PeriodicalIF":1.5000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Occupational and Environmental Hygiene","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1080/15459624.2024.2447320","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Chemical release data are essential for performing chemical risk assessments to understand the potential exposures arising from industrial processes. Often, these data are unknown or unavailable and must be estimated. A case study of volatile organic compound releases during extrusion-based additive manufacturing is used here to explore the viability of various regression methods for predicting chemical releases to inform chemical assessments. The methods assessed in this work include linear Least Squares, Least Absolute Shrinkage and Selection Operator (LASSO) and Ridge regression, classification and regression tree, random forest model, and neural network analysis. Secondary data describing polymeric extrusion in multiple applications are curated and assembled in a dataset to support regression modeling using default parameters for the various approaches. The potential to add noise to the dataset and improve regression is evaluated using synthetic data generation. Evaluation of model performance for a common test set found all methods were able to achieve predictions within 10%-error for up to 98% of the test sample population. The degree to which this level of performance was maintained when varying the number and type of features for regression was dependent on the model type. Linear methods and neural network analysis predicted the most test samples within 10%-error for smaller numbers of features while tree-based approaches could accommodate a larger number of features. The number and type of features can be important if the desire is to make chemical-specific release predictions. The inclusion of release data from related processes generally improved test set predictions across all models while the use of synthetic data as implemented here resulted in smaller increases in test sample predictions within 10%-error. Future work should focus on improving access to primary data and optimizing models to achieve maximum predictive performance of environmental releases to support chemical risk assessment.

化学释放建模的回归工具:一个增材制造案例研究。
化学品释放数据对于进行化学品风险评估以了解工业过程中产生的潜在暴露至关重要。通常,这些数据是未知的或不可用的,必须进行估计。本文以挤压增材制造过程中挥发性有机化合物释放为例,探讨各种回归方法预测化学物质释放的可行性,为化学评估提供信息。本研究评估的方法包括线性最小二乘、最小绝对收缩和选择算子(LASSO)和Ridge回归、分类和回归树、随机森林模型和神经网络分析。在多种应用中描述聚合物挤出的次要数据被整理和组装在一个数据集中,以支持使用各种方法的默认参数进行回归建模。使用合成数据生成来评估向数据集添加噪声和改进回归的潜力。对通用测试集的模型性能评估发现,所有方法都能够在高达98%的测试样本总体中实现10%误差以内的预测。当改变回归特征的数量和类型时,维持这种性能水平的程度取决于模型类型。线性方法和神经网络分析预测,对于较小数量的特征,大多数测试样本的误差在10%以内,而基于树的方法可以容纳更多的特征。如果想要预测特定化学物质的释放,特征的数量和类型可能很重要。包含来自相关过程的发布数据通常改善了所有模型的测试集预测,而在这里实现的合成数据的使用导致测试样本预测在10%误差内的较小增长。今后的工作应侧重于改善对原始数据的获取和优化模型,以实现对环境释放的最大预测性能,以支持化学品风险评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Occupational and Environmental Hygiene
Journal of Occupational and Environmental Hygiene 环境科学-公共卫生、环境卫生与职业卫生
CiteScore
3.30
自引率
10.00%
发文量
81
审稿时长
12-24 weeks
期刊介绍: The Journal of Occupational and Environmental Hygiene ( JOEH ) is a joint publication of the American Industrial Hygiene Association (AIHA®) and ACGIH®. The JOEH is a peer-reviewed journal devoted to enhancing the knowledge and practice of occupational and environmental hygiene and safety by widely disseminating research articles and applied studies of the highest quality. The JOEH provides a written medium for the communication of ideas, methods, processes, and research in core and emerging areas of occupational and environmental hygiene. Core domains include, but are not limited to: exposure assessment, control strategies, ergonomics, and risk analysis. Emerging domains include, but are not limited to: sensor technology, emergency preparedness and response, changing workforce, and management and analysis of "big" data.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信