David E Meyer, Raymond L Smith, Elizabeth Lanphear, Sudhakar Takkellapati, John D Chea, Gerardo J Ruiz-Mercado, Michael A Gonzalez, William M Barrett
{"title":"化学释放建模的回归工具:一个增材制造案例研究。","authors":"David E Meyer, Raymond L Smith, Elizabeth Lanphear, Sudhakar Takkellapati, John D Chea, Gerardo J Ruiz-Mercado, Michael A Gonzalez, William M Barrett","doi":"10.1080/15459624.2024.2447320","DOIUrl":null,"url":null,"abstract":"<p><p>Chemical release data are essential for performing chemical risk assessments to understand the potential exposures arising from industrial processes. Often, these data are unknown or unavailable and must be estimated. A case study of volatile organic compound releases during extrusion-based additive manufacturing is used here to explore the viability of various regression methods for predicting chemical releases to inform chemical assessments. The methods assessed in this work include linear Least Squares, Least Absolute Shrinkage and Selection Operator (LASSO) and Ridge regression, classification and regression tree, random forest model, and neural network analysis. Secondary data describing polymeric extrusion in multiple applications are curated and assembled in a dataset to support regression modeling using default parameters for the various approaches. The potential to add noise to the dataset and improve regression is evaluated using synthetic data generation. Evaluation of model performance for a common test set found all methods were able to achieve predictions within 10%-error for up to 98% of the test sample population. The degree to which this level of performance was maintained when varying the number and type of features for regression was dependent on the model type. Linear methods and neural network analysis predicted the most test samples within 10%-error for smaller numbers of features while tree-based approaches could accommodate a larger number of features. The number and type of features can be important if the desire is to make chemical-specific release predictions. The inclusion of release data from related processes generally improved test set predictions across all models while the use of synthetic data as implemented here resulted in smaller increases in test sample predictions within 10%-error. Future work should focus on improving access to primary data and optimizing models to achieve maximum predictive performance of environmental releases to support chemical risk assessment.</p>","PeriodicalId":16599,"journal":{"name":"Journal of Occupational and Environmental Hygiene","volume":" ","pages":"1-11"},"PeriodicalIF":1.5000,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Regression tools for chemical release modeling: An additive manufacturing case study.\",\"authors\":\"David E Meyer, Raymond L Smith, Elizabeth Lanphear, Sudhakar Takkellapati, John D Chea, Gerardo J Ruiz-Mercado, Michael A Gonzalez, William M Barrett\",\"doi\":\"10.1080/15459624.2024.2447320\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Chemical release data are essential for performing chemical risk assessments to understand the potential exposures arising from industrial processes. Often, these data are unknown or unavailable and must be estimated. A case study of volatile organic compound releases during extrusion-based additive manufacturing is used here to explore the viability of various regression methods for predicting chemical releases to inform chemical assessments. The methods assessed in this work include linear Least Squares, Least Absolute Shrinkage and Selection Operator (LASSO) and Ridge regression, classification and regression tree, random forest model, and neural network analysis. Secondary data describing polymeric extrusion in multiple applications are curated and assembled in a dataset to support regression modeling using default parameters for the various approaches. The potential to add noise to the dataset and improve regression is evaluated using synthetic data generation. Evaluation of model performance for a common test set found all methods were able to achieve predictions within 10%-error for up to 98% of the test sample population. The degree to which this level of performance was maintained when varying the number and type of features for regression was dependent on the model type. Linear methods and neural network analysis predicted the most test samples within 10%-error for smaller numbers of features while tree-based approaches could accommodate a larger number of features. The number and type of features can be important if the desire is to make chemical-specific release predictions. The inclusion of release data from related processes generally improved test set predictions across all models while the use of synthetic data as implemented here resulted in smaller increases in test sample predictions within 10%-error. Future work should focus on improving access to primary data and optimizing models to achieve maximum predictive performance of environmental releases to support chemical risk assessment.</p>\",\"PeriodicalId\":16599,\"journal\":{\"name\":\"Journal of Occupational and Environmental Hygiene\",\"volume\":\" \",\"pages\":\"1-11\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-01-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Occupational and Environmental Hygiene\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1080/15459624.2024.2447320\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Occupational and Environmental Hygiene","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1080/15459624.2024.2447320","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Regression tools for chemical release modeling: An additive manufacturing case study.
Chemical release data are essential for performing chemical risk assessments to understand the potential exposures arising from industrial processes. Often, these data are unknown or unavailable and must be estimated. A case study of volatile organic compound releases during extrusion-based additive manufacturing is used here to explore the viability of various regression methods for predicting chemical releases to inform chemical assessments. The methods assessed in this work include linear Least Squares, Least Absolute Shrinkage and Selection Operator (LASSO) and Ridge regression, classification and regression tree, random forest model, and neural network analysis. Secondary data describing polymeric extrusion in multiple applications are curated and assembled in a dataset to support regression modeling using default parameters for the various approaches. The potential to add noise to the dataset and improve regression is evaluated using synthetic data generation. Evaluation of model performance for a common test set found all methods were able to achieve predictions within 10%-error for up to 98% of the test sample population. The degree to which this level of performance was maintained when varying the number and type of features for regression was dependent on the model type. Linear methods and neural network analysis predicted the most test samples within 10%-error for smaller numbers of features while tree-based approaches could accommodate a larger number of features. The number and type of features can be important if the desire is to make chemical-specific release predictions. The inclusion of release data from related processes generally improved test set predictions across all models while the use of synthetic data as implemented here resulted in smaller increases in test sample predictions within 10%-error. Future work should focus on improving access to primary data and optimizing models to achieve maximum predictive performance of environmental releases to support chemical risk assessment.
期刊介绍:
The Journal of Occupational and Environmental Hygiene ( JOEH ) is a joint publication of the American Industrial Hygiene Association (AIHA®) and ACGIH®. The JOEH is a peer-reviewed journal devoted to enhancing the knowledge and practice of occupational and environmental hygiene and safety by widely disseminating research articles and applied studies of the highest quality.
The JOEH provides a written medium for the communication of ideas, methods, processes, and research in core and emerging areas of occupational and environmental hygiene. Core domains include, but are not limited to: exposure assessment, control strategies, ergonomics, and risk analysis. Emerging domains include, but are not limited to: sensor technology, emergency preparedness and response, changing workforce, and management and analysis of "big" data.