Yi-Hui Zhou , Paul J. Gallins , Ivan Rusyn , Fred A. Wright
{"title":"揭示多维新方法(NAMs)数据中重要的直接和中介关系的方法:以石油uvcb危害评估为例","authors":"Yi-Hui Zhou , Paul J. Gallins , Ivan Rusyn , Fred A. Wright","doi":"10.1016/j.scitotenv.2025.179724","DOIUrl":null,"url":null,"abstract":"<div><div>New Approach Methods (NAMs) encompass a wide range of data types; it is increasingly common to have highly multi-dimensional data (e.g., cellular, molecular and gene expression effects) on the same chemicals. In addition, chemical structure descriptors (for mono-constituent substances) or fractional composition (for complex substances) inform similarity hypotheses for read-across. Still, the utility of these multi-dimensional datasets for decision-making is difficult to ascertain. To address this challenge, we hypothesized that correlation and mediation analyses methods can be used to uncover significant and interpretable relationships in complex NAMs datasets. We used previously published data on 141 petroleum UVCBs (substances of unknown or variable composition, complex reaction products and biological materials) that included (i) characterization of the polycyclic aromatic compound (PAC) content, (ii) 42 bioactivity measurements from 12 human cell types, and (iii) transcriptomic data from 6 cell types. We explored the relationships among data types and determined how these data can be used for bioactivity-based prioritization. We found that PAC content was highly informative for bioactivity prediction, while the addition of transcriptomic data provided modest improvements. We then applied the statistical procedure of mediation analysis to uncover relationships among transcriptomics, PAC, and bioactivity. The strongest relationships appeared to be nearly completely mediated, and phenotypes with high transcriptomic mediation tended to have high correlation with PAC content. This study shows how a mediation analysis approach can be used to uncover relationships in multi-dimensional NAMs datasets and provides further insights into strategies for hazard prioritization using a combination of transcriptomic and bioactivity data.</div></div>","PeriodicalId":422,"journal":{"name":"Science of the Total Environment","volume":"985 ","pages":"Article 179724"},"PeriodicalIF":8.2000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An approach to uncover significant direct and mediated relationships in multi-dimensional new approach methods (NAMs) data: A case study of hazard evaluation of petroleum UVCBs\",\"authors\":\"Yi-Hui Zhou , Paul J. Gallins , Ivan Rusyn , Fred A. Wright\",\"doi\":\"10.1016/j.scitotenv.2025.179724\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>New Approach Methods (NAMs) encompass a wide range of data types; it is increasingly common to have highly multi-dimensional data (e.g., cellular, molecular and gene expression effects) on the same chemicals. In addition, chemical structure descriptors (for mono-constituent substances) or fractional composition (for complex substances) inform similarity hypotheses for read-across. Still, the utility of these multi-dimensional datasets for decision-making is difficult to ascertain. To address this challenge, we hypothesized that correlation and mediation analyses methods can be used to uncover significant and interpretable relationships in complex NAMs datasets. We used previously published data on 141 petroleum UVCBs (substances of unknown or variable composition, complex reaction products and biological materials) that included (i) characterization of the polycyclic aromatic compound (PAC) content, (ii) 42 bioactivity measurements from 12 human cell types, and (iii) transcriptomic data from 6 cell types. We explored the relationships among data types and determined how these data can be used for bioactivity-based prioritization. We found that PAC content was highly informative for bioactivity prediction, while the addition of transcriptomic data provided modest improvements. We then applied the statistical procedure of mediation analysis to uncover relationships among transcriptomics, PAC, and bioactivity. The strongest relationships appeared to be nearly completely mediated, and phenotypes with high transcriptomic mediation tended to have high correlation with PAC content. This study shows how a mediation analysis approach can be used to uncover relationships in multi-dimensional NAMs datasets and provides further insights into strategies for hazard prioritization using a combination of transcriptomic and bioactivity data.</div></div>\",\"PeriodicalId\":422,\"journal\":{\"name\":\"Science of the Total Environment\",\"volume\":\"985 \",\"pages\":\"Article 179724\"},\"PeriodicalIF\":8.2000,\"publicationDate\":\"2025-05-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Science of the Total Environment\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0048969725013658\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science of the Total Environment","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0048969725013658","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
An approach to uncover significant direct and mediated relationships in multi-dimensional new approach methods (NAMs) data: A case study of hazard evaluation of petroleum UVCBs
New Approach Methods (NAMs) encompass a wide range of data types; it is increasingly common to have highly multi-dimensional data (e.g., cellular, molecular and gene expression effects) on the same chemicals. In addition, chemical structure descriptors (for mono-constituent substances) or fractional composition (for complex substances) inform similarity hypotheses for read-across. Still, the utility of these multi-dimensional datasets for decision-making is difficult to ascertain. To address this challenge, we hypothesized that correlation and mediation analyses methods can be used to uncover significant and interpretable relationships in complex NAMs datasets. We used previously published data on 141 petroleum UVCBs (substances of unknown or variable composition, complex reaction products and biological materials) that included (i) characterization of the polycyclic aromatic compound (PAC) content, (ii) 42 bioactivity measurements from 12 human cell types, and (iii) transcriptomic data from 6 cell types. We explored the relationships among data types and determined how these data can be used for bioactivity-based prioritization. We found that PAC content was highly informative for bioactivity prediction, while the addition of transcriptomic data provided modest improvements. We then applied the statistical procedure of mediation analysis to uncover relationships among transcriptomics, PAC, and bioactivity. The strongest relationships appeared to be nearly completely mediated, and phenotypes with high transcriptomic mediation tended to have high correlation with PAC content. This study shows how a mediation analysis approach can be used to uncover relationships in multi-dimensional NAMs datasets and provides further insights into strategies for hazard prioritization using a combination of transcriptomic and bioactivity data.
期刊介绍:
The Science of the Total Environment is an international journal dedicated to scientific research on the environment and its interaction with humanity. It covers a wide range of disciplines and seeks to publish innovative, hypothesis-driven, and impactful research that explores the entire environment, including the atmosphere, lithosphere, hydrosphere, biosphere, and anthroposphere.
The journal's updated Aims & Scope emphasizes the importance of interdisciplinary environmental research with broad impact. Priority is given to studies that advance fundamental understanding and explore the interconnectedness of multiple environmental spheres. Field studies are preferred, while laboratory experiments must demonstrate significant methodological advancements or mechanistic insights with direct relevance to the environment.