Prioritizing molecular formulae identified by non-target analysis through high-throughput modelling: application to identify compounds with high human accumulation potential from house dust

IF 4.3 3区 环境科学与生态学 Q1 CHEMISTRY, ANALYTICAL
Zhizhen Zhang, Li Li, Hui Peng and Frank Wania
{"title":"Prioritizing molecular formulae identified by non-target analysis through high-throughput modelling: application to identify compounds with high human accumulation potential from house dust","authors":"Zhizhen Zhang, Li Li, Hui Peng and Frank Wania","doi":"10.1039/D3EM00317E","DOIUrl":null,"url":null,"abstract":"<p >Because it is typically not possible to pursue compound identification efforts for all chemical features detected during non-target analysis (NTA), the need for prioritization arises. Here we propose a strategy that ranks chemical features detected in environmental samples based on a model-derived metric that quantifies a feature's attribute that makes it desirable to elucidate its structure, <em>e.g.</em>, a high potential for bioaccumulation in humans or wildlife. The procedure involves the identification of isomers that could plausibly represent the molecular formulae assigned to NTA-detected chemical features. For each isomer, the prioritization metric is calculated using properties predicted with high-throughput methods. After the molecular formulae are ranked based on the average values of the prioritization metric calculated for all isomers assigned to a formula, the highest ranked molecular formulae are prioritized for structure elucidation. We applied this workflow to features identified in house dust, using the ratio of chemical intake through dust ingestion to chemical concentration in blood (dose-to-concentration ratio, DCR) as the prioritization metric. Collections of isomers for the molecular formulae were assembled from the PubChem database and DCR was estimated using partitioning and biotransformation properties predicted for each isomer using quantitative structure property relationships. The ten top-ranked molecular formulae with notably lower average DCR-values represented mostly compounds already known to be indoor pollutants of concern, such as two polybrominated diphenyl ethers, bis(2-ethylhexyl) tetrabromophthalate, tetrabromobisphenol A, tris(1,3-dichloroisopropyl)phosphate and the azo dye disperse blue 373.</p>","PeriodicalId":74,"journal":{"name":"Environmental Science: Processes & Impacts","volume":" 11","pages":" 1817-1829"},"PeriodicalIF":4.3000,"publicationDate":"2023-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Science: Processes & Impacts","FirstCategoryId":"93","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2023/em/d3em00317e","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Because it is typically not possible to pursue compound identification efforts for all chemical features detected during non-target analysis (NTA), the need for prioritization arises. Here we propose a strategy that ranks chemical features detected in environmental samples based on a model-derived metric that quantifies a feature's attribute that makes it desirable to elucidate its structure, e.g., a high potential for bioaccumulation in humans or wildlife. The procedure involves the identification of isomers that could plausibly represent the molecular formulae assigned to NTA-detected chemical features. For each isomer, the prioritization metric is calculated using properties predicted with high-throughput methods. After the molecular formulae are ranked based on the average values of the prioritization metric calculated for all isomers assigned to a formula, the highest ranked molecular formulae are prioritized for structure elucidation. We applied this workflow to features identified in house dust, using the ratio of chemical intake through dust ingestion to chemical concentration in blood (dose-to-concentration ratio, DCR) as the prioritization metric. Collections of isomers for the molecular formulae were assembled from the PubChem database and DCR was estimated using partitioning and biotransformation properties predicted for each isomer using quantitative structure property relationships. The ten top-ranked molecular formulae with notably lower average DCR-values represented mostly compounds already known to be indoor pollutants of concern, such as two polybrominated diphenyl ethers, bis(2-ethylhexyl) tetrabromophthalate, tetrabromobisphenol A, tris(1,3-dichloroisopropyl)phosphate and the azo dye disperse blue 373.

Abstract Image

通过高通量建模对非目标分析确定的分子式进行优先排序:应用于从房屋灰尘中识别具有高人类积累潜力的化合物。
由于通常不可能对非目标分析(NTA)期间检测到的所有化学特征进行化合物识别,因此需要进行优先排序。在这里,我们提出了一种策略,根据模型衍生的指标对环境样本中检测到的化学特征进行排名,该指标量化了特征的属性,从而有必要阐明其结构,例如,在人类或野生动物中具有很高的生物累积潜力。该程序涉及识别可能合理代表NTA检测到的化学特征的分子式的异构体。对于每种异构体,使用高通量方法预测的特性来计算优先级度量。在基于为分配给一个分子式的所有异构体计算的优先度度量的平均值对分子式进行排序之后,对排名最高的分子式进行结构阐明的优先度。我们将这一工作流程应用于室内灰尘中确定的特征,使用通过灰尘摄入的化学物质摄入与血液中化学物质浓度的比率(剂量浓度比,DCR)作为优先级指标。从PubChem数据库中收集分子式的异构体,并使用定量结构-性质关系预测每个异构体的分配和生物转化性质来估计DCR。平均DCR值显著较低的十个排名靠前的分子式主要代表已知为室内污染物的化合物,如两种多溴二苯醚、二(2-乙基己基)四溴邻苯二甲酸酯、四溴双酚A、磷酸三(1,3-二氯异丙基)和偶氮染料分散蓝373。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environmental Science: Processes & Impacts
Environmental Science: Processes & Impacts CHEMISTRY, ANALYTICAL-ENVIRONMENTAL SCIENCES
CiteScore
9.50
自引率
3.60%
发文量
202
审稿时长
1 months
期刊介绍: Environmental Science: Processes & Impacts publishes high quality papers in all areas of the environmental chemical sciences, including chemistry of the air, water, soil and sediment. We welcome studies on the environmental fate and effects of anthropogenic and naturally occurring contaminants, both chemical and microbiological, as well as related natural element cycling processes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信