Lapo Renai*, Viktoriia Turkina, Tobias Hulleman, Alexandros Nikolopoulos, Andrea F. G. Gargano, Elvio D. Amato, Massimo Del Bubba and Saer Samanipour*,
{"title":"A Novel Chemical-Space-Dependent Strategy for Compound Selection in Non-target LC-HRMS Method Development Using Physicochemical and Structural Data","authors":"Lapo Renai*, Viktoriia Turkina, Tobias Hulleman, Alexandros Nikolopoulos, Andrea F. G. Gargano, Elvio D. Amato, Massimo Del Bubba and Saer Samanipour*, ","doi":"10.1021/acs.estlett.5c00759","DOIUrl":null,"url":null,"abstract":"<p >The virtual chemical space of substances, including emerging contaminants relevant to the environment and exposome, is rapidly expanding. Non-targeted analysis (NTA) by liquid chromatography–high-resolution mass spectrometry (LC-HRMS) is useful in measuring broad chemical space regions. Internal standards are typically used to optimize the selectivity and sensitivity of NTA LC-HRMS methods, assuming a linear relationship between structure and behavior across all analytes. However, this assumption fails for large, heterogeneous chemical spaces, narrowing measurable coverage to structurally similar compounds. We present a data-driven strategy for unbiased sampling of candidate structures for NTA LC-HRMS method development from extensive chemical spaces, such as the U.S. EPA’s CompTox (>1 million chemicals). The workflow maximizes physicochemical/structural diversity using precomputed PubChem descriptors (e.g., molecular weight, XLogP) and grants LC-HRMS compatibility thanks to predicted mobility and ionization efficiency from molecular fingerprints. The resulting measurable compound lists (MCLs) provide broad, heterogeneous coverage for NTA method development, validation, and boundary assessment. Applied to the CompTox space, the approach yielded MCLs with greater chemical coverage and broader predicted LC-HRMS applicability than conventional “watch list” contaminants, offering a robust framework for enhancing NTA’s measurable chemical space while preserving diversity.</p>","PeriodicalId":37,"journal":{"name":"Environmental Science & Technology Letters Environ.","volume":"12 9","pages":"1162–1168"},"PeriodicalIF":8.8000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/pdf/10.1021/acs.estlett.5c00759","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Science & Technology Letters Environ.","FirstCategoryId":"1","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.estlett.5c00759","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
The virtual chemical space of substances, including emerging contaminants relevant to the environment and exposome, is rapidly expanding. Non-targeted analysis (NTA) by liquid chromatography–high-resolution mass spectrometry (LC-HRMS) is useful in measuring broad chemical space regions. Internal standards are typically used to optimize the selectivity and sensitivity of NTA LC-HRMS methods, assuming a linear relationship between structure and behavior across all analytes. However, this assumption fails for large, heterogeneous chemical spaces, narrowing measurable coverage to structurally similar compounds. We present a data-driven strategy for unbiased sampling of candidate structures for NTA LC-HRMS method development from extensive chemical spaces, such as the U.S. EPA’s CompTox (>1 million chemicals). The workflow maximizes physicochemical/structural diversity using precomputed PubChem descriptors (e.g., molecular weight, XLogP) and grants LC-HRMS compatibility thanks to predicted mobility and ionization efficiency from molecular fingerprints. The resulting measurable compound lists (MCLs) provide broad, heterogeneous coverage for NTA method development, validation, and boundary assessment. Applied to the CompTox space, the approach yielded MCLs with greater chemical coverage and broader predicted LC-HRMS applicability than conventional “watch list” contaminants, offering a robust framework for enhancing NTA’s measurable chemical space while preserving diversity.
期刊介绍:
Environmental Science & Technology Letters serves as an international forum for brief communications on experimental or theoretical results of exceptional timeliness in all aspects of environmental science, both pure and applied. Published as soon as accepted, these communications are summarized in monthly issues. Additionally, the journal features short reviews on emerging topics in environmental science and technology.