{"title":"从LC-HRMS数据中提取化学暴露特征的非目标数据挖掘策略:应用于早期生命暴露评估的胎粪","authors":"Dylan Saunier , Éric Venot , Blanche Guillon , Sylvain Dechaumet , Florence Castelli , Etienne Thevenot , François Fenaille , Blandine de Lauzon-Guillain , Karine Adel-Patient , Estelle Rathahao-Paris","doi":"10.1016/j.aca.2025.344751","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Exposome research has expanded rapidly in recent years, driven by advances in analytical techniques such as liquid chromatography-high-resolution mass spectrometry (LC-HRMS), which enable broad and sensitive chemical coverage. Targeted methods focus on known compounds, while untargeted metabolomic approaches provide a more holistic view and may reveal exposure biomarkers, but they are not specifically designed to detect exogenous chemicals. Identifying relevant exposure markers within the vast and complex datasets generated by untargeted LC-HRMS data remains a significant analytical and computational challenge, requiring innovative data mining strategies.</div></div><div><h3>Results</h3><div>We developed a novel untargeted data mining strategy to extract exogenous chemical signatures from complex LC-HRMS datasets. The approach integrates isotopic signature enrichment (ISE), biotransformation-informed feature selection and an “exposure rate” metric. When applied to meconium data from the EDEN cohort, the strategy led to a six-fold reduction in the number of features by retaining only those exhibiting valid carbon isotope patterns. Mass defect plots revealed signatures of suspect monohalogenated species and putative conjugated and non-conjugated metabolites in a specific region. Incorporating ISE results into the chemical formula prediction significantly reduced the number of candidates, improving annotation efficiency. <em>In utero</em> exposure to xenobiotics was supported by the detection of known exposure markers such as acetaminophen, caffeine and nicotine. These results demonstrate the method's potential to uncover exposomic signals in complex biological matrices.</div></div><div><h3>Significance</h3><div>This study presents a novel data mining strategy that reduces the complexity of untargeted LC-HRMS data by retaining chemically reliable features based on isotopic signatures. As a proof of concept, this strategy enables the detection of specific chemical signatures and exogenous compounds without prior knowledge. Its adaptability to various biological matrices and its compatibility with different high-resolution mass spectrometry platforms make this strategy a valuable tool for exposome research and early-life exposure assessment.</div></div>","PeriodicalId":240,"journal":{"name":"Analytica Chimica Acta","volume":"1379 ","pages":"Article 344751"},"PeriodicalIF":6.0000,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An untargeted data mining strategy for extracting chemical exposome signatures from LC-HRMS data: Application to meconium for early-life exposure assessment\",\"authors\":\"Dylan Saunier , Éric Venot , Blanche Guillon , Sylvain Dechaumet , Florence Castelli , Etienne Thevenot , François Fenaille , Blandine de Lauzon-Guillain , Karine Adel-Patient , Estelle Rathahao-Paris\",\"doi\":\"10.1016/j.aca.2025.344751\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Exposome research has expanded rapidly in recent years, driven by advances in analytical techniques such as liquid chromatography-high-resolution mass spectrometry (LC-HRMS), which enable broad and sensitive chemical coverage. Targeted methods focus on known compounds, while untargeted metabolomic approaches provide a more holistic view and may reveal exposure biomarkers, but they are not specifically designed to detect exogenous chemicals. Identifying relevant exposure markers within the vast and complex datasets generated by untargeted LC-HRMS data remains a significant analytical and computational challenge, requiring innovative data mining strategies.</div></div><div><h3>Results</h3><div>We developed a novel untargeted data mining strategy to extract exogenous chemical signatures from complex LC-HRMS datasets. The approach integrates isotopic signature enrichment (ISE), biotransformation-informed feature selection and an “exposure rate” metric. When applied to meconium data from the EDEN cohort, the strategy led to a six-fold reduction in the number of features by retaining only those exhibiting valid carbon isotope patterns. Mass defect plots revealed signatures of suspect monohalogenated species and putative conjugated and non-conjugated metabolites in a specific region. Incorporating ISE results into the chemical formula prediction significantly reduced the number of candidates, improving annotation efficiency. <em>In utero</em> exposure to xenobiotics was supported by the detection of known exposure markers such as acetaminophen, caffeine and nicotine. These results demonstrate the method's potential to uncover exposomic signals in complex biological matrices.</div></div><div><h3>Significance</h3><div>This study presents a novel data mining strategy that reduces the complexity of untargeted LC-HRMS data by retaining chemically reliable features based on isotopic signatures. As a proof of concept, this strategy enables the detection of specific chemical signatures and exogenous compounds without prior knowledge. Its adaptability to various biological matrices and its compatibility with different high-resolution mass spectrometry platforms make this strategy a valuable tool for exposome research and early-life exposure assessment.</div></div>\",\"PeriodicalId\":240,\"journal\":{\"name\":\"Analytica Chimica Acta\",\"volume\":\"1379 \",\"pages\":\"Article 344751\"},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2025-10-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Analytica Chimica Acta\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0003267025011456\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytica Chimica Acta","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0003267025011456","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
An untargeted data mining strategy for extracting chemical exposome signatures from LC-HRMS data: Application to meconium for early-life exposure assessment
Background
Exposome research has expanded rapidly in recent years, driven by advances in analytical techniques such as liquid chromatography-high-resolution mass spectrometry (LC-HRMS), which enable broad and sensitive chemical coverage. Targeted methods focus on known compounds, while untargeted metabolomic approaches provide a more holistic view and may reveal exposure biomarkers, but they are not specifically designed to detect exogenous chemicals. Identifying relevant exposure markers within the vast and complex datasets generated by untargeted LC-HRMS data remains a significant analytical and computational challenge, requiring innovative data mining strategies.
Results
We developed a novel untargeted data mining strategy to extract exogenous chemical signatures from complex LC-HRMS datasets. The approach integrates isotopic signature enrichment (ISE), biotransformation-informed feature selection and an “exposure rate” metric. When applied to meconium data from the EDEN cohort, the strategy led to a six-fold reduction in the number of features by retaining only those exhibiting valid carbon isotope patterns. Mass defect plots revealed signatures of suspect monohalogenated species and putative conjugated and non-conjugated metabolites in a specific region. Incorporating ISE results into the chemical formula prediction significantly reduced the number of candidates, improving annotation efficiency. In utero exposure to xenobiotics was supported by the detection of known exposure markers such as acetaminophen, caffeine and nicotine. These results demonstrate the method's potential to uncover exposomic signals in complex biological matrices.
Significance
This study presents a novel data mining strategy that reduces the complexity of untargeted LC-HRMS data by retaining chemically reliable features based on isotopic signatures. As a proof of concept, this strategy enables the detection of specific chemical signatures and exogenous compounds without prior knowledge. Its adaptability to various biological matrices and its compatibility with different high-resolution mass spectrometry platforms make this strategy a valuable tool for exposome research and early-life exposure assessment.
期刊介绍:
Analytica Chimica Acta has an open access mirror journal Analytica Chimica Acta: X, sharing the same aims and scope, editorial team, submission system and rigorous peer review.
Analytica Chimica Acta provides a forum for the rapid publication of original research, and critical, comprehensive reviews dealing with all aspects of fundamental and applied modern analytical chemistry. The journal welcomes the submission of research papers which report studies concerning the development of new and significant analytical methodologies. In determining the suitability of submitted articles for publication, particular scrutiny will be placed on the degree of novelty and impact of the research and the extent to which it adds to the existing body of knowledge in analytical chemistry.