Victor C.L. Lee , Khanh C.K. Nguyen , Linglan Zhu , Chloe A.K. White , Yong Jin Lim , Tao Huan , Thomas J. Velenosi
{"title":"DIATAGeR: Triacylglycerol annotation of data-independent acquisition based lipidomics","authors":"Victor C.L. Lee , Khanh C.K. Nguyen , Linglan Zhu , Chloe A.K. White , Yong Jin Lim , Tao Huan , Thomas J. Velenosi","doi":"10.1016/j.aca.2025.344698","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Triacylglycerols (TGs) are the most abundant lipids in the human body and the primary source of energy storage. TGs are comprised of three fatty acyls with various lengths and double bond composition, complicating structural annotation when performing lipidomics by LCMS. Data-independent acquisition (DIA) based lipidomics enables a continuous and unbiased acquisition of all TGs, creating the potential for more comprehensive TG analysis. <strong>However, TG identification in DIA lipidomics data is challenging due to the difficulty analyzing multiplexed tandem mass spectra (MS2).</strong></div></div><div><h3>Results</h3><div>In this study, we present DIATAGeR, an R package aimed to improve and automate TG identifications to the molecular species level in DIA-based lipidomics. With DIATAGeR, TGs are identified using a TG-centric approach, where each TG in the reference database is considered as an analysis target, searched in DIA spectra, and scored using a logistic regression machine learning algorithm. Additionally, DIATAGeR uses a false discovery rate (FDR) correction calculated by a target-decoy approach to improve the confidence of TG identification and limit false positives due to interference from unrelated ions. The performance of DIATAGeR was validated in a lipidomic study of liver and plasma samples from mice with metabolic dysfunction-associated steatohepatitis (MASH) and healthy controls. All 9 TG standards were annotated at an FDR <0.1 in both datasets. When benchmarked against MS-DIAL, TGs identified by DIATAGeR contained 18 % and 12 % more even-carbon fatty acyls in liver and plasma datasets, respectively.</div></div><div><h3>Significance</h3><div>DIATAGeR is a valuable tool for streamlining complex TG annotation in DIA-lipidomics data. It supports vendor-neutral MS spectra data formats and offers a customizable reference database. By combining TG-centric and target-decoy approaches, DIATAGeR showed improvements in TG identification by addressing primary challenges associated with multiplexed MS2 spectra. DIATAGeR is freely available at <span><span>https://github.com/Velenosi-Lab/DIATAGeR</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":240,"journal":{"name":"Analytica Chimica Acta","volume":"1378 ","pages":"Article 344698"},"PeriodicalIF":6.0000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytica Chimica Acta","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S000326702501092X","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Triacylglycerols (TGs) are the most abundant lipids in the human body and the primary source of energy storage. TGs are comprised of three fatty acyls with various lengths and double bond composition, complicating structural annotation when performing lipidomics by LCMS. Data-independent acquisition (DIA) based lipidomics enables a continuous and unbiased acquisition of all TGs, creating the potential for more comprehensive TG analysis. However, TG identification in DIA lipidomics data is challenging due to the difficulty analyzing multiplexed tandem mass spectra (MS2).
Results
In this study, we present DIATAGeR, an R package aimed to improve and automate TG identifications to the molecular species level in DIA-based lipidomics. With DIATAGeR, TGs are identified using a TG-centric approach, where each TG in the reference database is considered as an analysis target, searched in DIA spectra, and scored using a logistic regression machine learning algorithm. Additionally, DIATAGeR uses a false discovery rate (FDR) correction calculated by a target-decoy approach to improve the confidence of TG identification and limit false positives due to interference from unrelated ions. The performance of DIATAGeR was validated in a lipidomic study of liver and plasma samples from mice with metabolic dysfunction-associated steatohepatitis (MASH) and healthy controls. All 9 TG standards were annotated at an FDR <0.1 in both datasets. When benchmarked against MS-DIAL, TGs identified by DIATAGeR contained 18 % and 12 % more even-carbon fatty acyls in liver and plasma datasets, respectively.
Significance
DIATAGeR is a valuable tool for streamlining complex TG annotation in DIA-lipidomics data. It supports vendor-neutral MS spectra data formats and offers a customizable reference database. By combining TG-centric and target-decoy approaches, DIATAGeR showed improvements in TG identification by addressing primary challenges associated with multiplexed MS2 spectra. DIATAGeR is freely available at https://github.com/Velenosi-Lab/DIATAGeR.
期刊介绍:
Analytica Chimica Acta has an open access mirror journal Analytica Chimica Acta: X, sharing the same aims and scope, editorial team, submission system and rigorous peer review.
Analytica Chimica Acta provides a forum for the rapid publication of original research, and critical, comprehensive reviews dealing with all aspects of fundamental and applied modern analytical chemistry. The journal welcomes the submission of research papers which report studies concerning the development of new and significant analytical methodologies. In determining the suitability of submitted articles for publication, particular scrutiny will be placed on the degree of novelty and impact of the research and the extent to which it adds to the existing body of knowledge in analytical chemistry.