Tito Damiani, Alan K Jarmusch, Allegra T Aron, Daniel Petras, Vanessa V Phelan, Haoqi Nina Zhao, Wout Bittremieux, Deepa D Acharya, Mohammed M A Ahmed, Anelize Bauermeister, Matthew J Bertin, Paul D Boudreau, Ricardo M Borges, Benjamin P Bowen, Christopher J Brown, Fernanda O Chagas, Kenneth D Clevenger, Mario S P Correia, William J Crandall, Max Crüsemann, Eoin Fahy, Oliver Fiehn, Neha Garg, William H Gerwick, Jeffrey R Gilbert, Daniel Globisch, Paulo Wender P Gomes, Steffen Heuckeroth, C Andrew James, Scott A Jarmusch, Sarvar A Kakhkhorov, Kyo Bin Kang, Nikolas Kessler, Roland D Kersten, Hyunwoo Kim, Riley D Kirk, Oliver Kohlbacher, Eftychia E Kontou, Ken Liu, Itzel Lizama-Chamu, Gordon T Luu, Tal Luzzatto Knaan, Helena Mannochio-Russo, Michael T Marty, Yuki Matsuzawa, Andrew C McAvoy, Laura-Isobel McCall, Osama G Mohamed, Omri Nahor, Heiko Neuweger, Timo H J Niedermeyer, Kozo Nishida, Trent R Northen, Kirsten E Overdahl, Johannes Rainer, Raphael Reher, Elys Rodriguez, Timo T Sachsenberg, Laura M Sanchez, Robin Schmid, Cole Stevens, Shankar Subramaniam, Zhenyu Tian, Ashootosh Tripathi, Hiroshi Tsugawa, Justin J J van der Hooft, Andrea Vicini, Axel Walter, Tilmann Weber, Quanbo Xiong, Tao Xu, Tomáš Pluskal, Pieter C Dorrestein, Mingxun Wang
{"title":"A universal language for finding mass spectrometry data patterns.","authors":"Tito Damiani, Alan K Jarmusch, Allegra T Aron, Daniel Petras, Vanessa V Phelan, Haoqi Nina Zhao, Wout Bittremieux, Deepa D Acharya, Mohammed M A Ahmed, Anelize Bauermeister, Matthew J Bertin, Paul D Boudreau, Ricardo M Borges, Benjamin P Bowen, Christopher J Brown, Fernanda O Chagas, Kenneth D Clevenger, Mario S P Correia, William J Crandall, Max Crüsemann, Eoin Fahy, Oliver Fiehn, Neha Garg, William H Gerwick, Jeffrey R Gilbert, Daniel Globisch, Paulo Wender P Gomes, Steffen Heuckeroth, C Andrew James, Scott A Jarmusch, Sarvar A Kakhkhorov, Kyo Bin Kang, Nikolas Kessler, Roland D Kersten, Hyunwoo Kim, Riley D Kirk, Oliver Kohlbacher, Eftychia E Kontou, Ken Liu, Itzel Lizama-Chamu, Gordon T Luu, Tal Luzzatto Knaan, Helena Mannochio-Russo, Michael T Marty, Yuki Matsuzawa, Andrew C McAvoy, Laura-Isobel McCall, Osama G Mohamed, Omri Nahor, Heiko Neuweger, Timo H J Niedermeyer, Kozo Nishida, Trent R Northen, Kirsten E Overdahl, Johannes Rainer, Raphael Reher, Elys Rodriguez, Timo T Sachsenberg, Laura M Sanchez, Robin Schmid, Cole Stevens, Shankar Subramaniam, Zhenyu Tian, Ashootosh Tripathi, Hiroshi Tsugawa, Justin J J van der Hooft, Andrea Vicini, Axel Walter, Tilmann Weber, Quanbo Xiong, Tao Xu, Tomáš Pluskal, Pieter C Dorrestein, Mingxun Wang","doi":"10.1038/s41592-025-02660-z","DOIUrl":null,"url":null,"abstract":"<p><p>Despite being information rich, the vast majority of untargeted mass spectrometry data are underutilized; most analytes are not used for downstream interpretation or reanalysis after publication. The inability to dive into these rich raw mass spectrometry datasets is due to the limited flexibility and scalability of existing software tools. Here we introduce a new language, the Mass Spectrometry Query Language (MassQL), and an accompanying software ecosystem that addresses these issues by enabling the community to directly query mass spectrometry data with an expressive set of user-defined mass spectrometry patterns. Illustrated by real-world examples, MassQL provides a data-driven definition of chemical diversity by enabling the reanalysis of all public untargeted metabolomics data, empowering scientists across many disciplines to make new discoveries. MassQL has been widely implemented in multiple open-source and commercial mass spectrometry analysis tools, which enhances the ability, interoperability and reproducibility of mining of mass spectrometry data for the research community.</p>","PeriodicalId":18981,"journal":{"name":"Nature Methods","volume":" ","pages":""},"PeriodicalIF":36.1000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Methods","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s41592-025-02660-z","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Despite being information rich, the vast majority of untargeted mass spectrometry data are underutilized; most analytes are not used for downstream interpretation or reanalysis after publication. The inability to dive into these rich raw mass spectrometry datasets is due to the limited flexibility and scalability of existing software tools. Here we introduce a new language, the Mass Spectrometry Query Language (MassQL), and an accompanying software ecosystem that addresses these issues by enabling the community to directly query mass spectrometry data with an expressive set of user-defined mass spectrometry patterns. Illustrated by real-world examples, MassQL provides a data-driven definition of chemical diversity by enabling the reanalysis of all public untargeted metabolomics data, empowering scientists across many disciplines to make new discoveries. MassQL has been widely implemented in multiple open-source and commercial mass spectrometry analysis tools, which enhances the ability, interoperability and reproducibility of mining of mass spectrometry data for the research community.
期刊介绍:
Nature Methods is a monthly journal that focuses on publishing innovative methods and substantial enhancements to fundamental life sciences research techniques. Geared towards a diverse, interdisciplinary readership of researchers in academia and industry engaged in laboratory work, the journal offers new tools for research and emphasizes the immediate practical significance of the featured work. It publishes primary research papers and reviews recent technical and methodological advancements, with a particular interest in primary methods papers relevant to the biological and biomedical sciences. This includes methods rooted in chemistry with practical applications for studying biological problems.