Kevin Mildau, Henry Ehlers, Mara Meisenburg, Elena Del Pup, Robert A Koetsier, Laura Rosina Torres Ortega, Niek F de Jonge, Kumar Saurabh Singh, Dora Ferreira, Kgalaletso Othibeng, Fidele Tugizimana, Florian Huber, Justin J J van der Hooft
{"title":"Effective data visualization strategies in untargeted metabolomics.","authors":"Kevin Mildau, Henry Ehlers, Mara Meisenburg, Elena Del Pup, Robert A Koetsier, Laura Rosina Torres Ortega, Niek F de Jonge, Kumar Saurabh Singh, Dora Ferreira, Kgalaletso Othibeng, Fidele Tugizimana, Florian Huber, Justin J J van der Hooft","doi":"10.1039/d4np00039k","DOIUrl":null,"url":null,"abstract":"<p><p>Covering: 2014 to 2023 for metabolomics, 2002 to 2023 for information visualizationLC-MS/MS-based untargeted metabolomics is a rapidly developing research field spawning increasing numbers of computational metabolomics tools assisting researchers with their complex data processing, analysis, and interpretation tasks. In this article, we review the entire untargeted metabolomics workflow from the perspective of information visualization, visual analytics and visual data integration. Data visualization is a crucial step at every stage of the metabolomics workflow, where it provides core components of data inspection, evaluation, and sharing capabilities. However, due to the large number of available data analysis tools and corresponding visualization components, it is hard for both users and developers to get an overview of what is already available and which tools are suitable for their analysis. In addition, there is little cross-pollination between the fields of data visualization and metabolomics, leaving visual tools to be designed in a secondary and mostly <i>ad hoc</i> fashion. With this review, we aim to bridge the gap between the fields of untargeted metabolomics and data visualization. First, we introduce data visualization to the untargeted metabolomics field as a topic worthy of its own dedicated research, and provide a primer on cutting-edge visualization research into data visualization for both researchers as well as developers active in metabolomics. We extend this primer with a discussion of best practices for data visualization as they have emerged from data visualization studies. Second, we provide a practical roadmap to the visual tool landscape and its use within the untargeted metabolomics field. Here, for several computational analysis stages within the untargeted metabolomics workflow, we provide an overview of commonly used visual strategies with practical examples. In this context, we will also outline promising areas for further research and development. We end the review with a set of recommendations for developers and users on how to make the best use of visualizations for more effective and transparent communication of results.</p>","PeriodicalId":94,"journal":{"name":"Natural Product Reports","volume":" ","pages":""},"PeriodicalIF":10.2000,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11610048/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Product Reports","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1039/d4np00039k","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Covering: 2014 to 2023 for metabolomics, 2002 to 2023 for information visualizationLC-MS/MS-based untargeted metabolomics is a rapidly developing research field spawning increasing numbers of computational metabolomics tools assisting researchers with their complex data processing, analysis, and interpretation tasks. In this article, we review the entire untargeted metabolomics workflow from the perspective of information visualization, visual analytics and visual data integration. Data visualization is a crucial step at every stage of the metabolomics workflow, where it provides core components of data inspection, evaluation, and sharing capabilities. However, due to the large number of available data analysis tools and corresponding visualization components, it is hard for both users and developers to get an overview of what is already available and which tools are suitable for their analysis. In addition, there is little cross-pollination between the fields of data visualization and metabolomics, leaving visual tools to be designed in a secondary and mostly ad hoc fashion. With this review, we aim to bridge the gap between the fields of untargeted metabolomics and data visualization. First, we introduce data visualization to the untargeted metabolomics field as a topic worthy of its own dedicated research, and provide a primer on cutting-edge visualization research into data visualization for both researchers as well as developers active in metabolomics. We extend this primer with a discussion of best practices for data visualization as they have emerged from data visualization studies. Second, we provide a practical roadmap to the visual tool landscape and its use within the untargeted metabolomics field. Here, for several computational analysis stages within the untargeted metabolomics workflow, we provide an overview of commonly used visual strategies with practical examples. In this context, we will also outline promising areas for further research and development. We end the review with a set of recommendations for developers and users on how to make the best use of visualizations for more effective and transparent communication of results.
基于lc - ms / ms的非靶向代谢组学是一个快速发展的研究领域,产生了越来越多的计算代谢组学工具,帮助研究人员完成复杂的数据处理、分析和解释任务。在本文中,我们从信息可视化、可视化分析和可视化数据集成的角度回顾了整个非靶向代谢组学工作流程。数据可视化在代谢组学工作流程的每个阶段都是至关重要的一步,它提供了数据检查、评估和共享功能的核心组件。然而,由于有大量可用的数据分析工具和相应的可视化组件,用户和开发人员都很难对已有的可用工具和适合其分析的工具进行概述。此外,数据可视化和代谢组学领域之间几乎没有交叉授粉,使得可视化工具以次要的和主要是临时的方式设计。通过这篇综述,我们旨在弥合非靶向代谢组学和数据可视化领域之间的差距。首先,我们将数据可视化作为一个值得专门研究的主题引入非靶向代谢组学领域,并为研究人员和活跃于代谢组学的开发人员提供数据可视化的前沿可视化研究入门。我们通过讨论从数据可视化研究中出现的数据可视化最佳实践来扩展本入门。其次,我们为可视化工具景观及其在非靶向代谢组学领域的使用提供了一个实用的路线图。在这里,对于非靶向代谢组学工作流程中的几个计算分析阶段,我们通过实际示例概述了常用的视觉策略。在此背景下,我们还将概述有希望进一步研究和发展的领域。最后,我们为开发人员和用户提供了一组关于如何最好地利用可视化来更有效和透明地交流结果的建议。
期刊介绍:
Natural Product Reports (NPR) serves as a pivotal critical review journal propelling advancements in all facets of natural products research, encompassing isolation, structural and stereochemical determination, biosynthesis, biological activity, and synthesis.
With a broad scope, NPR extends its influence into the wider bioinorganic, bioorganic, and chemical biology communities. Covering areas such as enzymology, nucleic acids, genetics, chemical ecology, carbohydrates, primary and secondary metabolism, and analytical techniques, the journal provides insightful articles focusing on key developments shaping the field, rather than offering exhaustive overviews of all results.
NPR encourages authors to infuse their perspectives on developments, trends, and future directions, fostering a dynamic exchange of ideas within the natural products research community.