BatchEval Pipeline：用于多个数据集联合分析的批量效应评估工作流程。

IF 1.2

GigaByte (Hong Kong, China) Pub Date : 2024-02-20 eCollection Date: 2024-01-01 DOI:10.46471/gigabyte.108

Chao Zhang, Qiang Kang, Mei Li, Hongqing Xie, Shuangsang Fang, Xun Xu

{"title":"BatchEval Pipeline：用于多个数据集联合分析的批量效应评估工作流程。","authors":"Chao Zhang, Qiang Kang, Mei Li, Hongqing Xie, Shuangsang Fang, Xun Xu","doi":"10.46471/gigabyte.108","DOIUrl":null,"url":null,"abstract":"As genomic sequencing technology continues to advance, it becomes increasingly important to perform joint analyses of multiple datasets of transcriptomics. However, batch effect presents challenges for dataset integration, such as sequencing data measured on different platforms, and datasets collected at different times. Here, we report the development of BatchEval Pipeline, a batch effect workflow used to evaluate batch effect on dataset integration. The BatchEval Pipeline generates a comprehensive report, which consists of a series of HTML pages for assessment findings, including a main page, a raw dataset evaluation page, and several built-in methods evaluation pages. The main page exhibits basic information of the integrated datasets, a comprehensive score of batch effect, and the most recommended method for removing batch effect from the current datasets. The remaining pages exhibit evaluation details for the raw dataset, and evaluation results from the built-in batch effect removal methods after removing batch effect. This comprehensive report enables researchers to accurately identify and remove batch effects, resulting in more reliable and meaningful biological insights from integrated datasets. In summary, the BatchEval Pipeline represents a significant advancement in batch effect evaluation, and is a valuable tool to improve the accuracy and reliability of the experimental results.Availability & implementation: The source code of the BatchEval Pipeline is available at https://github.com/STOmics/BatchEval.","PeriodicalId":73157,"journal":{"name":"GigaByte (Hong Kong, China)","volume":"2024 ","pages":"gigabyte108"},"PeriodicalIF":1.2000,"publicationDate":"2024-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10905258/pdf/","citationCount":"0","resultStr":"{\"title\":\"BatchEval Pipeline: batch effect evaluation workflow for multiple datasets joint analysis.\",\"authors\":\"Chao Zhang, Qiang Kang, Mei Li, Hongqing Xie, Shuangsang Fang, Xun Xu\",\"doi\":\"10.46471/gigabyte.108\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As genomic sequencing technology continues to advance, it becomes increasingly important to perform joint analyses of multiple datasets of transcriptomics. However, batch effect presents challenges for dataset integration, such as sequencing data measured on different platforms, and datasets collected at different times. Here, we report the development of BatchEval Pipeline, a batch effect workflow used to evaluate batch effect on dataset integration. The BatchEval Pipeline generates a comprehensive report, which consists of a series of HTML pages for assessment findings, including a main page, a raw dataset evaluation page, and several built-in methods evaluation pages. The main page exhibits basic information of the integrated datasets, a comprehensive score of batch effect, and the most recommended method for removing batch effect from the current datasets. The remaining pages exhibit evaluation details for the raw dataset, and evaluation results from the built-in batch effect removal methods after removing batch effect. This comprehensive report enables researchers to accurately identify and remove batch effects, resulting in more reliable and meaningful biological insights from integrated datasets. In summary, the BatchEval Pipeline represents a significant advancement in batch effect evaluation, and is a valuable tool to improve the accuracy and reliability of the experimental results.Availability & implementation: The source code of the BatchEval Pipeline is available at https://github.com/STOmics/BatchEval.\",\"PeriodicalId\":73157,\"journal\":{\"name\":\"GigaByte (Hong Kong, China)\",\"volume\":\"2024 \",\"pages\":\"gigabyte108\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2024-02-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10905258/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"GigaByte (Hong Kong, China)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.46471/gigabyte.108\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"GigaByte (Hong Kong, China)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46471/gigabyte.108","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

随着基因组测序技术的不断进步，对多个转录组学数据集进行联合分析变得越来越重要。然而，批次效应给数据集整合带来了挑战，例如在不同平台测量的测序数据和在不同时间采集的数据集。在此，我们报告了 BatchEval Pipeline 的开发情况，这是一个批次效应工作流，用于评估数据集整合的批次效应。BatchEval Pipeline 生成的综合报告由一系列用于评估结果的 HTML 页面组成，包括一个主页面、一个原始数据集评估页面和几个内置方法评估页面。主页面展示了集成数据集的基本信息、批量效应的综合评分，以及从当前数据集中去除批量效应的最推荐方法。其余页面展示了原始数据集的评估详情，以及内置批量效应去除方法在去除批量效应后的评估结果。这份全面的报告能帮助研究人员准确识别和去除批次效应，从而从集成数据集中获得更可靠、更有意义的生物学见解。总之，BatchEval 管道代表了批次效应评估的重大进步，是提高实验结果准确性和可靠性的重要工具：BatchEval Pipeline 的源代码可从 https://github.com/STOmics/BatchEval 获取。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

BatchEval Pipeline: batch effect evaluation workflow for multiple datasets joint analysis.

查看原文本刊更多论文

BatchEval Pipeline: batch effect evaluation workflow for multiple datasets joint analysis.

As genomic sequencing technology continues to advance, it becomes increasingly important to perform joint analyses of multiple datasets of transcriptomics. However, batch effect presents challenges for dataset integration, such as sequencing data measured on different platforms, and datasets collected at different times. Here, we report the development of BatchEval Pipeline, a batch effect workflow used to evaluate batch effect on dataset integration. The BatchEval Pipeline generates a comprehensive report, which consists of a series of HTML pages for assessment findings, including a main page, a raw dataset evaluation page, and several built-in methods evaluation pages. The main page exhibits basic information of the integrated datasets, a comprehensive score of batch effect, and the most recommended method for removing batch effect from the current datasets. The remaining pages exhibit evaluation details for the raw dataset, and evaluation results from the built-in batch effect removal methods after removing batch effect. This comprehensive report enables researchers to accurately identify and remove batch effects, resulting in more reliable and meaningful biological insights from integrated datasets. In summary, the BatchEval Pipeline represents a significant advancement in batch effect evaluation, and is a valuable tool to improve the accuracy and reliability of the experimental results.

Availability & implementation: The source code of the BatchEval Pipeline is available at https://github.com/STOmics/BatchEval.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊