Zachary Whitfield, Przemyslaw Kiljan, Monika Krzyzanowska, Daniel Marbach, Vincent Lavergne, Roland Schmucki, Thomas Lorber, Ana Teixeira, Anna Rautanen, Annegret Fischer, Iryna Dekhtiarenko, Franziska Braun, Nicolas Staedler, Guillemette Duchateau-Nguyen, Moritz Gilsdorf, Jehad Charo, Lena Voith von Voithenberg
{"title":"检测多组学新一代测序数据中的个体间污染和错配。","authors":"Zachary Whitfield, Przemyslaw Kiljan, Monika Krzyzanowska, Daniel Marbach, Vincent Lavergne, Roland Schmucki, Thomas Lorber, Ana Teixeira, Anna Rautanen, Annegret Fischer, Iryna Dekhtiarenko, Franziska Braun, Nicolas Staedler, Guillemette Duchateau-Nguyen, Moritz Gilsdorf, Jehad Charo, Lena Voith von Voithenberg","doi":"10.3791/69428","DOIUrl":null,"url":null,"abstract":"<p><p>High-throughput processing of patient biosamples by next-generation sequencing and the comparison of molecular data with patient-level and sample-level clinical data require precise tracking and matching of sample identifiers throughout the biospecimen chain of custody and are critical to enabling robust interpretation of biomarker trial results. In addition to tracing individual steps in the sample and data processing workflows, bioinformatics solutions can be used to confirm that samples originate from the same patient. Here, the use of a bioinformatics workflow to identify matched samples originating from the same individual is showcased. The analysis workflow is suitable for any two or more pairs of NGS datasets to be compared and verified for patient sample origin. A scoring algorithm based on genome-wide comparisons of samples enables the user to determine whether two samples stem from the same individual. Specifically, single-nucleotide polymorphisms (SNPs) within selected linkage disequilibrium blocks are used to identify and compare samples. Threshold combinations for permissive and stringent selection of matched and mismatched samples were identified. The utility of this protocol was demonstrated through its application to the quality control and validation of clinical tumor tissue and blood samples, encompassing multiple omics modalities from over 2,000 patients.</p>","PeriodicalId":48787,"journal":{"name":"Jove-Journal of Visualized Experiments","volume":" 230","pages":""},"PeriodicalIF":1.2000,"publicationDate":"2026-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detecting Inter-Individual Contamination and Mismatches in Multiomics Next-Generation Sequencing Data.\",\"authors\":\"Zachary Whitfield, Przemyslaw Kiljan, Monika Krzyzanowska, Daniel Marbach, Vincent Lavergne, Roland Schmucki, Thomas Lorber, Ana Teixeira, Anna Rautanen, Annegret Fischer, Iryna Dekhtiarenko, Franziska Braun, Nicolas Staedler, Guillemette Duchateau-Nguyen, Moritz Gilsdorf, Jehad Charo, Lena Voith von Voithenberg\",\"doi\":\"10.3791/69428\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>High-throughput processing of patient biosamples by next-generation sequencing and the comparison of molecular data with patient-level and sample-level clinical data require precise tracking and matching of sample identifiers throughout the biospecimen chain of custody and are critical to enabling robust interpretation of biomarker trial results. In addition to tracing individual steps in the sample and data processing workflows, bioinformatics solutions can be used to confirm that samples originate from the same patient. Here, the use of a bioinformatics workflow to identify matched samples originating from the same individual is showcased. The analysis workflow is suitable for any two or more pairs of NGS datasets to be compared and verified for patient sample origin. A scoring algorithm based on genome-wide comparisons of samples enables the user to determine whether two samples stem from the same individual. Specifically, single-nucleotide polymorphisms (SNPs) within selected linkage disequilibrium blocks are used to identify and compare samples. Threshold combinations for permissive and stringent selection of matched and mismatched samples were identified. The utility of this protocol was demonstrated through its application to the quality control and validation of clinical tumor tissue and blood samples, encompassing multiple omics modalities from over 2,000 patients.</p>\",\"PeriodicalId\":48787,\"journal\":{\"name\":\"Jove-Journal of Visualized Experiments\",\"volume\":\" 230\",\"pages\":\"\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2026-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Jove-Journal of Visualized Experiments\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://doi.org/10.3791/69428\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jove-Journal of Visualized Experiments","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.3791/69428","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Detecting Inter-Individual Contamination and Mismatches in Multiomics Next-Generation Sequencing Data.
High-throughput processing of patient biosamples by next-generation sequencing and the comparison of molecular data with patient-level and sample-level clinical data require precise tracking and matching of sample identifiers throughout the biospecimen chain of custody and are critical to enabling robust interpretation of biomarker trial results. In addition to tracing individual steps in the sample and data processing workflows, bioinformatics solutions can be used to confirm that samples originate from the same patient. Here, the use of a bioinformatics workflow to identify matched samples originating from the same individual is showcased. The analysis workflow is suitable for any two or more pairs of NGS datasets to be compared and verified for patient sample origin. A scoring algorithm based on genome-wide comparisons of samples enables the user to determine whether two samples stem from the same individual. Specifically, single-nucleotide polymorphisms (SNPs) within selected linkage disequilibrium blocks are used to identify and compare samples. Threshold combinations for permissive and stringent selection of matched and mismatched samples were identified. The utility of this protocol was demonstrated through its application to the quality control and validation of clinical tumor tissue and blood samples, encompassing multiple omics modalities from over 2,000 patients.
期刊介绍:
JoVE, the Journal of Visualized Experiments, is the world''s first peer reviewed scientific video journal. Established in 2006, JoVE is devoted to publishing scientific research in a visual format to help researchers overcome two of the biggest challenges facing the scientific research community today; poor reproducibility and the time and labor intensive nature of learning new experimental techniques.