Elias George Balimponya, Maria Stefanie Dwiyanti, Koichi Yamamori, Shuntaro Sakaguchi, Yoshitaka Kanaoka, Yohei Koide, Yuji Kishima
{"title":"杂合snp与水稻基因组数据的准确检测及新生自发突变率的预测。","authors":"Elias George Balimponya, Maria Stefanie Dwiyanti, Koichi Yamamori, Shuntaro Sakaguchi, Yoshitaka Kanaoka, Yohei Koide, Yuji Kishima","doi":"10.1186/s13007-025-01437-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The use of Illumina sequencing technologies has enabled the identification and removal of mutations in various plant species. However, the Illumina sequencing method requires a considerable amount of data to ensure its integrity and quality due to the enormous number of false positives. This study aimed to explore an effective genomic data analysis for the detection of heterozygous variant (HV) in rice varieties.</p><p><strong>Results: </strong>We compared the accuracy of four combinations of mapping tools and variant calling pipelines and selected BWA-MEM2 with GATK4.3 HaplotypeCaller. To detect heterozygous de novo polymorphisms such as HVs in the three different rice varieties (Nipponbare, Kitaake, and Hinohikari), we adopted the following cost-saving procedures; secondary references were created in Nipponbare and Kitaake, and generation-based comparison was performed in Hinohikari. The similar HVs were estimated by the three varieties to range from 2.55814 × 10<sup>-8</sup> to 4.41860 × 10<sup>-8</sup>, with an average of 3.10278 × 10<sup>-8</sup> per nucleotide in a single rice plant, a rate consistent with observations in other organisms. Of 107 HVs identified in all eight plant samples, nine were found to be non-synonymous, resulting in an average of one non-synonymous HV per plant in a single generation.</p><p><strong>Conclusions: </strong>We have developed a methodology for the detection of true positive HVs within Illumina sequencing techniques. This system removed false positive HVs, allowing for the estimation of true positive HVs and, consequently, the estimation of the mutation rate. The study outlines a clear, step-by-step procedure that can be employed to detect true HVs in different organisms.</p>","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":"21 1","pages":"125"},"PeriodicalIF":4.4000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12495679/pdf/","citationCount":"0","resultStr":"{\"title\":\"Accurate detections of the heterozygous SNPs with rice genomic data and prediction of de novo spontaneous mutation rate.\",\"authors\":\"Elias George Balimponya, Maria Stefanie Dwiyanti, Koichi Yamamori, Shuntaro Sakaguchi, Yoshitaka Kanaoka, Yohei Koide, Yuji Kishima\",\"doi\":\"10.1186/s13007-025-01437-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The use of Illumina sequencing technologies has enabled the identification and removal of mutations in various plant species. However, the Illumina sequencing method requires a considerable amount of data to ensure its integrity and quality due to the enormous number of false positives. This study aimed to explore an effective genomic data analysis for the detection of heterozygous variant (HV) in rice varieties.</p><p><strong>Results: </strong>We compared the accuracy of four combinations of mapping tools and variant calling pipelines and selected BWA-MEM2 with GATK4.3 HaplotypeCaller. To detect heterozygous de novo polymorphisms such as HVs in the three different rice varieties (Nipponbare, Kitaake, and Hinohikari), we adopted the following cost-saving procedures; secondary references were created in Nipponbare and Kitaake, and generation-based comparison was performed in Hinohikari. The similar HVs were estimated by the three varieties to range from 2.55814 × 10<sup>-8</sup> to 4.41860 × 10<sup>-8</sup>, with an average of 3.10278 × 10<sup>-8</sup> per nucleotide in a single rice plant, a rate consistent with observations in other organisms. Of 107 HVs identified in all eight plant samples, nine were found to be non-synonymous, resulting in an average of one non-synonymous HV per plant in a single generation.</p><p><strong>Conclusions: </strong>We have developed a methodology for the detection of true positive HVs within Illumina sequencing techniques. This system removed false positive HVs, allowing for the estimation of true positive HVs and, consequently, the estimation of the mutation rate. The study outlines a clear, step-by-step procedure that can be employed to detect true HVs in different organisms.</p>\",\"PeriodicalId\":20100,\"journal\":{\"name\":\"Plant Methods\",\"volume\":\"21 1\",\"pages\":\"125\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-10-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12495679/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Plant Methods\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s13007-025-01437-x\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Methods","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s13007-025-01437-x","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
Accurate detections of the heterozygous SNPs with rice genomic data and prediction of de novo spontaneous mutation rate.
Background: The use of Illumina sequencing technologies has enabled the identification and removal of mutations in various plant species. However, the Illumina sequencing method requires a considerable amount of data to ensure its integrity and quality due to the enormous number of false positives. This study aimed to explore an effective genomic data analysis for the detection of heterozygous variant (HV) in rice varieties.
Results: We compared the accuracy of four combinations of mapping tools and variant calling pipelines and selected BWA-MEM2 with GATK4.3 HaplotypeCaller. To detect heterozygous de novo polymorphisms such as HVs in the three different rice varieties (Nipponbare, Kitaake, and Hinohikari), we adopted the following cost-saving procedures; secondary references were created in Nipponbare and Kitaake, and generation-based comparison was performed in Hinohikari. The similar HVs were estimated by the three varieties to range from 2.55814 × 10-8 to 4.41860 × 10-8, with an average of 3.10278 × 10-8 per nucleotide in a single rice plant, a rate consistent with observations in other organisms. Of 107 HVs identified in all eight plant samples, nine were found to be non-synonymous, resulting in an average of one non-synonymous HV per plant in a single generation.
Conclusions: We have developed a methodology for the detection of true positive HVs within Illumina sequencing techniques. This system removed false positive HVs, allowing for the estimation of true positive HVs and, consequently, the estimation of the mutation rate. The study outlines a clear, step-by-step procedure that can be employed to detect true HVs in different organisms.
期刊介绍:
Plant Methods is an open access, peer-reviewed, online journal for the plant research community that encompasses all aspects of technological innovation in the plant sciences.
There is no doubt that we have entered an exciting new era in plant biology. The completion of the Arabidopsis genome sequence, and the rapid progress being made in other plant genomics projects are providing unparalleled opportunities for progress in all areas of plant science. Nevertheless, enormous challenges lie ahead if we are to understand the function of every gene in the genome, and how the individual parts work together to make the whole organism. Achieving these goals will require an unprecedented collaborative effort, combining high-throughput, system-wide technologies with more focused approaches that integrate traditional disciplines such as cell biology, biochemistry and molecular genetics.
Technological innovation is probably the most important catalyst for progress in any scientific discipline. Plant Methods’ goal is to stimulate the development and adoption of new and improved techniques and research tools and, where appropriate, to promote consistency of methodologies for better integration of data from different laboratories.