Realistic Scenarios of Phenotypic Variation and Errors in High-Throughput Phenotyping Experiments Minimally Impact the Results of Quantitative Trait Locus Mapping Analysis.
{"title":"Realistic Scenarios of Phenotypic Variation and Errors in High-Throughput Phenotyping Experiments Minimally Impact the Results of Quantitative Trait Locus Mapping Analysis.","authors":"Aliyah Brewer, Anna Underhill, Surya Sapkota, Chin-Feng Hwang, Summaira Riaz, Madeline Oravec, Lance Cadle-Davidson","doi":"10.1094/PHYTO-01-25-0007-FI","DOIUrl":null,"url":null,"abstract":"<p><p>High-throughput phenotyping technologies increase the efficiency of breeding programs, but with larger datasets, errors can accumulate. Plant breeders often conduct quantitative trait locus (QTL) mapping, where large sample size and accurate quantitative response estimates are important for detecting small-effect QTLs. This study examined how phenotype error, inconsistency, and replication changed QTL magnitude and location. Three real sets of phenotype data were used from microscopy robot analysis of grapevine powdery mildew (<i>Erysiphe necator</i>) severity, which previously resulted in discovery of large (<i>R</i><sup>2</sup> = 85%), intermediate (<i>R</i><sup>2</sup> = 45%), and small (<i>R</i><sup>2</sup> = 9%) effect QTLs. Custom R scripts were written to induce several realistic sources of error, inconsistency, and varied replication. The results were remarkably robust to these changes. Swapping or shifting 2% of samples or changing disease severity by 50% on one replicate had negligible impact on QTLs. Unreplicated simulations produced the largest logarithm of the odds score range (5.55 to 8.27) and mean logarithm of the odds score deviation (-1.72 to -3.22; Cohen's D = 1.48 to 2.12). The large-effect-size QTL (<i>REN12</i>) was always detected. The intermediate-effect-size QTL (<i>REN13</i>) was detected except when three of the eight replicates were analyzed individually. Even for the small-effect-size locus (<i>NYVPLG9</i>), error scenarios rarely (2 of 9,000 cases) eliminated significant QTL detection, versus no replication (9 of 10). Thus, the benefits of data volume associated with high-throughput phenotyping technologies outweigh the cost of the increased errors tested here. Instead, the focus should be on examining how each experimental replicate contributes to the results of the QTL mapping analysis.</p>","PeriodicalId":20410,"journal":{"name":"Phytopathology","volume":" ","pages":"PHYTO01250007FI"},"PeriodicalIF":3.1000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Phytopathology","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1094/PHYTO-01-25-0007-FI","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
High-throughput phenotyping technologies increase the efficiency of breeding programs, but with larger datasets, errors can accumulate. Plant breeders often conduct quantitative trait locus (QTL) mapping, where large sample size and accurate quantitative response estimates are important for detecting small-effect QTLs. This study examined how phenotype error, inconsistency, and replication changed QTL magnitude and location. Three real sets of phenotype data were used from microscopy robot analysis of grapevine powdery mildew (Erysiphe necator) severity, which previously resulted in discovery of large (R2 = 85%), intermediate (R2 = 45%), and small (R2 = 9%) effect QTLs. Custom R scripts were written to induce several realistic sources of error, inconsistency, and varied replication. The results were remarkably robust to these changes. Swapping or shifting 2% of samples or changing disease severity by 50% on one replicate had negligible impact on QTLs. Unreplicated simulations produced the largest logarithm of the odds score range (5.55 to 8.27) and mean logarithm of the odds score deviation (-1.72 to -3.22; Cohen's D = 1.48 to 2.12). The large-effect-size QTL (REN12) was always detected. The intermediate-effect-size QTL (REN13) was detected except when three of the eight replicates were analyzed individually. Even for the small-effect-size locus (NYVPLG9), error scenarios rarely (2 of 9,000 cases) eliminated significant QTL detection, versus no replication (9 of 10). Thus, the benefits of data volume associated with high-throughput phenotyping technologies outweigh the cost of the increased errors tested here. Instead, the focus should be on examining how each experimental replicate contributes to the results of the QTL mapping analysis.
期刊介绍:
Phytopathology publishes articles on fundamental research that advances understanding of the nature of plant diseases, the agents that cause them, their spread, the losses they cause, and measures that can be used to control them. Phytopathology considers manuscripts covering all aspects of plant diseases including bacteriology, host-parasite biochemistry and cell biology, biological control, disease control and pest management, description of new pathogen species description of new pathogen species, ecology and population biology, epidemiology, disease etiology, host genetics and resistance, mycology, nematology, plant stress and abiotic disorders, postharvest pathology and mycotoxins, and virology. Papers dealing mainly with taxonomy, such as descriptions of new plant pathogen taxa are acceptable if they include plant disease research results such as pathogenicity, host range, etc. Taxonomic papers that focus on classification, identification, and nomenclature below the subspecies level may also be submitted to Phytopathology.