Realistic Scenarios of Phenotypic Variation and Errors in High-Throughput Phenotyping Experiments Minimally Impact the Results of Quantitative Trait Locus Mapping Analysis.

IF 3.1 2区农林科学 Q2 PLANT SCIENCES

Phytopathology Pub Date : 2025-07-14 DOI:10.1094/PHYTO-01-25-0007-FI

Aliyah Brewer, Anna Underhill, Surya Sapkota, Chin-Feng Hwang, Summaira Riaz, Madeline Oravec, Lance Cadle-Davidson

{"title":"Realistic Scenarios of Phenotypic Variation and Errors in High-Throughput Phenotyping Experiments Minimally Impact the Results of Quantitative Trait Locus Mapping Analysis.","authors":"Aliyah Brewer, Anna Underhill, Surya Sapkota, Chin-Feng Hwang, Summaira Riaz, Madeline Oravec, Lance Cadle-Davidson","doi":"10.1094/PHYTO-01-25-0007-FI","DOIUrl":null,"url":null,"abstract":"High-throughput phenotyping technologies increase the efficiency of breeding programs, but with larger datasets, errors can accumulate. Plant breeders often conduct quantitative trait locus (QTL) mapping, where large sample size and accurate quantitative response estimates are important for detecting small-effect QTLs. This study examined how phenotype error, inconsistency, and replication changed QTL magnitude and location. Three real sets of phenotype data were used from microscopy robot analysis of grapevine powdery mildew (Erysiphe necator) severity, which previously resulted in discovery of large (R2 = 85%), intermediate (R2 = 45%), and small (R2 = 9%) effect QTLs. Custom R scripts were written to induce several realistic sources of error, inconsistency, and varied replication. The results were remarkably robust to these changes. Swapping or shifting 2% of samples or changing disease severity by 50% on one replicate had negligible impact on QTLs. Unreplicated simulations produced the largest logarithm of the odds score range (5.55 to 8.27) and mean logarithm of the odds score deviation (-1.72 to -3.22; Cohen's D = 1.48 to 2.12). The large-effect-size QTL (REN12) was always detected. The intermediate-effect-size QTL (REN13) was detected except when three of the eight replicates were analyzed individually. Even for the small-effect-size locus (NYVPLG9), error scenarios rarely (2 of 9,000 cases) eliminated significant QTL detection, versus no replication (9 of 10). Thus, the benefits of data volume associated with high-throughput phenotyping technologies outweigh the cost of the increased errors tested here. Instead, the focus should be on examining how each experimental replicate contributes to the results of the QTL mapping analysis.","PeriodicalId":20410,"journal":{"name":"Phytopathology","volume":" ","pages":"PHYTO01250007FI"},"PeriodicalIF":3.1000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Phytopathology","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1094/PHYTO-01-25-0007-FI","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PLANT SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

High-throughput phenotyping technologies increase the efficiency of breeding programs, but with larger datasets, errors can accumulate. Plant breeders often conduct quantitative trait locus (QTL) mapping, where large sample size and accurate quantitative response estimates are important for detecting small-effect QTLs. This study examined how phenotype error, inconsistency, and replication changed QTL magnitude and location. Three real sets of phenotype data were used from microscopy robot analysis of grapevine powdery mildew (Erysiphe necator) severity, which previously resulted in discovery of large (R² = 85%), intermediate (R² = 45%), and small (R² = 9%) effect QTLs. Custom R scripts were written to induce several realistic sources of error, inconsistency, and varied replication. The results were remarkably robust to these changes. Swapping or shifting 2% of samples or changing disease severity by 50% on one replicate had negligible impact on QTLs. Unreplicated simulations produced the largest logarithm of the odds score range (5.55 to 8.27) and mean logarithm of the odds score deviation (-1.72 to -3.22; Cohen's D = 1.48 to 2.12). The large-effect-size QTL (REN12) was always detected. The intermediate-effect-size QTL (REN13) was detected except when three of the eight replicates were analyzed individually. Even for the small-effect-size locus (NYVPLG9), error scenarios rarely (2 of 9,000 cases) eliminated significant QTL detection, versus no replication (9 of 10). Thus, the benefits of data volume associated with high-throughput phenotyping technologies outweigh the cost of the increased errors tested here. Instead, the focus should be on examining how each experimental replicate contributes to the results of the QTL mapping analysis.

查看原文本刊更多论文

高通量表型实验中表型变异和错误的现实情况对QTL定位分析结果的影响最小。

高通量表型技术提高了育种计划的效率，但随着数据集的增加，错误可能会累积。植物育种家经常进行数量性状位点（QTL）定位，其中大样本量和准确的定量应答估计对于检测小效应QTL至关重要。本研究考察了表型错误、不一致和复制如何改变QTL的大小和位置。利用显微镜机器人分析葡萄白粉病（Erysiphe necator）严重程度的三组真实表型数据，先前发现了大（R2 = 85%）、中（R2 = 45%）和小（R2 = 9%）效应QTL。编写自定义R脚本是为了引出几个实际的错误、不一致和各种复制的来源。结果对这些变化非常有力。交换或转移2%的样本或在一个重复中将疾病严重程度改变50%对QTL的影响可以忽略不计。非重复模拟得到最大的LOD评分范围（5.55 ~ 8.27）和平均LOD评分偏差（-1.72 ~ -3.22）；科恩D = 1.48 - 2.12)。总能检测到大效应量QTL （REN12）。除单独分析8个重复中的3个重复外，检测中间效应大小QTL （REN13）。即使对于效应大小较小的位点（NYVPLG9），错误情况也很少（9000例中有2例）消除了显著的QTL检测，而没有复制（10例中有9例）。因此，与高通量表型技术相关的数据量的好处超过了这里测试的错误增加的成本。相反，重点应该放在检查每个实验重复对QTL定位分析结果的贡献上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Phytopathology 生物-植物科学

CiteScore

5.90

自引率

9.40%

发文量

505

审稿时长

4-8 weeks

期刊介绍： Phytopathology publishes articles on fundamental research that advances understanding of the nature of plant diseases, the agents that cause them, their spread, the losses they cause, and measures that can be used to control them. Phytopathology considers manuscripts covering all aspects of plant diseases including bacteriology, host-parasite biochemistry and cell biology, biological control, disease control and pest management, description of new pathogen species description of new pathogen species, ecology and population biology, epidemiology, disease etiology, host genetics and resistance, mycology, nematology, plant stress and abiotic disorders, postharvest pathology and mycotoxins, and virology. Papers dealing mainly with taxonomy, such as descriptions of new plant pathogen taxa are acceptable if they include plant disease research results such as pathogenicity, host range, etc. Taxonomic papers that focus on classification, identification, and nomenclature below the subspecies level may also be submitted to Phytopathology.