Genetic diversity analysis of peppers: a comparison of discarding variable methods

IF 1.1 4区 农林科学 Q3 AGRONOMY
E. R. Rêgo, M. M. Rêgo, C. Cruz, P. Cecon, F. Finger
{"title":"Genetic diversity analysis of peppers: a comparison of discarding variable methods","authors":"E. R. Rêgo, M. M. Rêgo, C. Cruz, P. Cecon, F. Finger","doi":"10.12702/1984-7033.V03N01A03","DOIUrl":null,"url":null,"abstract":"There are a lot of variables in genetic diversity studies, and it is necessary to know whether or not they are all important and which ones can be discarded. There are often little changes in clustering patterns if a subset of these variables is used, because the discarded variables are redundant or of little contribution to the variability. This study aimed at comparing two discards of variables methods – the Singh method and the principal components method – as well as evaluating the effect of the discards on the cluster analysis. In this analysis data of six ripe fruits traits were used. Other characters with previously known variability or collinearity were added to the analysis. The method considered being the most efficient was the one, which indicated variables that did not alter the initial clustering pattern when discarded. The Singh method did not detect variation differences when standardized data were used. When the distance was obtained by the non-standardized data, the pericarp thickness (0.018%), total soluble solids (0.1668%) and minimum width (2.99%) had the lowest contribution to the divergence. The principal components pointed out that the characteristics fruit length, total soluble solids and seeds yield/fruit were considered as dispensable variables. There were changes in the initial clustering pattern when the variable pericarp thickness was discarded, and the Singh method was not efficient in detecting the importance of this variable. There were no changes in the initial clustering pattern when fruit length was discarded. The data showed that the two compared methods differed, since Singh’s and principal component methods showed different variables to be discarded. The Singh method was not efficient in detecting multicollinearity among variables. The principal component method was more efficient in pointing out the variables that can be discarded. It is advisable that the genetic divergence is calculated based on the scores of the principal components. In future studies, when there is no replicated data, the genetic divergence and the pinpoint of characters should be calculated based on the principal component scores to avoid discarding some important variables when determining divergence. However, if the variable values differ independently, the Singh method based on Euclidean distance is appropriate.","PeriodicalId":49085,"journal":{"name":"Crop Breeding and Applied Biotechnology","volume":"1 1","pages":"19-26"},"PeriodicalIF":1.1000,"publicationDate":"2003-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Crop Breeding and Applied Biotechnology","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.12702/1984-7033.V03N01A03","RegionNum":4,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 27

Abstract

There are a lot of variables in genetic diversity studies, and it is necessary to know whether or not they are all important and which ones can be discarded. There are often little changes in clustering patterns if a subset of these variables is used, because the discarded variables are redundant or of little contribution to the variability. This study aimed at comparing two discards of variables methods – the Singh method and the principal components method – as well as evaluating the effect of the discards on the cluster analysis. In this analysis data of six ripe fruits traits were used. Other characters with previously known variability or collinearity were added to the analysis. The method considered being the most efficient was the one, which indicated variables that did not alter the initial clustering pattern when discarded. The Singh method did not detect variation differences when standardized data were used. When the distance was obtained by the non-standardized data, the pericarp thickness (0.018%), total soluble solids (0.1668%) and minimum width (2.99%) had the lowest contribution to the divergence. The principal components pointed out that the characteristics fruit length, total soluble solids and seeds yield/fruit were considered as dispensable variables. There were changes in the initial clustering pattern when the variable pericarp thickness was discarded, and the Singh method was not efficient in detecting the importance of this variable. There were no changes in the initial clustering pattern when fruit length was discarded. The data showed that the two compared methods differed, since Singh’s and principal component methods showed different variables to be discarded. The Singh method was not efficient in detecting multicollinearity among variables. The principal component method was more efficient in pointing out the variables that can be discarded. It is advisable that the genetic divergence is calculated based on the scores of the principal components. In future studies, when there is no replicated data, the genetic divergence and the pinpoint of characters should be calculated based on the principal component scores to avoid discarding some important variables when determining divergence. However, if the variable values differ independently, the Singh method based on Euclidean distance is appropriate.
辣椒遗传多样性分析:丢弃变量方法的比较
遗传多样性研究中有很多变量,有必要知道它们是否都是重要的,哪些是可以忽略的。如果使用这些变量的一个子集,通常聚类模式的变化很小,因为丢弃的变量是冗余的,或者对可变性的贡献很小。本研究旨在比较两种变量丢弃方法-辛格方法和主成分方法-以及评估丢弃对聚类分析的影响。本分析采用了6个成熟果实的性状数据。其他已知变异或共线性的性状被加入分析。被认为是最有效的方法,它表明变量在丢弃时不会改变初始聚类模式。当使用标准化数据时,Singh方法未检测到变异差异。当采用非标准化数据获取距离时,果皮厚度(0.018%)、总可溶性固形物(0.1668%)和最小宽度(2.99%)对差异的贡献最小。主成分分析指出,果实长度、总可溶性固形物和种子产量/果是可忽略的变量。当果皮厚度变量被丢弃时,初始聚类模式会发生变化,Singh方法在检测该变量的重要性方面效率不高。当果实长度被丢弃时,初始聚类模式没有变化。数据显示,两种比较方法不同,因为Singh 's和主成分方法显示了不同的变量被丢弃。Singh方法在检测变量间多重共线性时效率不高。主成分法更有效地指出了可以丢弃的变量。建议根据主成分的得分来计算遗传差异。在今后的研究中,在没有重复数据的情况下,应根据主成分得分计算遗传散度和性状的精确度,避免在确定散度时遗漏一些重要变量。然而,如果变量值独立不同,则基于欧几里得距离的辛格方法是合适的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Crop Breeding and Applied Biotechnology
Crop Breeding and Applied Biotechnology AGRONOMY-BIOTECHNOLOGY & APPLIED MICROBIOLOGY
CiteScore
2.70
自引率
13.30%
发文量
25
审稿时长
6-12 weeks
期刊介绍: The CBAB – CROP BREEDING AND APPLIED BIOTECHNOLOGY (ISSN 1984-7033) – is the official quarterly journal of the Brazilian Society of Plant Breeding, abbreviated CROP BREED APPL BIOTECHNOL. It publishes original scientific articles, which contribute to the scientific and technological development of plant breeding and agriculture. Articles should be to do with basic and applied research on improvement of perennial and annual plants, within the fields of genetics, conservation of germplasm, biotechnology, genomics, cytogenetics, experimental statistics, seeds, food quality, biotic and abiotic stress, and correlated areas. The article must be unpublished. Simultaneous submitting to another periodical is ruled out. Authors are held solely responsible for the opinions and ideas expressed, which do not necessarily reflect the view of the Editorial board. However, the Editorial board reserves the right to suggest or ask for any modifications required. The journal adopts the Ithenticate software for identification of plagiarism. Complete or partial reproduction of articles is permitted, provided the source is cited. All content of the journal, except where identified, is licensed under a Creative Commons attribution-type BY. All articles are published free of charge. This is an open access journal.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信