印度芥菜低产、高产居群判别的变量选择

P. Godara
{"title":"印度芥菜低产、高产居群判别的变量选择","authors":"P. Godara","doi":"10.18782/2320-7051.7792","DOIUrl":null,"url":null,"abstract":"Variable Selection is an important problem in classification and discriminant analysis. The selection of important variables for the purpose of discrimination between populations is important from the point of view of time and resources required for the experimentation. Keeping this in view, the present study has been designed to find important characters of Indian mustard which can discriminate between high and low yielding genotypes. Secondary data set on 310 genotypes of Indian mustard recorded for 12 characters was used for discrimination between populations of low and high yielding genotypes of Indian mustard. Three variable selection methods (Univariate t-test, Rao ́s F test for additional Information and Random Forests Algorithm) for classification and discrimination were used and compared. Performance of the methods was assessed in terms of leave one out cross-validation error and out of bag error rate for classification. The Four most important variables for discrimination among genotypes based on seed yield per plants were secondary branches, primary branches, days to maturity and siliqua number on main shoot.","PeriodicalId":14249,"journal":{"name":"International Journal of Pure & Applied Bioscience","volume":"68 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2019-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Variable Selection for Discrimination between Low and High Yielding Populations of Indian Mustard\",\"authors\":\"P. Godara\",\"doi\":\"10.18782/2320-7051.7792\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Variable Selection is an important problem in classification and discriminant analysis. The selection of important variables for the purpose of discrimination between populations is important from the point of view of time and resources required for the experimentation. Keeping this in view, the present study has been designed to find important characters of Indian mustard which can discriminate between high and low yielding genotypes. Secondary data set on 310 genotypes of Indian mustard recorded for 12 characters was used for discrimination between populations of low and high yielding genotypes of Indian mustard. Three variable selection methods (Univariate t-test, Rao ́s F test for additional Information and Random Forests Algorithm) for classification and discrimination were used and compared. Performance of the methods was assessed in terms of leave one out cross-validation error and out of bag error rate for classification. The Four most important variables for discrimination among genotypes based on seed yield per plants were secondary branches, primary branches, days to maturity and siliqua number on main shoot.\",\"PeriodicalId\":14249,\"journal\":{\"name\":\"International Journal of Pure & Applied Bioscience\",\"volume\":\"68 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Pure & Applied Bioscience\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18782/2320-7051.7792\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Pure & Applied Bioscience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18782/2320-7051.7792","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

变量选择是分类和判别分析中的一个重要问题。从实验所需的时间和资源的角度来看,为种群之间的区分选择重要变量是很重要的。有鉴于此,本研究旨在寻找能够区分高产和低产基因型的印度芥菜的重要性状。利用记录的310个印度芥菜基因型12个性状的二级数据集,对印度芥菜低、高产基因型群体进行了判别。使用三种变量选择方法(单变量t检验、附加信息的Rao′s F检验和随机森林算法)进行分类和判别并进行比较。从交叉验证误差和分类失误率两方面对方法的性能进行了评估。根据单株种子产量进行基因型区分的4个最重要变量是次生枝、初生枝、成熟期和主梢上的穗数。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Variable Selection for Discrimination between Low and High Yielding Populations of Indian Mustard
Variable Selection is an important problem in classification and discriminant analysis. The selection of important variables for the purpose of discrimination between populations is important from the point of view of time and resources required for the experimentation. Keeping this in view, the present study has been designed to find important characters of Indian mustard which can discriminate between high and low yielding genotypes. Secondary data set on 310 genotypes of Indian mustard recorded for 12 characters was used for discrimination between populations of low and high yielding genotypes of Indian mustard. Three variable selection methods (Univariate t-test, Rao ́s F test for additional Information and Random Forests Algorithm) for classification and discrimination were used and compared. Performance of the methods was assessed in terms of leave one out cross-validation error and out of bag error rate for classification. The Four most important variables for discrimination among genotypes based on seed yield per plants were secondary branches, primary branches, days to maturity and siliqua number on main shoot.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信