{"title":"Selection indices and support vector machines in the selection of sugarcane families","authors":"B. A. Muetanene","doi":"10.47328/ufvbbt.2022.664","DOIUrl":null,"url":null,"abstract":"The present study aimed to compare selection indices, namely: Smith and Hazel multiplicative, Mulamba and Mock's, and the support vector machines algorithm for sugarcane families selection. We used two datasets, from Moreira et al. (2021) and from Ferreira et al. (2022), both related to the sugarcane breeding program conducted at the Center for Sugar cane Research and Breeding at the Federal University of Viçosa, Oratórios, Minas Gerais. Both experiments were conducted in a randomized complete block design. We constructed the selection indices via mixed models approach. We adopted a selection percentage of 18% of the top families for the selection process. In both studies, we considered as explanatory traits: the number of stalks, stalks diameter and stalk height, and as the response trait the tons of stalks per hectare per family. In the dataset from Ferreira et al. (2022), the support vector machine was a better approach to select sugarcane families by learning from the data after multivariate simulation. Whereas in the dataset from Moreira et al. (2021), using similar methodology, lower performance for support vector machines was obtained. Keywords: Synthetic data. Indirect selection. Yield prediction. Machine learning. BLUP","PeriodicalId":134365,"journal":{"name":"BRAZILIAN JOURNAL OF AGRICULTURE - Revista de Agricultura","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BRAZILIAN JOURNAL OF AGRICULTURE - Revista de Agricultura","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47328/ufvbbt.2022.664","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The present study aimed to compare selection indices, namely: Smith and Hazel multiplicative, Mulamba and Mock's, and the support vector machines algorithm for sugarcane families selection. We used two datasets, from Moreira et al. (2021) and from Ferreira et al. (2022), both related to the sugarcane breeding program conducted at the Center for Sugar cane Research and Breeding at the Federal University of Viçosa, Oratórios, Minas Gerais. Both experiments were conducted in a randomized complete block design. We constructed the selection indices via mixed models approach. We adopted a selection percentage of 18% of the top families for the selection process. In both studies, we considered as explanatory traits: the number of stalks, stalks diameter and stalk height, and as the response trait the tons of stalks per hectare per family. In the dataset from Ferreira et al. (2022), the support vector machine was a better approach to select sugarcane families by learning from the data after multivariate simulation. Whereas in the dataset from Moreira et al. (2021), using similar methodology, lower performance for support vector machines was obtained. Keywords: Synthetic data. Indirect selection. Yield prediction. Machine learning. BLUP
本研究旨在比较Smith and Hazel乘法、Mulamba and Mock’s选择指标和支持向量机算法在甘蔗家族选择中的应用。我们使用了两个数据集,分别来自Moreira等人(2021)和Ferreira等人(2022),这两个数据集都与米纳斯吉拉斯州维拉瑟萨联邦大学甘蔗研究与育种中心(Oratórios)开展的甘蔗育种计划有关。两项实验均采用完全随机区组设计。采用混合模型方法构建了选择指标。我们在挑选过程中采用了18%的顶级家庭的选择百分比。在两项研究中,我们都考虑了作为解释性状的秸秆数、秸秆直径和秸秆高度,以及作为响应性状的每户每公顷秸秆吨数。在Ferreira et al.(2022)的数据集中,通过多变量模拟后的数据学习,支持向量机是一种更好的选择甘蔗家族的方法。而在Moreira等人(2021)的数据集中,使用类似的方法,支持向量机的性能较低。关键词:合成数据;间接选择。产量预测。机器学习。BLUP