{"title":"机器学习的技术细微差别:植物育种中基因组预测监督方法的实施和验证","authors":"A. Xavier","doi":"10.1590/1984-70332021v21sa15","DOIUrl":null,"url":null,"abstract":"Abstract The decision-making process in plant breeding is driven by data. The machine learning framework has powerful tools that can extract useful information from data. However, there is still a lack of understanding about the underlying algorithms of these methods, their strengths, and pitfalls. Machine learning has two main branches: supervised and unsupervised learning. In plant breeding, supervised learning is used for genomic prediction, where phenotypic traits are modeled as a function of molecular markers. The key supervised learning algorithms for genomic prediction are linear methods, kernel methods, neural networks, and tree ensembles. This manuscript provides an insight into the implementation of these algorithms and how cross-validations can be used to compare methods. Examples for genomic prediction come from plant breeding.","PeriodicalId":10763,"journal":{"name":"Crop Breeding and Applied Biotechnology","volume":null,"pages":null},"PeriodicalIF":1.5000,"publicationDate":"2021-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Technical nuances of machine learning: implementation and validation of supervised methods for genomic prediction in plant breeding\",\"authors\":\"A. Xavier\",\"doi\":\"10.1590/1984-70332021v21sa15\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract The decision-making process in plant breeding is driven by data. The machine learning framework has powerful tools that can extract useful information from data. However, there is still a lack of understanding about the underlying algorithms of these methods, their strengths, and pitfalls. Machine learning has two main branches: supervised and unsupervised learning. In plant breeding, supervised learning is used for genomic prediction, where phenotypic traits are modeled as a function of molecular markers. The key supervised learning algorithms for genomic prediction are linear methods, kernel methods, neural networks, and tree ensembles. This manuscript provides an insight into the implementation of these algorithms and how cross-validations can be used to compare methods. Examples for genomic prediction come from plant breeding.\",\"PeriodicalId\":10763,\"journal\":{\"name\":\"Crop Breeding and Applied Biotechnology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2021-06-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Crop Breeding and Applied Biotechnology\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1590/1984-70332021v21sa15\",\"RegionNum\":4,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Agricultural and Biological Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Crop Breeding and Applied Biotechnology","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1590/1984-70332021v21sa15","RegionNum":4,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
Technical nuances of machine learning: implementation and validation of supervised methods for genomic prediction in plant breeding
Abstract The decision-making process in plant breeding is driven by data. The machine learning framework has powerful tools that can extract useful information from data. However, there is still a lack of understanding about the underlying algorithms of these methods, their strengths, and pitfalls. Machine learning has two main branches: supervised and unsupervised learning. In plant breeding, supervised learning is used for genomic prediction, where phenotypic traits are modeled as a function of molecular markers. The key supervised learning algorithms for genomic prediction are linear methods, kernel methods, neural networks, and tree ensembles. This manuscript provides an insight into the implementation of these algorithms and how cross-validations can be used to compare methods. Examples for genomic prediction come from plant breeding.
期刊介绍:
The CBAB – CROP BREEDING AND APPLIED BIOTECHNOLOGY (ISSN 1984-7033) – is the official quarterly journal of the Brazilian Society of Plant Breeding, abbreviated CROP BREED APPL BIOTECHNOL.
It publishes original scientific articles, which contribute to the scientific and technological development of plant breeding and agriculture. Articles should be to do with basic and applied research on improvement of perennial and annual plants, within the fields of genetics, conservation of germplasm, biotechnology, genomics, cytogenetics, experimental statistics, seeds, food quality, biotic and abiotic stress, and correlated areas. The article must be unpublished. Simultaneous submitting to another periodical is ruled out. Authors are held solely responsible for the opinions and ideas expressed, which do not necessarily reflect the view of the Editorial board. However, the Editorial board reserves the right to suggest or ask for any modifications required. The journal adopts the Ithenticate software for identification of plagiarism. Complete or partial reproduction of articles is permitted, provided the source is cited. All content of the journal, except where identified, is licensed under a Creative Commons attribution-type BY. All articles are published free of charge. This is an open access journal.