Application of near-infrared spectroscopy for fast germplasm analysis and classification in multi-environment using intact-seed peanut (Arachis hypogaea L.)

Q3 Agricultural and Biological Sciences
Fentanesh Chekole Kassie , Gilles Chaix , Hermine Bille Ngalle , Maguette Seye , Coura Fall , Hodo-Abalo Tossim , Aissatou Sambou , Olivier Gibert , Fabrice Davrieux , Joseph Martin Bell , Jean-François Rami , Daniel Fonceka , Joël Romaric Nguepjop
{"title":"Application of near-infrared spectroscopy for fast germplasm analysis and classification in multi-environment using intact-seed peanut (Arachis hypogaea L.)","authors":"Fentanesh Chekole Kassie ,&nbsp;Gilles Chaix ,&nbsp;Hermine Bille Ngalle ,&nbsp;Maguette Seye ,&nbsp;Coura Fall ,&nbsp;Hodo-Abalo Tossim ,&nbsp;Aissatou Sambou ,&nbsp;Olivier Gibert ,&nbsp;Fabrice Davrieux ,&nbsp;Joseph Martin Bell ,&nbsp;Jean-François Rami ,&nbsp;Daniel Fonceka ,&nbsp;Joël Romaric Nguepjop","doi":"10.1016/j.ocsci.2024.03.003","DOIUrl":null,"url":null,"abstract":"<div><p>Peanut is a worldwide oilseed crop and the need to assess germplasm in a non-destructive manner is important for seed nutritional breeding. In this study, Near Infrared Spectroscopy (NIRS) was applied to rapidly assess germplasm variability from whole seed of 699 samples, field-collected and assembled in four genetic and environment-based sets: one set of 300 varieties of a core-collection and three sets of 133 genotypes of an interspecific population, evaluated in three environments in a large spatial scale of two countries, Mbalmayo and Bafia in Cameroon and Nioro in Senegal, under rainfed conditions. NIR elemental spectra were gathered on six subsets of seeds of each sample, after three rotation scans, with a spectral resolution of 16 cm<sup>-1</sup> over the spectral range of 867 nm to 2530 ​nm. Spectra were then processed by principal component analysis (PCA) coupled with Partial least squares-discriminant analysis (PLS-DA). As results, a huge variability was found between varieties and genotypes for all NIR wavelength within and between environments. The magnitude of genetic variation was particularly observed at 11 relevant wavelengths such as 1723 ​nm, usually related to oil content and fatty acid composition. PCA yielded the most chemical attributes in three significant PCs (i.e., eigenvalues &gt;10), which together captured 93% of the total variation, revealing genetic and environment structure of varieties and genotypes into four clusters, corresponding to the four samples sets. The pattern of genetic variability of the interspecific population covers, remarkably half of spectrum of the core-collection, turning out to be the largest. Interestingly, a PLS-DA model was developed and a strong accuracy of 99.6% was achieved for the four sets, aiming to classify each seed sample according to environment origin. The confusion matrix achieved for the two sets of Bafia and Nioro showed 100% of instances classified correctly with 100% at both sensitivity and specificity, confirming that their seed quality was different from each other and all other samples. Overall, NIRS chemometrics is useful to assess and distinguish seeds from different environments and highlights the value of the interspecific population and core-collection, as a source of nutritional diversity, to support the breeding efforts.</p></div>","PeriodicalId":34095,"journal":{"name":"Oil Crop Science","volume":"9 2","pages":"Pages 132-141"},"PeriodicalIF":0.0000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2096242824000241/pdfft?md5=c346609e816dacff868dbdf230d628b2&pid=1-s2.0-S2096242824000241-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Oil Crop Science","FirstCategoryId":"1091","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2096242824000241","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
引用次数: 0

Abstract

Peanut is a worldwide oilseed crop and the need to assess germplasm in a non-destructive manner is important for seed nutritional breeding. In this study, Near Infrared Spectroscopy (NIRS) was applied to rapidly assess germplasm variability from whole seed of 699 samples, field-collected and assembled in four genetic and environment-based sets: one set of 300 varieties of a core-collection and three sets of 133 genotypes of an interspecific population, evaluated in three environments in a large spatial scale of two countries, Mbalmayo and Bafia in Cameroon and Nioro in Senegal, under rainfed conditions. NIR elemental spectra were gathered on six subsets of seeds of each sample, after three rotation scans, with a spectral resolution of 16 cm-1 over the spectral range of 867 nm to 2530 ​nm. Spectra were then processed by principal component analysis (PCA) coupled with Partial least squares-discriminant analysis (PLS-DA). As results, a huge variability was found between varieties and genotypes for all NIR wavelength within and between environments. The magnitude of genetic variation was particularly observed at 11 relevant wavelengths such as 1723 ​nm, usually related to oil content and fatty acid composition. PCA yielded the most chemical attributes in three significant PCs (i.e., eigenvalues >10), which together captured 93% of the total variation, revealing genetic and environment structure of varieties and genotypes into four clusters, corresponding to the four samples sets. The pattern of genetic variability of the interspecific population covers, remarkably half of spectrum of the core-collection, turning out to be the largest. Interestingly, a PLS-DA model was developed and a strong accuracy of 99.6% was achieved for the four sets, aiming to classify each seed sample according to environment origin. The confusion matrix achieved for the two sets of Bafia and Nioro showed 100% of instances classified correctly with 100% at both sensitivity and specificity, confirming that their seed quality was different from each other and all other samples. Overall, NIRS chemometrics is useful to assess and distinguish seeds from different environments and highlights the value of the interspecific population and core-collection, as a source of nutritional diversity, to support the breeding efforts.

应用近红外光谱技术在多环境下利用完整种子花生(Arachis hypogaea L.)进行快速种质分析和分类
花生是一种世界性的油料作物,以非破坏性方式评估种质对于种子营养育种非常重要。在这项研究中,应用近红外光谱技术(NIRS)从田间采集的 699 个样品的全籽中快速评估种质变异性,这些样品被组合成四个基于遗传和环境的集合:一组是核心集合的 300 个品种,三组是种间群体的 133 个基因型,在两个国家(喀麦隆的 Mbalmayo 和 Bafia 以及塞内加尔的 Nioro)的三个大空间尺度环境中的雨水灌溉条件下进行评估。对每个样本的六个种子子集进行了近红外元素光谱采集,经过三次旋转扫描,光谱分辨率为 16 cm-1,光谱范围为 867 nm 至 2530 nm。然后通过主成分分析(PCA)和偏最小二乘判别分析(PLS-DA)对光谱进行处理。结果发现,在环境内和环境间,品种和基因型之间在所有近红外波长上都存在巨大差异。在 11 个相关波长(如 1723 nm)上观察到的遗传变异幅度尤其大,这些波长通常与含油量和脂肪酸组成有关。PCA 在三个显著的 PC(即特征值为 10)中得到了最多的化学属性,这三个 PC 共捕获了总变异的 93%,揭示了品种和基因型的遗传和环境结构,将其分为四个聚类,与四个样本集相对应。值得注意的是,种间群体的遗传变异模式覆盖了核心收集谱的一半,是最大的遗传变异模式。有趣的是,我们开发了一个 PLS-DA 模型,该模型对四组样本的准确率高达 99.6%,目的是根据环境来源对每个种子样本进行分类。Bafia 和 Nioro 两组样本的混淆矩阵显示,100% 的实例被正确分类,灵敏度和特异度均为 100%,这证明它们的种子质量与其他所有样本不同。总之,近红外光谱化学计量学有助于评估和区分来自不同环境的种子,并突出了种间群体和核心采集作为营养多样性来源的价值,以支持育种工作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Oil Crop Science
Oil Crop Science Food Science, Plant Science, Agronomy and Crop Science
CiteScore
3.40
自引率
0.00%
发文量
20
审稿时长
74 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信