Evaluation of biochemical profiles in walnut (Juglans regia L.) genotypes and predictive modeling using advanced machine learning algorithms

IF 3.2 3区 农林科学 Q2 FOOD SCIENCE & TECHNOLOGY
Fatih Temizyürek, Muhammet Ali Gündeşli, Yavuz Canbay, Hacı Osman Özatar, Esra Gölcü, Vahdet Cemil Altun, Ömer Atagül, Murat Güney, Ebru Kafkas, Remzi Ugur, Mehmet Sütyemez
{"title":"Evaluation of biochemical profiles in walnut (Juglans regia L.) genotypes and predictive modeling using advanced machine learning algorithms","authors":"Fatih Temizyürek,&nbsp;Muhammet Ali Gündeşli,&nbsp;Yavuz Canbay,&nbsp;Hacı Osman Özatar,&nbsp;Esra Gölcü,&nbsp;Vahdet Cemil Altun,&nbsp;Ömer Atagül,&nbsp;Murat Güney,&nbsp;Ebru Kafkas,&nbsp;Remzi Ugur,&nbsp;Mehmet Sütyemez","doi":"10.1007/s00217-025-04826-7","DOIUrl":null,"url":null,"abstract":"<div><p>Biochemical parameters such as phenolic compounds, tocopherols and polyunsaturated fatty acids (omega-6 and omega-3 etc.) found in walnuts support cardiovascular health with their high antioxidant content, reduce inflammation and improve the blood lipid profile and show positive effects such as lowering LDL cholesterol, increasing HDL and reducing oxidative stress. This study investigates the application of machine learning algorithms to predict the biochemical properties of 10 walnut (<i>Juglans regia</i> L.) genotypes selected from the Kahramanmaraş region and three commonly cultivated walnut cultivars (Maraş-18, Chandler, and Franquette). Plant material was collected between 2020–2021, and analyses were conducted on biochemical components, including polyphenols, antioxidant capacity, protein, oil, fatty acids, sugars, and organic acids using HPLC, GC, and spectrophotometric methods. The highest total phenolic content was found in genotype 46KM-5 (402.6 mg GAE/100 g), and the highest antioxidant capacity in genotype 46KM-13 (73.77%). The 46KM-9 genotype exhibited the highest oil content (71.60%) and the highest total sugar content (3.27%), while sucrose was the dominant sugar. Regarding fatty acids, genotype 46KM-19 had the highest linoleic acid content (60.88%), and genotype 46KM-21 showed the highest oleic acid content (20.88%). Malic acid emerged as the dominant organic acid, with the highest content observed in genotype 46KM-9 (6.55%). In addition, various machine learning algorithms such as such as decision tree, support vector regression, K-nearest neighbor (KNN), gradient boosting, multi-layer perceptron, and linear regression (LR) have been used to assess the biochemical properties of genotypes using hyperparameter optimization. KNN algorithm generally provided high performance in prediction processes by exhibiting very low error rates. This study's results align with previous research, highlighting walnut genotype variation and nutritional benefits. Machine learning predicts key traits, aiding breeders and consumers in health-focused production and better genotype selection for improved cultivation.</p></div>","PeriodicalId":549,"journal":{"name":"European Food Research and Technology","volume":"251 10","pages":"3105 - 3122"},"PeriodicalIF":3.2000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s00217-025-04826-7.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Food Research and Technology","FirstCategoryId":"97","ListUrlMain":"https://link.springer.com/article/10.1007/s00217-025-04826-7","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Biochemical parameters such as phenolic compounds, tocopherols and polyunsaturated fatty acids (omega-6 and omega-3 etc.) found in walnuts support cardiovascular health with their high antioxidant content, reduce inflammation and improve the blood lipid profile and show positive effects such as lowering LDL cholesterol, increasing HDL and reducing oxidative stress. This study investigates the application of machine learning algorithms to predict the biochemical properties of 10 walnut (Juglans regia L.) genotypes selected from the Kahramanmaraş region and three commonly cultivated walnut cultivars (Maraş-18, Chandler, and Franquette). Plant material was collected between 2020–2021, and analyses were conducted on biochemical components, including polyphenols, antioxidant capacity, protein, oil, fatty acids, sugars, and organic acids using HPLC, GC, and spectrophotometric methods. The highest total phenolic content was found in genotype 46KM-5 (402.6 mg GAE/100 g), and the highest antioxidant capacity in genotype 46KM-13 (73.77%). The 46KM-9 genotype exhibited the highest oil content (71.60%) and the highest total sugar content (3.27%), while sucrose was the dominant sugar. Regarding fatty acids, genotype 46KM-19 had the highest linoleic acid content (60.88%), and genotype 46KM-21 showed the highest oleic acid content (20.88%). Malic acid emerged as the dominant organic acid, with the highest content observed in genotype 46KM-9 (6.55%). In addition, various machine learning algorithms such as such as decision tree, support vector regression, K-nearest neighbor (KNN), gradient boosting, multi-layer perceptron, and linear regression (LR) have been used to assess the biochemical properties of genotypes using hyperparameter optimization. KNN algorithm generally provided high performance in prediction processes by exhibiting very low error rates. This study's results align with previous research, highlighting walnut genotype variation and nutritional benefits. Machine learning predicts key traits, aiding breeders and consumers in health-focused production and better genotype selection for improved cultivation.

核桃(Juglans regia L.)基因型生物化学特征评估及先进机器学习算法的预测建模
核桃中含有的酚类化合物、生育酚和多不饱和脂肪酸(omega-6和omega-3等)等生化参数,以其高抗氧化含量支持心血管健康,减少炎症,改善血脂,并显示出降低低密度脂蛋白胆固醇、增加高密度脂蛋白胆固醇和减少氧化应激等积极作用。本研究利用机器学习算法对kahramanmaraki地区10个核桃(Juglans regia L.)基因型和3个常用核桃品种(mara -18、Chandler和Franquette)的生化特性进行了预测。在2020-2021年间采集植物材料,采用HPLC、GC和分光光度法分析其生化成分,包括多酚、抗氧化能力、蛋白质、油脂、脂肪酸、糖和有机酸。总酚含量最高的基因型为46KM-5 (402.6 mg GAE/100 g),抗氧化能力最高的基因型为46KM-13(73.77%)。46KM-9基因型含油量最高(71.60%),总糖含量最高(3.27%),以蔗糖为主。脂肪酸方面,基因型46KM-19亚油酸含量最高(60.88%),基因型46KM-21油酸含量最高(20.88%)。苹果酸为优势有机酸,46KM-9基因型苹果酸含量最高(6.55%)。此外,各种机器学习算法,如决策树、支持向量回归、k最近邻(KNN)、梯度增强、多层感知器和线性回归(LR)已被用于使用超参数优化来评估基因型的生化特性。KNN算法通常在预测过程中表现出非常低的错误率,从而提供了很高的性能。这项研究的结果与之前的研究一致,强调了核桃基因型变异和营养价值。机器学习预测关键性状,帮助育种者和消费者进行以健康为重点的生产,并更好地选择基因型,以改进种植。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
European Food Research and Technology
European Food Research and Technology 工程技术-食品科技
CiteScore
6.60
自引率
3.00%
发文量
232
审稿时长
2.0 months
期刊介绍: The journal European Food Research and Technology publishes state-of-the-art research papers and review articles on fundamental and applied food research. The journal''s mission is the fast publication of high quality papers on front-line research, newest techniques and on developing trends in the following sections: -chemistry and biochemistry- technology and molecular biotechnology- nutritional chemistry and toxicology- analytical and sensory methodologies- food physics. Out of the scope of the journal are: - contributions which are not of international interest or do not have a substantial impact on food sciences, - submissions which comprise merely data collections, based on the use of routine analytical or bacteriological methods, - contributions reporting biological or functional effects without profound chemical and/or physical structure characterization of the compound(s) under research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信