Multimodal deep learning for oil content prediction in Camellia oleifera fruits using image, morphometric, and categorical features

IF 4.6 2区 农林科学 Q2 CHEMISTRY, APPLIED
Journal of Food Composition and Analysis Pub Date : 2026-03-01 Epub Date: 2026-02-13 DOI:10.1016/j.jfca.2026.108995
Xueyan Zhu , Huaiqing Zhang , Xue Zhang , Tiantian Ye , Yili Zheng
{"title":"Multimodal deep learning for oil content prediction in Camellia oleifera fruits using image, morphometric, and categorical features","authors":"Xueyan Zhu ,&nbsp;Huaiqing Zhang ,&nbsp;Xue Zhang ,&nbsp;Tiantian Ye ,&nbsp;Yili Zheng","doi":"10.1016/j.jfca.2026.108995","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate determination of oil content is essential for food composition analysis and quality control in the <em>Camellia</em> oil industry, yet conventional chemical analyses are destructive and difficult to implement at large scale. In this study, a multimodal oil content prediction model (MPCM-OC) was developed as an indirect, non-destructive approach to support oil content assessment in <em>Camellia oleifera</em> fruits based on reference chemical measurements. The proposed framework integrates fruit images, morphometric traits (transverse diameter, longitudinal diameter, and fruit shape index), and categorical information (cultivar, maturity stage, acquisition date, and sampling location), using separate feature extraction networks and an adaptive fusion module. Seed oil content values obtained using standardized chemical analysis served as reference data. The MPCM-OC model achieved an overall coefficient of determination (<em>R</em>²) of 0.8353, with a mean absolute percentage error of 13.38 %, a mean absolute error of 4.52, and a root mean squared error of 6.30. Ablation and comparative analyses showed that incorporating morphometric and categorical features with image data consistently improved prediction accuracy over image-only models. The proposed framework serves as a rapid, low-cost complementary tool for preliminary screening and batch-level quality evaluation, enhancing efficiency in food composition analysis and quality control of <em>Camellia oleifera</em>.</div></div>","PeriodicalId":15867,"journal":{"name":"Journal of Food Composition and Analysis","volume":"151 ","pages":"Article 108995"},"PeriodicalIF":4.6000,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Composition and Analysis","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0889157526001389","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/2/13 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate determination of oil content is essential for food composition analysis and quality control in the Camellia oil industry, yet conventional chemical analyses are destructive and difficult to implement at large scale. In this study, a multimodal oil content prediction model (MPCM-OC) was developed as an indirect, non-destructive approach to support oil content assessment in Camellia oleifera fruits based on reference chemical measurements. The proposed framework integrates fruit images, morphometric traits (transverse diameter, longitudinal diameter, and fruit shape index), and categorical information (cultivar, maturity stage, acquisition date, and sampling location), using separate feature extraction networks and an adaptive fusion module. Seed oil content values obtained using standardized chemical analysis served as reference data. The MPCM-OC model achieved an overall coefficient of determination (R²) of 0.8353, with a mean absolute percentage error of 13.38 %, a mean absolute error of 4.52, and a root mean squared error of 6.30. Ablation and comparative analyses showed that incorporating morphometric and categorical features with image data consistently improved prediction accuracy over image-only models. The proposed framework serves as a rapid, low-cost complementary tool for preliminary screening and batch-level quality evaluation, enhancing efficiency in food composition analysis and quality control of Camellia oleifera.
基于图像、形态计量和分类特征的多模态深度学习预测油茶果实含油量
在油茶油工业中,准确测定油茶油含量对食品成分分析和质量控制至关重要,但传统的化学分析具有破坏性,难以大规模实施。本研究建立了基于参考化学测量的多模态油茶含油量预测模型(MPCM-OC),为油茶果实含油量评价提供了一种间接、非破坏性的方法。该框架利用独立的特征提取网络和自适应融合模块,将水果图像、形态特征(横向直径、纵向直径和水果形状指数)和分类信息(品种、成熟期、采集日期和采样位置)集成在一起。采用标准化化学分析得到的种子含油量值作为参考数据。MPCM-OC模型总体决定系数(R²)为0.8353,平均绝对百分比误差13.38 %,平均绝对误差4.52,均方根误差6.30。消融和比较分析表明,与仅使用图像的模型相比,将形态学特征和分类特征与图像数据结合可以持续提高预测精度。该框架可作为油茶初步筛选和批级质量评价的快速、低成本补充工具,提高油茶食品成分分析和质量控制的效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Food Composition and Analysis
Journal of Food Composition and Analysis 工程技术-食品科技
CiteScore
6.20
自引率
11.60%
发文量
601
审稿时长
53 days
期刊介绍: The Journal of Food Composition and Analysis publishes manuscripts on scientific aspects of data on the chemical composition of human foods, with particular emphasis on actual data on composition of foods; analytical methods; studies on the manipulation, storage, distribution and use of food composition data; and studies on the statistics, use and distribution of such data and data systems. The Journal''s basis is nutrient composition, with increasing emphasis on bioactive non-nutrient and anti-nutrient components. Papers must provide sufficient description of the food samples, analytical methods, quality control procedures and statistical treatments of the data to permit the end users of the food composition data to evaluate the appropriateness of such data in their projects. The Journal does not publish papers on: microbiological compounds; sensory quality; aromatics/volatiles in food and wine; essential oils; organoleptic characteristics of food; physical properties; or clinical papers and pharmacology-related papers.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书