Xueyan Zhu , Huaiqing Zhang , Xue Zhang , Tiantian Ye , Yili Zheng
{"title":"Multimodal deep learning for oil content prediction in Camellia oleifera fruits using image, morphometric, and categorical features","authors":"Xueyan Zhu , Huaiqing Zhang , Xue Zhang , Tiantian Ye , Yili Zheng","doi":"10.1016/j.jfca.2026.108995","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate determination of oil content is essential for food composition analysis and quality control in the <em>Camellia</em> oil industry, yet conventional chemical analyses are destructive and difficult to implement at large scale. In this study, a multimodal oil content prediction model (MPCM-OC) was developed as an indirect, non-destructive approach to support oil content assessment in <em>Camellia oleifera</em> fruits based on reference chemical measurements. The proposed framework integrates fruit images, morphometric traits (transverse diameter, longitudinal diameter, and fruit shape index), and categorical information (cultivar, maturity stage, acquisition date, and sampling location), using separate feature extraction networks and an adaptive fusion module. Seed oil content values obtained using standardized chemical analysis served as reference data. The MPCM-OC model achieved an overall coefficient of determination (<em>R</em>²) of 0.8353, with a mean absolute percentage error of 13.38 %, a mean absolute error of 4.52, and a root mean squared error of 6.30. Ablation and comparative analyses showed that incorporating morphometric and categorical features with image data consistently improved prediction accuracy over image-only models. The proposed framework serves as a rapid, low-cost complementary tool for preliminary screening and batch-level quality evaluation, enhancing efficiency in food composition analysis and quality control of <em>Camellia oleifera</em>.</div></div>","PeriodicalId":15867,"journal":{"name":"Journal of Food Composition and Analysis","volume":"151 ","pages":"Article 108995"},"PeriodicalIF":4.6000,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Composition and Analysis","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0889157526001389","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/2/13 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate determination of oil content is essential for food composition analysis and quality control in the Camellia oil industry, yet conventional chemical analyses are destructive and difficult to implement at large scale. In this study, a multimodal oil content prediction model (MPCM-OC) was developed as an indirect, non-destructive approach to support oil content assessment in Camellia oleifera fruits based on reference chemical measurements. The proposed framework integrates fruit images, morphometric traits (transverse diameter, longitudinal diameter, and fruit shape index), and categorical information (cultivar, maturity stage, acquisition date, and sampling location), using separate feature extraction networks and an adaptive fusion module. Seed oil content values obtained using standardized chemical analysis served as reference data. The MPCM-OC model achieved an overall coefficient of determination (R²) of 0.8353, with a mean absolute percentage error of 13.38 %, a mean absolute error of 4.52, and a root mean squared error of 6.30. Ablation and comparative analyses showed that incorporating morphometric and categorical features with image data consistently improved prediction accuracy over image-only models. The proposed framework serves as a rapid, low-cost complementary tool for preliminary screening and batch-level quality evaluation, enhancing efficiency in food composition analysis and quality control of Camellia oleifera.
期刊介绍:
The Journal of Food Composition and Analysis publishes manuscripts on scientific aspects of data on the chemical composition of human foods, with particular emphasis on actual data on composition of foods; analytical methods; studies on the manipulation, storage, distribution and use of food composition data; and studies on the statistics, use and distribution of such data and data systems. The Journal''s basis is nutrient composition, with increasing emphasis on bioactive non-nutrient and anti-nutrient components. Papers must provide sufficient description of the food samples, analytical methods, quality control procedures and statistical treatments of the data to permit the end users of the food composition data to evaluate the appropriateness of such data in their projects.
The Journal does not publish papers on: microbiological compounds; sensory quality; aromatics/volatiles in food and wine; essential oils; organoleptic characteristics of food; physical properties; or clinical papers and pharmacology-related papers.