基于语义分割和标签高效数据生成的苹果缺陷识别方法的改进

IF 8 2区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS

Engineering Applications of Artificial Intelligence Pub Date : 2025-10-21 DOI:10.1016/j.engappai.2025.112679

Jiwon Ryu , Sang-Yeon Kim , Chang-Hyup Lee , Gyumin Kim , Harin Jang , Taehyeong Kim , Suk-Ju Hong , Geon Hee Kim , Ghiseok Kim

{"title":"基于语义分割和标签高效数据生成的苹果缺陷识别方法的改进","authors":"Jiwon Ryu , Sang-Yeon Kim , Chang-Hyup Lee , Gyumin Kim , Harin Jang , Taehyeong Kim , Suk-Ju Hong , Geon Hee Kim , Ghiseok Kim","doi":"10.1016/j.engappai.2025.112679","DOIUrl":null,"url":null,"abstract":"<div><div>The increasing demand for automated fruit-sorting systems has driven the development of machine vision and deep learning technologies for postharvest grading of fruit defects. Effective sorting requires identifying not only the presence of defects but also their types and severity to avoid unnecessary rejection of marketable fruits with minor defects. This study applied deep learning-based semantic segmentation models to Fuji apple images, focusing on four defect types: cracks, bruises, diseases, and scars. Model performance was evaluated for both defect classification and severity estimation. To further improve performance, a label-efficient approach using generative adversarial networks was proposed to generate synthetic apple images and defect masks, reducing the need for extensive manual labeling to create a larger dataset. Qualitative and quantitative analyses of the generated results showed that the synthetic dataset successfully mimicked the biological characteristics of apples as well as the shape, position, and size of the defects. The segmentation model's ability to identify defects was enhanced by the proposed synthetic dataset. The R<sup>2</sup> values for defect severity estimation increased to 0.82, 0.85, 0.75, and 0.92 for cracks, bruises, diseases, and scars respectively, while F1-scores for defect classification reached 100, 94.3, 94.1, and 89.7 %. Furthermore, per-sample classification performance was enhanced with a binary F1-score of 95.9 % for defect presence and a multi-label accuracy of 93.9 % for defect types. This study clearly demonstrates that synthetic datasets generated using generative adversarial networks can substantially enhance both defect type classification and severity estimation in semantic segmentation-based apple defect identification models.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"163 ","pages":"Article 112679"},"PeriodicalIF":8.0000,"publicationDate":"2025-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancement of apple defect identification with semantic segmentation and label-efficient data generation\",\"authors\":\"Jiwon Ryu , Sang-Yeon Kim , Chang-Hyup Lee , Gyumin Kim , Harin Jang , Taehyeong Kim , Suk-Ju Hong , Geon Hee Kim , Ghiseok Kim\",\"doi\":\"10.1016/j.engappai.2025.112679\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The increasing demand for automated fruit-sorting systems has driven the development of machine vision and deep learning technologies for postharvest grading of fruit defects. Effective sorting requires identifying not only the presence of defects but also their types and severity to avoid unnecessary rejection of marketable fruits with minor defects. This study applied deep learning-based semantic segmentation models to Fuji apple images, focusing on four defect types: cracks, bruises, diseases, and scars. Model performance was evaluated for both defect classification and severity estimation. To further improve performance, a label-efficient approach using generative adversarial networks was proposed to generate synthetic apple images and defect masks, reducing the need for extensive manual labeling to create a larger dataset. Qualitative and quantitative analyses of the generated results showed that the synthetic dataset successfully mimicked the biological characteristics of apples as well as the shape, position, and size of the defects. The segmentation model's ability to identify defects was enhanced by the proposed synthetic dataset. The R<sup>2</sup> values for defect severity estimation increased to 0.82, 0.85, 0.75, and 0.92 for cracks, bruises, diseases, and scars respectively, while F1-scores for defect classification reached 100, 94.3, 94.1, and 89.7 %. Furthermore, per-sample classification performance was enhanced with a binary F1-score of 95.9 % for defect presence and a multi-label accuracy of 93.9 % for defect types. This study clearly demonstrates that synthetic datasets generated using generative adversarial networks can substantially enhance both defect type classification and severity estimation in semantic segmentation-based apple defect identification models.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"163 \",\"pages\":\"Article 112679\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625027101\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625027101","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 0

摘要

对自动化水果分拣系统的需求日益增长，推动了机器视觉和深度学习技术的发展，用于水果采后缺陷分级。有效的分选不仅需要识别缺陷的存在，还需要识别缺陷的类型和严重程度，以避免对有轻微缺陷的可销售水果进行不必要的拒收。本研究将基于深度学习的语义分割模型应用于富士苹果图像，重点关注四种缺陷类型：裂纹、瘀伤、疾病和疤痕。对模型的性能进行了缺陷分类和严重性评估。为了进一步提高性能，提出了一种使用生成对抗网络的高效标签方法来生成合成苹果图像和缺陷蒙版，从而减少了创建更大数据集时大量手动标记的需要。对生成的结果进行定性和定量分析表明，合成数据集成功地模拟了苹果的生物学特性以及缺陷的形状、位置和大小。提出的合成数据集增强了分割模型识别缺陷的能力。对于裂纹、瘀伤、疾病和疤痕，缺陷严重程度估计的R2值分别增加到0.82、0.85、0.75和0.92，而缺陷分类的f1得分分别达到100、94.3、94.1和89.7%。此外，每个样本的分类性能得到了提高，缺陷存在的二进制f1得分为95.9%，缺陷类型的多标签准确率为93.9%。该研究清楚地表明，在基于语义分割的苹果缺陷识别模型中，使用生成对抗网络生成的合成数据集可以大大增强缺陷类型分类和严重程度估计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Enhancement of apple defect identification with semantic segmentation and label-efficient data generation

The increasing demand for automated fruit-sorting systems has driven the development of machine vision and deep learning technologies for postharvest grading of fruit defects. Effective sorting requires identifying not only the presence of defects but also their types and severity to avoid unnecessary rejection of marketable fruits with minor defects. This study applied deep learning-based semantic segmentation models to Fuji apple images, focusing on four defect types: cracks, bruises, diseases, and scars. Model performance was evaluated for both defect classification and severity estimation. To further improve performance, a label-efficient approach using generative adversarial networks was proposed to generate synthetic apple images and defect masks, reducing the need for extensive manual labeling to create a larger dataset. Qualitative and quantitative analyses of the generated results showed that the synthetic dataset successfully mimicked the biological characteristics of apples as well as the shape, position, and size of the defects. The segmentation model's ability to identify defects was enhanced by the proposed synthetic dataset. The R² values for defect severity estimation increased to 0.82, 0.85, 0.75, and 0.92 for cracks, bruises, diseases, and scars respectively, while F1-scores for defect classification reached 100, 94.3, 94.1, and 89.7 %. Furthermore, per-sample classification performance was enhanced with a binary F1-score of 95.9 % for defect presence and a multi-label accuracy of 93.9 % for defect types. This study clearly demonstrates that synthetic datasets generated using generative adversarial networks can substantially enhance both defect type classification and severity estimation in semantic segmentation-based apple defect identification models.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Engineering Applications of Artificial Intelligence 工程技术-工程：电子与电气

CiteScore

9.60

自引率

10.00%

发文量

505

审稿时长

68 days

期刊介绍： Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.