Jiwon Ryu , Sang-Yeon Kim , Chang-Hyup Lee , Gyumin Kim , Harin Jang , Taehyeong Kim , Suk-Ju Hong , Geon Hee Kim , Ghiseok Kim
{"title":"基于语义分割和标签高效数据生成的苹果缺陷识别方法的改进","authors":"Jiwon Ryu , Sang-Yeon Kim , Chang-Hyup Lee , Gyumin Kim , Harin Jang , Taehyeong Kim , Suk-Ju Hong , Geon Hee Kim , Ghiseok Kim","doi":"10.1016/j.engappai.2025.112679","DOIUrl":null,"url":null,"abstract":"<div><div>The increasing demand for automated fruit-sorting systems has driven the development of machine vision and deep learning technologies for postharvest grading of fruit defects. Effective sorting requires identifying not only the presence of defects but also their types and severity to avoid unnecessary rejection of marketable fruits with minor defects. This study applied deep learning-based semantic segmentation models to Fuji apple images, focusing on four defect types: cracks, bruises, diseases, and scars. Model performance was evaluated for both defect classification and severity estimation. To further improve performance, a label-efficient approach using generative adversarial networks was proposed to generate synthetic apple images and defect masks, reducing the need for extensive manual labeling to create a larger dataset. Qualitative and quantitative analyses of the generated results showed that the synthetic dataset successfully mimicked the biological characteristics of apples as well as the shape, position, and size of the defects. The segmentation model's ability to identify defects was enhanced by the proposed synthetic dataset. The R<sup>2</sup> values for defect severity estimation increased to 0.82, 0.85, 0.75, and 0.92 for cracks, bruises, diseases, and scars respectively, while F1-scores for defect classification reached 100, 94.3, 94.1, and 89.7 %. Furthermore, per-sample classification performance was enhanced with a binary F1-score of 95.9 % for defect presence and a multi-label accuracy of 93.9 % for defect types. This study clearly demonstrates that synthetic datasets generated using generative adversarial networks can substantially enhance both defect type classification and severity estimation in semantic segmentation-based apple defect identification models.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"163 ","pages":"Article 112679"},"PeriodicalIF":8.0000,"publicationDate":"2025-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancement of apple defect identification with semantic segmentation and label-efficient data generation\",\"authors\":\"Jiwon Ryu , Sang-Yeon Kim , Chang-Hyup Lee , Gyumin Kim , Harin Jang , Taehyeong Kim , Suk-Ju Hong , Geon Hee Kim , Ghiseok Kim\",\"doi\":\"10.1016/j.engappai.2025.112679\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The increasing demand for automated fruit-sorting systems has driven the development of machine vision and deep learning technologies for postharvest grading of fruit defects. Effective sorting requires identifying not only the presence of defects but also their types and severity to avoid unnecessary rejection of marketable fruits with minor defects. This study applied deep learning-based semantic segmentation models to Fuji apple images, focusing on four defect types: cracks, bruises, diseases, and scars. Model performance was evaluated for both defect classification and severity estimation. To further improve performance, a label-efficient approach using generative adversarial networks was proposed to generate synthetic apple images and defect masks, reducing the need for extensive manual labeling to create a larger dataset. Qualitative and quantitative analyses of the generated results showed that the synthetic dataset successfully mimicked the biological characteristics of apples as well as the shape, position, and size of the defects. The segmentation model's ability to identify defects was enhanced by the proposed synthetic dataset. The R<sup>2</sup> values for defect severity estimation increased to 0.82, 0.85, 0.75, and 0.92 for cracks, bruises, diseases, and scars respectively, while F1-scores for defect classification reached 100, 94.3, 94.1, and 89.7 %. Furthermore, per-sample classification performance was enhanced with a binary F1-score of 95.9 % for defect presence and a multi-label accuracy of 93.9 % for defect types. This study clearly demonstrates that synthetic datasets generated using generative adversarial networks can substantially enhance both defect type classification and severity estimation in semantic segmentation-based apple defect identification models.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"163 \",\"pages\":\"Article 112679\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625027101\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625027101","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Enhancement of apple defect identification with semantic segmentation and label-efficient data generation
The increasing demand for automated fruit-sorting systems has driven the development of machine vision and deep learning technologies for postharvest grading of fruit defects. Effective sorting requires identifying not only the presence of defects but also their types and severity to avoid unnecessary rejection of marketable fruits with minor defects. This study applied deep learning-based semantic segmentation models to Fuji apple images, focusing on four defect types: cracks, bruises, diseases, and scars. Model performance was evaluated for both defect classification and severity estimation. To further improve performance, a label-efficient approach using generative adversarial networks was proposed to generate synthetic apple images and defect masks, reducing the need for extensive manual labeling to create a larger dataset. Qualitative and quantitative analyses of the generated results showed that the synthetic dataset successfully mimicked the biological characteristics of apples as well as the shape, position, and size of the defects. The segmentation model's ability to identify defects was enhanced by the proposed synthetic dataset. The R2 values for defect severity estimation increased to 0.82, 0.85, 0.75, and 0.92 for cracks, bruises, diseases, and scars respectively, while F1-scores for defect classification reached 100, 94.3, 94.1, and 89.7 %. Furthermore, per-sample classification performance was enhanced with a binary F1-score of 95.9 % for defect presence and a multi-label accuracy of 93.9 % for defect types. This study clearly demonstrates that synthetic datasets generated using generative adversarial networks can substantially enhance both defect type classification and severity estimation in semantic segmentation-based apple defect identification models.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.