High-quality, low-quantity: A data-centric approach to deep learning performance optimization in digital X-Ray radiography

IF 4.1 · CAS Zone 2 (Materials Science) · JCR Q1: MATERIALS SCIENCE, CHARACTERIZATION & TESTING
Bata Hena, Ziang Wei, Clemente Ibarra-Castanedo, Xavier Maldague
NDT & E International, Volume 152, Article 103327. Published 2025-01-29.
DOI: 10.1016/j.ndteint.2025.103327
URL: https://www.sciencedirect.com/science/article/pii/S0963869525000088
Citations: 0

Abstract

The accuracy of defect identification by specialized deep learning models depends on the conditions under which the training data are curated. This is especially evident in digital X-ray radiography, where the depiction of flaws is strongly affected by the exposure conditions. This study examines the effect of curating high-quality data on deep learning models. Variation in the contrast-to-noise ratio (CNR), a key image-quality metric that relates a feature of interest to the adjacent normal background, was found to be a key factor in model generalization in digital X-ray radiography applications. Systematically altering the exposure conditions during data curation yielded several representations of the flaws in each test component, with varying CNR in the resulting radiographs. To evaluate model performance under various conditions, two distinct datasets were curated. Dataset 1 was acquired with a consistent exposure setting on 140 test samples. The samples contained 4 morphologically distinct classes of flat-bottom holes with seven different depths and sizes. In Dataset 1, the differences in CNR among flaws can be attributed only to differences in depth. Dataset 2 was curated with an expanded range of CNR values by methodically adjusting the exposure settings during image acquisition. Consequently, only 42 % of the test pieces from Dataset 1, those with three distinct depths of flat-bottom holes, were used for Dataset 2. Each of the two datasets was used separately to train YOLOv8 for instance segmentation and U-Net for multi-class semantic segmentation. Each model was trained under the same conditions, and performance was assessed on test sets from both datasets. The model trained on Dataset 1 showed a notable decline in performance when evaluated on test sets from Dataset 2, suggesting a lack of generalization ability. Conversely, the model trained on Dataset 2 consistently achieved high accuracy on both test sets, demonstrating successful generalization. This work shows that the generalization ability of deep learning models may be improved by varying the CNR of features in the training data, a finding that paves the way for practical applications in digital X-ray radiography.
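
The abstract does not state the exact CNR formula the authors used. As a rough illustration only, the sketch below assumes one common definition: the absolute difference between the mean grey value of the feature and that of the adjacent background, divided by the standard deviation of the background. The function name `cnr` and all pixel values are hypothetical and are not taken from the paper.

```python
from statistics import mean, pstdev


def cnr(feature_pixels, background_pixels):
    """Contrast-to-noise ratio under an ASSUMED common definition:
    |mean(feature) - mean(background)| / std(background).
    The paper's own definition may differ."""
    noise = pstdev(background_pixels)  # population std of the background region
    if noise == 0:
        raise ValueError("background has zero noise; CNR is undefined")
    return abs(mean(feature_pixels) - mean(background_pixels)) / noise


# Hypothetical grey values for one background patch and two flaw regions.
background = [100, 102, 98, 100, 101, 99]
flaw_a = [103, 104, 102, 103]  # low contrast against the background
flaw_b = [110, 112, 111, 109]  # higher contrast against the same background

print(f"CNR of flaw_a: {cnr(flaw_a, background):.2f}")
print(f"CNR of flaw_b: {cnr(flaw_b, background):.2f}")
```

Under this definition, changing the exposure alters both the contrast and the background noise, so the same physical flaw can yield a range of CNR values across acquisitions, which is the kind of variation Dataset 2 is described as containing.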
Source journal: NDT & E International (Engineering & Technology, Materials Science: Characterization & Testing)
CiteScore: 7.20
Self-citation rate: 9.50%
Articles published: 121
Review time: 55 days
Journal description: NDT&E International publishes peer-reviewed results of original research and development in all categories of nondestructive testing and evaluation, including ultrasonics, electromagnetics, radiography, and optical and thermal methods. In addition to traditional NDE topics, the emerging technology area of inspection of civil structures and materials is also emphasized. The journal publishes original papers on research and development of new inspection techniques and methods, as well as on novel and innovative applications of established methods. Papers on NDE sensors and their applications for both inspection and process control, and papers describing novel NDE systems for structural health monitoring and their performance in industrial settings, are also considered. Other regular features include international news, new equipment, and a calendar of forthcoming worldwide meetings. This journal is listed in Current Contents.