{"title":"AI-Based Generative Model for Computer-Based X-Ray Inspection Training System","authors":"Junyoung Kim;Gihyun Kwon;Dohoon Ryu;Jong Chul Ye","doi":"10.1109/TIM.2025.3554859","DOIUrl":null,"url":null,"abstract":"X-ray baggage inspection systems have proven to be indispensable tools for swiftly and efficiently examining passengers’ belongings and inspecting goods during import-export processes at locations, such as airports and seaports. The ability to visualize the contents of bags without opening them streamlines the inspection process, enabling rapid assessments. The reliance on human judgment for interpreting X-ray images requires systematic training program, given the disparities between the actual shapes of items and their representations in X-ray images. In addition, the surge in travel demand and logistics movements necessitates a substantial number of highly trained inspectors. Computer-based training (CBT) systems offer a convenient solution for training inspectors, but current X-ray inspection CBT systems have limitations in data diversity and provide only restricted information, constraining the effectiveness of training. To address this, here we propose a novel AI-based data augmentation scheme for X-ray inspection CBT systems. Specifically, this article focuses on the AI generative model component, which is composed of three parts. First, we employ a latent diffusion model (LDM) to generate high-quality, diverse illicit item X-ray images. Second, by incorporating a novel neural support optimization from frontal and side-view images, a tuned voxel grid is obtained, enabling the creation of 3-D images that offer varied perspectives of items. Finally, the generated images undergo transformations, such as metal material emphasis, organic material emphasis, negative images, and variations in brightness to enhance detection capabilities, making it easier to identify items that were challenging to detect in conventional pseudo-color images. Inspector training with real X-ray data followed by training with generated X-ray data resulted in an increase in inspection accuracy and decrease in inspection time, confirming the effectiveness of training using generated images.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-13"},"PeriodicalIF":5.6000,"publicationDate":"2025-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10948186/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
X-ray baggage inspection systems have proven to be indispensable tools for swiftly and efficiently examining passengers’ belongings and inspecting goods during import-export processes at locations, such as airports and seaports. The ability to visualize the contents of bags without opening them streamlines the inspection process, enabling rapid assessments. The reliance on human judgment for interpreting X-ray images requires systematic training program, given the disparities between the actual shapes of items and their representations in X-ray images. In addition, the surge in travel demand and logistics movements necessitates a substantial number of highly trained inspectors. Computer-based training (CBT) systems offer a convenient solution for training inspectors, but current X-ray inspection CBT systems have limitations in data diversity and provide only restricted information, constraining the effectiveness of training. To address this, here we propose a novel AI-based data augmentation scheme for X-ray inspection CBT systems. Specifically, this article focuses on the AI generative model component, which is composed of three parts. First, we employ a latent diffusion model (LDM) to generate high-quality, diverse illicit item X-ray images. Second, by incorporating a novel neural support optimization from frontal and side-view images, a tuned voxel grid is obtained, enabling the creation of 3-D images that offer varied perspectives of items. Finally, the generated images undergo transformations, such as metal material emphasis, organic material emphasis, negative images, and variations in brightness to enhance detection capabilities, making it easier to identify items that were challenging to detect in conventional pseudo-color images. Inspector training with real X-ray data followed by training with generated X-ray data resulted in an increase in inspection accuracy and decrease in inspection time, confirming the effectiveness of training using generated images.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.