{"title":"A sample diversity and identity consistency based cross-modality model for visible-infrared person re-identification","authors":"Jia Sun, Yanfeng Li, Houjin Chen, Yahui Peng","doi":"10.1117/12.2644355","DOIUrl":"https://doi.org/10.1117/12.2644355","url":null,"abstract":"Visible-infrared person re-identification (VI-ReID) aims to search person images across cameras of different modalities, which can address the limitation of visible-based ReID in dark environments. It is a very challenging task, as images of the same identity have huge discrepancy in different modalities. To address this problem, a cross-modality ReID model based on sample diversity and identity consistency is proposed in this paper. For sample diversity, auxiliary images are introduced based on the idea of information transfer. The auxiliary images combine the information of visible images and infrared images, and can improve the diversity of input data and robustness of the network. For identity consistency, homogeneous distance loss and heterogeneous distance loss are developed from four different perspectives to shorten the distance between the samples of same identities. Extensive experimental results demonstrate the effectiveness of the proposed method.","PeriodicalId":314555,"journal":{"name":"International Conference on Digital Image Processing","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121102727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
M. Farooq, Zhaoxuan Gong, Yu Liu, Muhammad Zubair, Arslan Manzoor, Guodong Zhang
{"title":"Breast cancer detection from ultrasound images using attention U-nets model","authors":"M. Farooq, Zhaoxuan Gong, Yu Liu, Muhammad Zubair, Arslan Manzoor, Guodong Zhang","doi":"10.1117/12.2643599","DOIUrl":"https://doi.org/10.1117/12.2643599","url":null,"abstract":"Breast cancer is the most common form of invasive cancer in women. In recent years, it has become standard practise to perform breast mass evaluations using ultrasound (US) imaging. US can accurately distinguish between malignant and benign breast masses when used by skilled radiologists, as compared to other medical imaging modalities such as MRI. Human domain knowledge is difficult to incorporate into the diagnosis of breast tumours because it differs greatly from person to person in terms of shape, border, curve, intensity, and other commonly used medical priors. A deep learning model that incorporates visual saliency can now be used to segment breast tumours in ultrasound images. Radiologists use the term \"visual saliency,\" which refers to areas of an image that are more likely to be noticed. Features that prioritise spatial regions with high saliency levels are learned using the proposed method. According to validation results, tumours are more accurately identified in models that include attention layers than those without them. The salient attention model has the potential to improve medical image analysis accuracy and robustness by allowing deep learning architectures to incorporate task-specific knowledge. AUC-ROC plots show that our new model is more accurate in terms of IOU and AUC-ROC scores, dice score, precision, recall, and IOU.","PeriodicalId":314555,"journal":{"name":"International Conference on Digital Image Processing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129521701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Maciej Zaborowicz, Katarzyna Zaborowicz, B. Biedziak
{"title":"Metrical age assessment using image analysis and artificial neural networks","authors":"Maciej Zaborowicz, Katarzyna Zaborowicz, B. Biedziak","doi":"10.1117/12.2643001","DOIUrl":"https://doi.org/10.1117/12.2643001","url":null,"abstract":"Computer imaging methods are widely used in medical related problems. Imaging is readily used for diagnostic purposes due to its availability, non-invasiveness, and high quality. Due to the great number of medical conditions, as well as due to the frequent lack of qualified medical staff, there has been a need to automate the evaluation of radiological examinations. Therefore, a quickly growing branch of science is the neural analysis of medical images. This paper presents the possibility of using computer image analysis and neural modeling methods in the assessment of metric age of children and adolescents from digital pantomographic images. The analog methods used in the clinical assessment of the patient’s chronological age are subjective and characterized by low accuracy. The paper presents the possibility of using RBF networks and deep learning in the assessment of the metric age of children aged from 4 to 15 years. As a result, two neural models with quality ranging from 97 to 99% were obtained.","PeriodicalId":314555,"journal":{"name":"International Conference on Digital Image Processing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129871537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hongtu Xie, Xiao Hu, Xinqiao Jiang, Zhitao Wu, Jinfeng He, Kai Xie, Guoqian Wang
{"title":"Low frequency ultra-wideband BSAR electromagnetic scattering characteristic","authors":"Hongtu Xie, Xiao Hu, Xinqiao Jiang, Zhitao Wu, Jinfeng He, Kai Xie, Guoqian Wang","doi":"10.1117/12.2644365","DOIUrl":"https://doi.org/10.1117/12.2644365","url":null,"abstract":"Low frequency ultra-wideband bistatic synthetic aperture radar (UWB BSAR) system is able to penetrate the foliage, get the high-resolution BSAR image, and offer the increased target information. In this paper, the low frequency UWB BSAR electromagnetic scattering characteristic is analyzed. First, the target under the foliage are modeled and discussed. Moreover, the method of moment (MoM) is proposed for the electromagnetic scattering characteristic. Finally, the simulation experiment is conducted for the modeling and analyzing of the electromagnetic scattering characteristic of the targets, which verifies the correctness of the low frequency UWB BSAR electromagnetic scattering characteristic.","PeriodicalId":314555,"journal":{"name":"International Conference on Digital Image Processing","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134243972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"RSFS: A soft biometrics-based relative support features set for person verification","authors":"Bilal Hassan, E. Izquierdo","doi":"10.1117/12.2644457","DOIUrl":"https://doi.org/10.1117/12.2644457","url":null,"abstract":"Generally, biometrics is gaining increased attention due to its application for secure and efficient verification – more specifically at border crossing points. Usually, there are many different types of biometrics associated with human body i.e., intrusive like finger prints etc. and non-intrusive, termed as soft biometrics. In order to make the concept of Smart Borders a reality, the non-intrusive soft biometrics are the baseline technology. One of biggest challenge in soft biometrics based verification is to find a highly related set of features from different modalities of human body – as there is large number such soft biometrics associated with human body. In fact, this is extremely useful to select only those soft biometrics which are supportive to each other and relevant to the problem domain. In our work, we thoroughly investigated one of the largest collection of soft biometrics and developed a multiple non-linear regression based framework for the selection of highly supportive and relevant soft biometrics. We used one of the largest dataset e.g., PETA and its annotation for the evaluation of our proposed model. The accuracy is reported in form of MAE and error distribution graphs for two global soft biometrics i.e., gender and age prediction.","PeriodicalId":314555,"journal":{"name":"International Conference on Digital Image Processing","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132708104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Temperature imaging network based on swin transformer for TDLAS tomography","authors":"Jingjing Si, Aiting Wang, Yinbo Cheng","doi":"10.1117/12.2643397","DOIUrl":"https://doi.org/10.1117/12.2643397","url":null,"abstract":"Most of existing data-driven temperature imaging schemes for Tunable Diode Laser Absorption Spectroscopy (TDLAS) tomography are based on Convolutional Neural Network (CNN). However, some studies on CNN show that its actual perceptual field is much smaller than the theoretical one, which makes it not conducive for CNN to capture features from contextual information at long distance. In this work, a temperature imaging network based on Swin Transformer is established. To introduce cross-window connections while maintaining the efficient computation of local non-overlapped windows, Multi-headed Self-Attention (MSA) is computed alternatively in regularly partitioned windows and shifted windows. Simulation results show that the proposed network can reconstruct temperature images of higher quality than schemes based on CNN and Extreme Learning Machine (ELM) respectively.","PeriodicalId":314555,"journal":{"name":"International Conference on Digital Image Processing","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132889206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Video stabilization based on GMS and warping transform","authors":"Jiwen Liu, Qian Huang, Yiming Wang, Chuanxu Jiang, Mingzhou Shang","doi":"10.1117/12.2644293","DOIUrl":"https://doi.org/10.1117/12.2644293","url":null,"abstract":"Video stabilization is a video enhancement technology that improves the original video quality by eliminating unnecessary camera motion. In the last decade of research, video stabilization has changed from a simple solution aimed at computational simplicity to a complex solution aimed at stabilization effects. We propose a novel method based on Grid-based Motion Statistics(GMS) and warping transformation, stabilizing video with less cropping. Specifically, feature points are firstly matched by GMS, and RANSAC is applied within each frame to estimate the motion vectors accurately. Furthermore, we incorporate predicted adaptive path smoothing to produce stable trajectories and generate stable video with warping transformation. Moreover, to the best of our knowledge, the proposed algorithm has less cropping and better stability than previous work. The experimental results demonstrate the performance of our method on a large variety of consumer videos.","PeriodicalId":314555,"journal":{"name":"International Conference on Digital Image Processing","volume":"120 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134543182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xian-Lung Tang, Liang Zhao, Yanping Shuai, Zhang Li, Xingjun Wang
{"title":"An one-dimensional signal based object detection network for apnea and hypopnea locating","authors":"Xian-Lung Tang, Liang Zhao, Yanping Shuai, Zhang Li, Xingjun Wang","doi":"10.1117/12.2643701","DOIUrl":"https://doi.org/10.1117/12.2643701","url":null,"abstract":"Sleep-disordered breathing (SDB), a common sleep disorder, shows symptoms of shallow breathing or paused breathing during sleep called respiratory events. SDB was conventionally diagnosed based on overnight multi-channel polysomnography (PSG) in clinical treatment. However, this process requires experienced sleep technicians to annotate and is quite labour-intensive. In this study, a novel one-dimensional signal based object detection network was proposed for automatic, high efficiency detection and classification of different kinds of respiratory events from continuous PSG signals. Our method can locate respiratory events in PSG signal data and classify them into four categories for further clinical treatment. The method was further validated on a PSG clinical dataset collected from Beijing Tongren Hospital. Precision, recall and F1-score of 84.9%, 85.1%, 85.0% were achieved for events detection with total accuracy rate reaching 74.9% in classification of detected events. The result shows that one-dimensional signal object detection is a promising method to locate the characteristic waveform and extract signal features. Such method can be applied in other signal feature detection field.","PeriodicalId":314555,"journal":{"name":"International Conference on Digital Image Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133536305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An evaluation method of aggregate morphological characteristics based on two-dimensional digital image technique","authors":"Pei Sun, Zhenfeng Han","doi":"10.1117/12.2644604","DOIUrl":"https://doi.org/10.1117/12.2644604","url":null,"abstract":"Aggregate shape, angularity and surface texture are closely related to pavement performance of asphalt mixture. In order to quantitatively analyze the morphological characteristics of aggregates, the aggregate particle image was obtained by \"backlight scanning method\", and then noise removal, segmentation and hole filling are performed on the acquired image based on digital image processing technique. On the basis of above mentioned, a two-dimensional aggregate morphological characteristics evaluation system (AMCES) with low equipment requirements was developed. The shape property of aggregates were characterized by shape index (SI) and form factor (FF), and the angularity property and surface texture of aggregates were evaluated by angularity index (AI) and texture factor (TF) respectively. Finally, the morphological characteristics of 12 different standard shaped objects and limestone with 4 different sizes were analyzed. The test results show that the four evaluation parameters can describe the morphological characteristics of aggregate particles well. With the increase of particle size, the shape index decreases, the value of the form factor get closer and closer to 1, while the angularity index and texture factor both decrease gradually.","PeriodicalId":314555,"journal":{"name":"International Conference on Digital Image Processing","volume":"99 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129625689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"An effective anti-interference visual tracking method","authors":"Q. Fan, Yue Yang, E. Zou","doi":"10.1117/12.2644230","DOIUrl":"https://doi.org/10.1117/12.2644230","url":null,"abstract":"Tracking specific objects in images or videos is one of the most attractive problems in visual tasks. It is widely employed in security monitoring, automatic driving, military operations and other scenes. Recently, object tracker based on convolution neural network, especially Siamese network, obtains high accuracy and has been deeply studied. However, in practical application scenarios of visual tracking, when meets clutter background or the object is occluded, the accuracy of the tracking task will drop rapidly, and the tracker loses the target in extreme cases. It is particularly necessary to quickly and accurately relocate the target. Therefore, an anti-interference tracker based on Siamese convolution neural network is developed. Benefiting from the adaptive tracking confidence parameter, once the tracking effect of the tracker has dropped significantly during the tracking process, the location of the object will be corrected immediately. Experimental results show that the proposed method has the ability to relocate and track the target after occlusion or loss effectively.","PeriodicalId":314555,"journal":{"name":"International Conference on Digital Image Processing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133125513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}