{"title":"Deep learning for image segmentation: Ultrasound image segmentation of thyroid nodules based on U_Net","authors":"Xueting Zhou, Yan Chen, Shoushan Liu","doi":"10.1145/3577117.3577144","DOIUrl":"https://doi.org/10.1145/3577117.3577144","url":null,"abstract":"The purpose of this article is to investigate the value of deep learning algorithms in the application of ultrasound images of thyroid nodules. Using a dataset of 7288 ultrasound images of thyroid nodules provided by the MICCAI 2020 Challenge, based on the U_Net framework, incorporating a multiscale input mechanism and improving loss optimization function, through continuous training to find the optimal model, so that the computer can autonomously segment the thyroid nodules. The segmentation accuracy reaches 0.955, and the network has good segmentation performance.","PeriodicalId":309874,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Image Processing","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128726357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Water Surface Oil Film Inversion Method Based on Infrared Imager","authors":"Shuai Xu, Jin Xu, Bo Li, Rong Chen","doi":"10.1145/3577117.3577124","DOIUrl":"https://doi.org/10.1145/3577117.3577124","url":null,"abstract":"Oil spills bring great negative impact on marine ecology and social economy. At present, remote sensing technology is an important tool for marine oil spill monitoring. In this paper, a marine oil film inversion method using infrared thermal imager was proposed to support sustainable tracking in the oil spill accident. Firstly, different amounts of oils were added into the experimental containers with the same water. Secondly, visible light images, infrared images and environmental data were collected continuously. Then, the infrared images were segmented for oil film detection by using Otsu, InterModes and ISODATA threshold methods. Finally, the effects of different adaptive threshold methods for oil film detection were compared. The results shown that the oil film inversion effect based on ISODATA threshold performed more superior.","PeriodicalId":309874,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Image Processing","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116867190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yujie Fang, Junfan Wang, Yi Chen, Mingyu Gao, Hongtao Zhou, Yaonong Wang
{"title":"An Attention Based Network for Two-dimensional Hand Pose Estimation","authors":"Yujie Fang, Junfan Wang, Yi Chen, Mingyu Gao, Hongtao Zhou, Yaonong Wang","doi":"10.1145/3577117.3577133","DOIUrl":"https://doi.org/10.1145/3577117.3577133","url":null,"abstract":"Accurate visual hand pose estimation at the joint level has been used in vision-based Human-Computer interaction (HCI) applications in a number of areas. However, current 2D hand pose estimation tends to focus on high accuracy prediction or fast speed prediction, which does not allow detectors to achieve both fast and accurate pose estimation. In this paper, we combine RepVGG with a self-attention mechanism proposed an improved network we called ARepNet. ArepNet doubled the speed of the network model by re-parameterized network and capturing long-range dependencies by connecting information from different places, thereby achieving an accuracy rate of 86.8%. We add a 2D hand pose dataset in low-light contexts and propose a simple contrast enhancement method to make 2D hand pose estimation robust to picture input in different environments. We have successfully deployed ARepNet to embedded devices, which FPS with 139 frames per second, meeting real-time requirements.","PeriodicalId":309874,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Image Processing","volume":"40 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114042726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Characterization Test for Multivariate Scale Mixtures of Skew- Normal Distributions","authors":"Yan Su, Yue Wu, Jiangchao Zhu","doi":"10.1145/3577117.3577138","DOIUrl":"https://doi.org/10.1145/3577117.3577138","url":null,"abstract":"Based on the canonical form of multivariate scale mixtures of skew- normal (SMSN) distributions, a characterization test for multivariate SMSN distributions is proposed. The limiting null distribution of the transformed sample vectors is obtained and a simulation study is performed. Based on properties of spherical distributions and bootstrap approximation, an algorithm is given to estimate the critical values of the test statistic for the finite sample size. Moreover, a test statistic is constructed for testing skewness of the multivariate SMSN distributions.","PeriodicalId":309874,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Image Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114485810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mingfa Wu, Fengyi Li, Yaoyao Liu, Yao Huang, Yiting Zhou, Xianjun Pan
{"title":"Remote Sensing Information Extraction and Dynamic Change Analysis of Leizhou Peninsula Coastline","authors":"Mingfa Wu, Fengyi Li, Yaoyao Liu, Yao Huang, Yiting Zhou, Xianjun Pan","doi":"10.1145/3577117.3577143","DOIUrl":"https://doi.org/10.1145/3577117.3577143","url":null,"abstract":"With the rapid expansion of the scale of urbanization and the rapid economic and social development of coastal areas, the use changes of the coastline and coastal areas of the Leizhou Peninsula is increasing. Employing envi5.1 and arcgis10.2 tools, the normalized difference water index (NDWI) was used to process the image, and the water and land separation was carried out according to the threshold segmentation method. The data of the coastline in 2001, 2008, 2015 and 2020 were extracted from the interpretation of the remote sensing image maps, and the results of the automatic computer interpretation were examined using the fixed-sample visual interpretation method to analyze the coastline dynamics of the Leizhou Peninsula in the past 20 years. The results showed that: (1) From 2001 to 2020, the total length of the coastline of Leizhou Peninsula showed an increasing trend, a total increase of 95.98km, and annual variation was 4.80km; (2) From 2001 to 2008, the coastline increased the most, with an increase of 44.25km; from 2008 to 2015, the increase of the coastline was smaller, with an increase of 18.69km; from 2015 to 2020, the coastline increased by 33.04km; (3) The areas with large changes in coastline length were Xuwen county, Leizhou city, and Potou district, which increased by 38.29km, 34.21km, and 18.86km respectively; (4) The areas with the most complex coastline changes were mainly concentrated in Potou district, Xuwen county, Leizhou city and Suixi county for economic development, tourism development, marine aquaculture and other areas. In this regard, it was proposed to strengthen the basic dynamic information monitoring of the coastline, carry out the repair and survey of the coastline in a timely manner, grasp the utilization information of the coastline, and implement the task of protecting the coastline resources.","PeriodicalId":309874,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Image Processing","volume":"168 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115292718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Bo Hu, Donghao Zhou, Quhua Wu, Jinrong Dong, Sheng Kuang, Jie Huang
{"title":"An Improved Defect Detection Algorithm for Industrial Products via Lightweight Convolutional Neural Network","authors":"Bo Hu, Donghao Zhou, Quhua Wu, Jinrong Dong, Sheng Kuang, Jie Huang","doi":"10.1145/3577117.3577140","DOIUrl":"https://doi.org/10.1145/3577117.3577140","url":null,"abstract":"Aiming at the problem that the existing computer vision detection algorithm based on deep learning consumes a lot of memory and computing resources, this paper improves the structure of convolutional neural network and proposes a lightweight algorithm for defect detection of industrial products by network pruning. The proposed algorithm uses the residual network to divide VGG-16 into different residual modules, introduces the sparse constraint of penalty factor and the attenuation constraint of weight matrix to measure the importance of each residual module, and cuts the residual modules with low importance, so as to greatly reduce the number of parameter learning in the deep residual network. Experiments show that this method can retain the accuracy, precision, recall and F1 score of the original network, and greatly improve the speed of network training to meet the real-time needs of product appearance defect detection.","PeriodicalId":309874,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Image Processing","volume":"93 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126202552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Keqiao Huang, Linyan Ling, Tianlang Tan, Jin Zhan, Zhenmeng Yue, Si Tang, Zhiyong Lin, Guiyuan Xie
{"title":"A Real Time Mask Wearing Detection Based on Lightweight CenterNet in Complex Scenes","authors":"Keqiao Huang, Linyan Ling, Tianlang Tan, Jin Zhan, Zhenmeng Yue, Si Tang, Zhiyong Lin, Guiyuan Xie","doi":"10.1145/3577117.3577130","DOIUrl":"https://doi.org/10.1145/3577117.3577130","url":null,"abstract":"CenterNet is a one-stage target detector based on key points with high detection accuracy. However, its backbone is Hourglass network with a large number of parameters, the recognition speed is slow and cannot be recognized in real time. In this paper, we proposes a lightweight Hourglass network based on CenterNet model for mask wearing detection. Firstly, we adopts the depth wise separable convolution network in the reverse residual block of the Hourglass network. In the upsampling and downsampling block, different stride set and two branches are used to reduce the number of model parameters and improve the detection speed. Secondly, we redefine the focal loss function which can correlate the loss values of two Hourglass networks and complement each other to improve the accuracy of difficult targets in complex environments. Finally, in order to improve the test robustness of the method, we constructed a data set of masks under different challenge scenarios. The experimental results show that the average accuracy of our method is 0.922 and the parameters are reduced to 1/25 of CenterNet, and the detection speed is increased by nearly 3 times. Our method can achieve real-time mask wearing detection in videos with better robustness, which provides practicality for deploying the network model to mobile terminals.","PeriodicalId":309874,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Image Processing","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132172528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Hyperspectral Image Residual Denoising Network Based on Mixed-Domain Attention Mechanism","authors":"Huan Yang, Juan Xu, Kunhua Liu, Xinyu Lin","doi":"10.1145/3577117.3577137","DOIUrl":"https://doi.org/10.1145/3577117.3577137","url":null,"abstract":"Hyperspectral images (HSIs) contain not only spatial information, but also detail information on spectrum that reflects the internal features of objects, which can be used to monitor crop growth, for example. It is noteworthy that noises are inevitably introduced in the obtained HSIs due to the imperfection of imaging equipment and data transmission process, which will probably lead to misjudging the species of objects. Currently, HSIs-denoising methods based on deep learning have received considerable amount of attention and achieved promising results. However, these methods did not consider the interdependence among the three domains of HSIs. Based on this, we present a mixed-domain attention-based residual denoising network (for short named MA-RDN), so as to better the noises suppression by taking all the three domains into consideration. Different from existing methods, we introduce a mixed-domain attention module, which consists of three branches, respectively modeling the correlation between two of the domains. In this way, the model is guided to simultaneously focus on all the cross-domain features that are influential in denoising tasks. We take the average value of the three branches as the module's output. Then, a sparse feature extraction subnetwork is designed to preserve spatial-spectral features of HSIs as many as possible, which contains several multiscale structures and channel attentions. In order to avoid the gradient disappearance and model degradation caused by the deepening of the network, we utilize two weighted skip connections in the output. Simulation experiments show that, in different noise conditions, the peak signal-to-noise ratio PSNR of our method is increased of about 1.6 that compared with the Cao et al's GRN [11] method, and the structural similarity SSIM is slightly better than it.","PeriodicalId":309874,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Image Processing","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127981607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"On Greedy Kaczmarz Method with Uniform Sampling for Consistent Tensor Linear Systems Based on T-Product: tGK method for consistent tensor linear systems","authors":"Yimou Liao, Tianxiu Lu","doi":"10.1145/3577117.3577127","DOIUrl":"https://doi.org/10.1145/3577117.3577127","url":null,"abstract":"Solving large system of tensor linear equations is a fundamental problem in mathematics. This paper proposes a sampling tensor greedy Kaczmarz method (tGK) to solve large-scale linear systems with a t-product structure by introducing an effective greedy criterion, which eliminates the entry with the largest residual in the submatrix system per iteration. Then a relaxed tensor greedy Kaczmarz method (tRGK (ω)) is obtained by introducing the relaxation parameter ω to tGK, which can effectively change the convergence rate. The linear convergence of the two methods is guaranteed when the tensor linear system is consistent. Several experiments show that the methods designed in this paper converge faster compared with tensor randomized Kaczmarz (tRK). Moreover, selecting appropriate parameters ω can improve the convergence rate of tGK.","PeriodicalId":309874,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Image Processing","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121447396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xinyu Cheng, Zhaoyi Li, Yuanyuan Zhu, Yanqing Wang
{"title":"Research on Campus Indoor and Outdoor Unmanned Vehicle Navigation Technology","authors":"Xinyu Cheng, Zhaoyi Li, Yuanyuan Zhu, Yanqing Wang","doi":"10.1145/3577117.3577128","DOIUrl":"https://doi.org/10.1145/3577117.3577128","url":null,"abstract":"In view of the technical difficulties in transporting information between different buildings for various sections on campus, this paper proposes a campus unmanned vehicle system. The system resolves the latitude and longitude of the location of the unmanned vehicle through APIs, and combines perimeter, city-wide, and rectangular range (on- screen) information based on a large amount of dynamic location (POI) data from geographic services to meet the location search needs of different scenarios. The Harris-SIFT algorithm is also applied to explore how the unmanned vehicle can work effectively in the indoor-outdoor environment of the campus. Through experimental verification, the unmanned vehicle navigation technology proposed in this paper can satisfy the unmanned vehicle's freedom to switch between indoor and outdoor environments and can perform delivery tasks accurately and stably.","PeriodicalId":309874,"journal":{"name":"Proceedings of the 6th International Conference on Advances in Image Processing","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127764924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}