{"title":"Perception of Temporal Quality Fluctuation for HEVC Coded Surveillance Videos","authors":"Zhuang Wang, Yanchao Gong, Kaifang Yang","doi":"10.1145/3532342.3532355","DOIUrl":"https://doi.org/10.1145/3532342.3532355","url":null,"abstract":"The advanced high efficiency video coding (HEVC) standard has been widely used in video surveillance system which plays a critical role in the public security. In order to meet the real-time communication requirements of video surveillance system, the low-delay configuration in HEVC is usually used to encode surveillance videos. However, the low-delay configurations adopt hierarchical prediction structure which inevitably brings in the temporal quality fluctuation (TQF). And large TQF are easily perceived by human eyes which has significant impact on the quality and effective analysis (recognition, comparison, tracking, retrieval, etc.) of reconstructed surveillance videos. In this paper, the perceptual characteristics of TQF related to multiple perception degrees are analyzed for HEVC coded surveillance videos. These findings are foundations for further proposing strategies to reduce or even eliminate the perception of TQF in HEVC coded surveillance videos which helps to significantly improve the analysis efficiency of surveillance videos.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115115972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Hierarchical Vision Transformer with Channel Attention for RGB-D Image Segmentation","authors":"Yali Yang, Yuanping Xu, Chaolong Zhang, Zhijie Xu, Jian Huang","doi":"10.1145/3532342.3532352","DOIUrl":"https://doi.org/10.1145/3532342.3532352","url":null,"abstract":"Although convolutional neural networks (CNNs) have become the mainstream for image processing and achieved great success in the past decade, due to the local characteristics, CNN is difficult to obtain global and long-range semantical information. Moreover, in some scenes, the pure RGB image-based model is difficult to accurately identify the pixel classification and finely segment the edge of objects. This study presents a hierarchical vision Transformer model named Swin-RGB-D to incorporate and exploit the depth information in depth images to supplement and enhance the ambiguous and obscure features in RGB images. In this design, RGB and depth images are used as the two inputs of the two-branch network. The upstream branch applies the Swin Transform which is capable of learning global continuous information from RGB images for segmentation; the other branch performs channel attention on depth image to abstract the feature correlation and dependency between channels and generates a weight matrix. Then matrix multiplication on the feature maps in each stage of the down-sampling process is performed for weighted multi-modal feature extraction. Then this study adds the fused maps to the up-sampled feature maps of the corresponding size, which sufficiently compensates for the distortion of feature in the sampling process. The experiment results on the two benchmark datasets show that the proposed model makes the network more sensitive to edge information.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129781067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Method of Dual-spectrum Feature Fusion for Face Recognition Under Non-ideal Lighting Conditions","authors":"Da Ai, Weixin Fan, Kai Jia, Mingyue Lu, Y. Liu","doi":"10.1145/3532342.3532348","DOIUrl":"https://doi.org/10.1145/3532342.3532348","url":null,"abstract":"Face recognition technology is widely used in the field of public security. To improve the recognition accuracy under non-ideal lighting conditions, a face recognition method with dual-spectrum feature fusion is proposed using the property that the infrared spectrum is insensitive to visible light. The fused face images of visible and near-infrared spectra are obtained with the Non-Subsampled Shearlet Transform (NSST) algorithm, and then been put into the FaceNet as input and trained using transfer learning to renew the FaceNet model parameters for recognizing the fused face images. Compared with existing methods, experimental results show that the accuracy of face recognition is significantly improved under non-ideal lighting conditions which better meets the practical application requirements of public security.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126766225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Pedestrian Re-identification Algorithm Based on 3D Convolution and Non_Local Block","authors":"Xiaojun Bai, Feihu Jiang, Q. Zhao","doi":"10.1145/3532342.3532349","DOIUrl":"https://doi.org/10.1145/3532342.3532349","url":null,"abstract":"In the application of video-based pedestrian re-identification, introduced deep learning method to learn feature representation of pedestrian. In order to improve feature quality, introduced 3D convolution block as backbone network to aggregate temporal and spatial features; for issue of human body occlusion in video frames, introduced Non_Local block to capture long distance dependence between frames, and eventually eliminate the impact of occlusion. Optimal embedding scheme of 3D convolution and Non_Local block in backbone network is designed via experiments, and has proved that rich features of pedestrian can be extracted from video frames by this solution, which helps to improve the accuracy of re-identification.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"124 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125500222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dangerous Behavior Recognition Based on Pose Estimation and Action Analysis","authors":"Xiaojun Bai, Z. Wang, Zhiying Zhang","doi":"10.1145/3532342.3532343","DOIUrl":"https://doi.org/10.1145/3532342.3532343","url":null,"abstract":"In order to identify dangerous behaviors such as fighting in public area, proposed a deep learning model that combines human post estimation and action analysis for behavior recognition. The model takes video frame sequence as input, first make use of HRNet as backbone network to detect human body joints and generate human pose frame sequence, and then, make use of recurrent neural network for action analysis from the pose frame sequence, thus determine whether there is dangerous behavior in this video. In the first part, optimization was made on HRNet so as to reduce its parameter and improve the accuracy of joint location. In the second part, introduced Cubic LSTM as a comprehensive model for action recognition, which analysis the motion of human joints from both temporal sequence and spatial sequence, thus achieve better inference score. Experiments show that the recognition accuracy of the proposed method can reach 92.8%. In the last part, a dangerous behavior recognition system is developed based on this model, which use a monitoring host to analysis the captured video frames from cameras, and identify dangerous behavior from them automatically, thus serve for public security tasks.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125137739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multilayer Blind Equalization Algorithm Of Neural Network For QAM Signal","authors":"Chen-Yang Fan, Shuai Wang","doi":"10.1145/3532342.3532358","DOIUrl":"https://doi.org/10.1145/3532342.3532358","url":null,"abstract":"With the development of communication technology, the communication environment has become more and more complex, the linear channel environment no longer exists, and the neural network blind equalization algorithm has good effect on improving the communication quality. So the neural network blind equilibrium algorithm has also attracted a lot of attention. This paper focuses on the blind equalization algorithm of multilayer neural networks based on QAM(Quadrature Amplitude Modulation, a modulation method that performs amplitude modulation on two orthogonal carriers) signals. The iteration step factor with fixed parameters has slightly limitations on the convergence speed and MSE(Mean Square Error, is the average of the sum of the squares of the deviations of the data from the true value, that is, the average of the sums of the squares of the errors) in communication. In order to solve this problem, this paper proposes to transform the fixed-parameter iteration step factor into a variable iteration step factor that is associated with the MSE. In order to solve this problem, we take to change the iteration step factor of fixed parameters into a varying iteration step factor which is related to the mean square error, think of it as a variable parameter, so that the iteration step factor and the mean square error show a positive correlation, which can improve the convergence speed and the final convergence accuracy in the convergence process. Through algorithm analysis and computer simulation, it is shown that the convergence speed and convergence accuracy of the improved algorithm are improved.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114929528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design and Implementation of Health Status Management and Monitoring System for Airborne Test System","authors":"Dongsheng Hu, Ni Luo, Yanzhi Li","doi":"10.1145/3532342.3532357","DOIUrl":"https://doi.org/10.1145/3532342.3532357","url":null,"abstract":"The airborne test system of flight test is very important for the scientific research test flight of new aircraft. The airborne test system is a complex system composed of many different kinds of test equipment. How to effectively manage and monitor the health status of the airborne test system is an important problem that needs to be solved urgently to ensure the smooth test flight mission. This paper aims at the airborne test equipment which is widely used in domestic flight test system, and introduces the related background, system scheme, hardware design and software design.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116523600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shuo Wang, Wenjie Shen, Yang Li, Yun Lin, Yanping Wang
{"title":"GBSAR Moving Target Detection Capability Evaluation and Refocus based Detection Algorithm","authors":"Shuo Wang, Wenjie Shen, Yang Li, Yun Lin, Yanping Wang","doi":"10.1145/3532342.3532347","DOIUrl":"https://doi.org/10.1145/3532342.3532347","url":null,"abstract":"Ground Based Synthetic Aperture Radar (GBSAR) can retrieve sub-millimeter deformation information of the mine slope by using differential interferometry, which is important to safe production in mining application. However, the continuous operation of vehicles in the mining area can block radar signal to slope. The movement of vehicles may also affect the slope stability. Therefore, using GBSAR to detect moving target signal and remove their influence to deformation retrieval is necessary. To our knowledge, the paper about moving target detection based on GBSAR has not been published. Considering current GBSAR systems are mostly single channel FMCW systems. In this paper, we evaluate the capability of moving target detection based on NCUT Risk Radar GBSAR. The GBSAR moving target signal model is analyzed. Then, detection methods utilize shift and defocusing characteristics are analyze. According to the analysis, a new time-domain phase compensation method with relative speed for azimuth moving target detection is proposed. The analysis and proposed method are verified by simulated data.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132321405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Simultaneous Recognition Algorithm of Human Activity and Phone Position Based on Multi-sensor Data Fusion","authors":"Da Ai, Rui Hao, Chaolin Feng, Yuxuan Li, Y. Liu","doi":"10.1145/3532342.3532345","DOIUrl":"https://doi.org/10.1145/3532342.3532345","url":null,"abstract":"For human activity recognition based on phone sensors, the position of the phone is an important factor of the recognition accuracy. To improve the recognition accuracy of behavioral activities and the position of the phone placed, this paper proposes a classification recognition algorithm based on accelerometer and gyroscope sensors. First, sensor data collected from seven different body positions are used as inputs to a deep stacked bidirectional long and short-term memory neural network; then the activity type and the phone position are used as labels to train the neural network for simultaneous recognition of human activity and phone position; finally, the performance of the proposed method is evaluated by cross-validation. The experimental results show that placing the phone on the waist and thigh achieves the highest recognition accuracy rate. The accuracy of the simultaneous recognition of activity and position is over 90%, which is 18% higher than existing algorithms.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"5 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120841365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Threshold Selection on Circular Histogram Using Renyi Entropy","authors":"Jin Jin, Jiu-lun Fan","doi":"10.1145/3532342.3532351","DOIUrl":"https://doi.org/10.1145/3532342.3532351","url":null,"abstract":"Using the circular histogram of the H component in the HSI color model for threshold selection is a way of color image segmentation. The maximum Shannon entropy thresholding on the circular histogram is an effective segmentation method. Considering that Renyi entropy is one of the generalized forms of Shannon entropy, this paper extends the maximum Shannon entropy threshold selection method on circular histogram to Renyi entropy case, gives recursive algorithm to reduce the time complexity of Renyi entropy threshold selection, and discusses the determination of parameters in Renyi entropy threshold selection. The experimental comparison effect shows that the maximum Renyi entropy threshold selection method outperforms the maximum Shannon entropy threshold selection method.","PeriodicalId":398859,"journal":{"name":"Proceedings of the 4th International Symposium on Signal Processing Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132295370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}