{"title":"Low-Rank Tensor Completion with Total-Variation-Regularized Transformed Tensor Schatten-p Norm for Video Inpainting","authors":"Jiahui Liu, Jialue Tian","doi":"10.1145/3573428.3573699","DOIUrl":"https://doi.org/10.1145/3573428.3573699","url":null,"abstract":"Due to the existence of missing entries in real-world tensor data, low-rank tensor completion (LRTC) problem has received increasing attention. In this paper, we propose a new transformed tensor Schatten- norm to replace the rank norm and develop a transformed multi-tensor-Schatten- norm surrogate theorem to convert the non-convex transformed tensor Schatten- norm with 0<<1 into the sum of multiple convex functions. However, tensor completion constrained by low-rank prior alone cannot protect local smoothness along the spatial and tubal dimensions. To address this drawback, we combine anisotropic total variation (TV) regularization with non-convex transformed tensor Schatten- norm with 0<<1 for LRTC. The combination of global low-rank prior and local TV prior is beneficial to improving the final completion effect. Our experimental results on grey-scale video inpainting demonstrate that our proposed method outperforms other existing state-of-the-art methods.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"451 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122485832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Direct Position Determination using Distributed Unfold Coprime Arrays with Unknown Mutual Coupling: based on Reduced-Dimension Search","authors":"Baobao Li, Xiaofei Zhang, Jianfeng Li, Jinke Cao","doi":"10.1145/3573428.3573706","DOIUrl":"https://doi.org/10.1145/3573428.3573706","url":null,"abstract":"The direct position determination (DPD) approach has higher localization accuracy and better robustness than the classical two-step approach when localizing multiple sources with distributed antenna arrays. This paper focuses on the DPD algorithm using multiple Unfolded Coprime Arrays (UCAs) with unknown mutual coupling. To reduce the adverse effects of the mutual coupling, we first expand the unfolded coprime arrays into the DPD scenario. Subsequently, we introduce the HD-DPD-Capon algorithm, which fuses all inverse covariance matrices of distributed arrays, simultaneously searching for multiple unknown mutual coupling coefficients and source positions. Finally, in advance of the reduced-dimension search, we propose the RMCD-ICF algorithm, which only needs to search the two-dimension position, to reduce the high computational complexity of the HD-DPD-Capon algorithm caused by the high-dimensional search. Simulation results verify the superiority of the proposed algorithm on computation complexity and localization accuracy.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"152 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116500019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zhuang Li, Y. Li, Yanping Wang, Yun Lin, Wenjie Shen
{"title":"Characteristics Analysis of RODNet ConfMap for MMW Radar Target Detection","authors":"Zhuang Li, Y. Li, Yanping Wang, Yun Lin, Wenjie Shen","doi":"10.1145/3573428.3573552","DOIUrl":"https://doi.org/10.1145/3573428.3573552","url":null,"abstract":"Unlike the Constant False-Alarm Rate (CFAR) based MMW radar target detection methods, RODNet ( A Real-Time Radar Object Detection Network ) is based on Convolutional Neural Networks ( CNNs ), and directly learns the radar target scattering signatures from the original range-azimuth ( RA ) radio frequency image sequence. Although this is a big advantage to keep more useful information, the generated confidence map (ConfMap) characteristics of predicated proximal pedestrian targets is unknown. It leads to a missed detection problem in the dense pedestrian scene. In this paper, we analyze the characteristics of ConfMap and the limitations of RODNet. The relationship among ConfMap value distribution, occupied grid spatial distribution and target number is analyzed. Through the CRUW dataset, the target detection experiment of dense pedestrian scene is carried out, and is used to support our analysis.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"113 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128105803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multi-Feature Convergence Network for Acoustic Scene Classification","authors":"Menglong Wu, Hongxia Dong, Xichang Cai, Ziling Qiao, Cuizhu Qin, Lin Zhang","doi":"10.1145/3573428.3573633","DOIUrl":"https://doi.org/10.1145/3573428.3573633","url":null,"abstract":"This paper investigates a multi-feature convergence network for acoustic scene classification (ASC). A series of neural network models designed with features of the Log Mel spectrogram, Deltas, and Delta-Deltas superimposed on the channel have achieved good classification results. However, the low-frequency part of the speech spectrogram feature extracted from the audio signal has a mosaic shape due to its low resolution, which leads to the loss of information in the low-frequency part of the Log Mel-Deltas-DeltaDeltas feature and reduces the classification accuracy. To solve this problem, the constant Q-transform (CQT) spectrogram is introduced and this feature is superimposed on the channel with the log Mel-Deltas-DeltaDeltas feature to form a 4-channel feature spectrum as the input to the network model. Moreover, the proposed network model is deepened by increasing the 8 residual blocks from the baseline system to 10 residual blocks and a snapshot integration operation is performed on the various models saved during the training process due to the complementary information. And then, a 3-classifier is added based on the ASC's primarily categorized scenes' 10-classifier and chooses the final scene classification by combining the 3–10 two-stage classification scores. The classification accuracy of our proposed network reached 77.4%, which is 5.1% higher than the baseline system set in this paper and 26% higher than the baseline on the official website of DCASE 2020.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132689981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A denoising method based on CNN through multi-layer separation","authors":"Jianyu Huang, Nanting Cai, Jian Mao, Jialin Zhang, Xiaochun Zhao","doi":"10.1145/3573428.3573490","DOIUrl":"https://doi.org/10.1145/3573428.3573490","url":null,"abstract":"In the process of electromagnetic information leakage detection, the traditional denoising methods have the problem of insufficient ability to identify useful information under low SNR. This problem results in the useful information being easily removed as a noise signal in the denoising process. Aiming at this problem, this paper proposes a denoising method based on convolutional neural network (CNN), which is achieved by decomposing red and black signals layer by layer. This method can not only show a better denoising effect under a low SNR condition, but also locate, identify and enhance the useful information effectively.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133419151","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Qian Li, Yuechao Chen, Junguo Feng, Jun Zhang, Xiaoyun Sun, Yuchao Cao
{"title":"Research on Dehazing of Cable Tunnel Image Based on Dark Channel Prior","authors":"Qian Li, Yuechao Chen, Junguo Feng, Jun Zhang, Xiaoyun Sun, Yuchao Cao","doi":"10.1145/3573428.3573702","DOIUrl":"https://doi.org/10.1145/3573428.3573702","url":null,"abstract":"The cable tunnel is an architectural structure to protect the cable. The building is located underground, and the cable is not visible to the naked eye on the ground like overhead lines. In order to protect and maintain the cable, there is a cable monitoring system. Among them, the most important thing for the monitoring system is to capture the images in the cable tunnel, and the generation of fog in the tunnel is unfavorable for the monitoring images. So there is a dehazing method. The traditional dehazing method is roughly divided into a dehazing method based on a physical model and a non-physical model. This paper studies the dehazing of the cable tunnel image based on the dark channel prior algorithm of the physical model.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134332625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analysis of The Change of Softmax Value in The Training Process of Neural Network","authors":"Yuyang Chen","doi":"10.1145/3573428.3573763","DOIUrl":"https://doi.org/10.1145/3573428.3573763","url":null,"abstract":"Classification is an essential field in deep learning. Generally, the category corresponding to the maximum value of softmax is mainly used as the prediction result and the softmax value as the prediction probability. However, whether softmax can indeed serve as a prediction probability needs further confirmation. This paper first focuses on the classification of paintings through Convolutional Neural Network. To deal with the imbalanced dataset problem, only those with more than 200 paintings are selected. Besides, class weight is also taken into consideration. Next, data augmentation is applied to enlarge the dataset and add more relevant data. For the modeling and training part, transfer learning is employed to avoid training from scratch on a new dataset and reduce the cost of later training. Techniques such as ‘EarlyStopping’ and ‘ReduceLROnPlateau’ are also used to avoid overfitting. The final prediction accuracy can achieve 99 percent on the training and 87 percent on the validation sets. Furthermore, the paper studies the change of softmax distribution during the training process and the relationship between the average maximum value of softmax and the classification performance of classes. The experiments show that the maximum value of softmax will gradually shift to the corresponding correct label during the training process. Still, there is no correlation between the classification performance and the average maximum value of softmax. Therefore, softmax cannot be used as a probability value for classification.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134554770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Deep Cross-modal Hashing Retrieval Based on Semantics Preserving and Vision Transformer","authors":"Jin Hong, Huayong Liu","doi":"10.1145/3573428.3573439","DOIUrl":"https://doi.org/10.1145/3573428.3573439","url":null,"abstract":"In response to the problem of similarity measure differences in different similarity coefficients that occur in cross-modal multi-label retrieval, this article uses an interval parameter to correct this bias. A new supervised hash method is proposed by introducing the transformer structure which performs well in CV and NLP tasks into cross-modal hash retrieval, called the Deep Semantics Preserving Vision Transformer Hashing (DSPVTH). This method uses network structures such as vision transformer to map different modal data into binary hash codes. It also uses the similarity relationship of multiple tags to maintain the semantic association between different modal data. Validation on four commonly used multimodal text datasets, Mirflickr25k, NUS-WIDE, COCO2014 and IAPR TC-12, shows a 2% to 8% improvement in average accuracy compared with the current optimal method, which means our method is robust and effective.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115677577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on Data Augmentation Strategy Methods for Image Caption","authors":"Nan Lin, Shuang Li, Yingkang Han, Mengdi Liu","doi":"10.1145/3573428.3573673","DOIUrl":"https://doi.org/10.1145/3573428.3573673","url":null,"abstract":"Data augmentation can effectively expand the number of samples in a dataset and increase the diversity of samples. Image caption refers to the generation of a description statement corresponding to an image, and its accuracy directly affects the accuracy of the description statement. In this paper, we study and analyze data augmentation and VizWiz dataset, then we find that data augmentation can effectively simulate the image quality problems existing in VizWiz dataset. In order to improve the accuracy of the image caption model on the VizWiz dataset, this paper presents a method based on a data augmentation strategy, which mainly uses four data augmentation operators to simulate camera shake, out-of-focus, flash and low light conditions. The strategy space also contains basic translate, shear and contrast operations for the image. The method achieves a score: BLEU_1 of 62.5, BLEU_4 of 23.1, ROUGE_L of 46.6 and CIDEr of 49.6 on the VizWiz dataset.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114789917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design and Implementation of Course Review System","authors":"Yang Yi, Yu Liu","doi":"10.1145/3573428.3573453","DOIUrl":"https://doi.org/10.1145/3573428.3573453","url":null,"abstract":"Based on the guiding of “Internet Plus Education”, this paper researches the main stream technical framework of website development, designs and implements a course review system. Its implementation is divided into two parts: the main website and the admin management. The main website includes searching information, registering, and reviewing courses, modifying user information, viewing personal favorites, comments and notices; the admin management includes user information, course information, comment information, collection information, review information, and notice information management. The system adopts four-layer (View- Controller-Service-Infrastructure) architecture, mainlyuses Spring Boot and Mybatis-Plus, integrates Elastic Search and Redis. The application of these technologies can reduce construction and maintenance costs, improve the user access experience inconcurrent scenarios and enhance the search accuracy and performance.","PeriodicalId":314698,"journal":{"name":"Proceedings of the 2022 6th International Conference on Electronic Information Technology and Computer Engineering","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114573796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}