计算机辅助设计与图形学学报最新文献_第7页

Improved Efficient Convolutional Neural Networks for Complex Scene Mask-Wearing Detection 用于复杂场景戴口罩检测的改进高效卷积神经网络

计算机辅助设计与图形学学报 Pub Date : 2021-07-01 DOI: 10.3724/sp.j.1089.2021.18635

Junxiao Xue, Junjin Cheng, Qibin Zhang, Yibo Guo, Aiguo Lu, Jian Li, Xi Wan, Jing Xu

{"title":"Improved Efficient Convolutional Neural Networks for Complex Scene Mask-Wearing Detection","authors":"Junxiao Xue, Junjin Cheng, Qibin Zhang, Yibo Guo, Aiguo Lu, Jian Li, Xi Wan, Jing Xu","doi":"10.3724/sp.j.1089.2021.18635","DOIUrl":"https://doi.org/10.3724/sp.j.1089.2021.18635","url":null,"abstract":": To solve the problem about low accuracy of mask wear detection under complex lighting and face lean conditions, a method of mask wear detection under intricate environment using efficient convolutional neural network is proposed, which uses pre-training such as hard negative mining to learn more samples of face feature, utilize multi-task convolutional neural networks (MTCNN) to estimate the possibility of face information, and get accurate face location. With attention mechanism in feature pyramid network, enhanc-ing the weight of key points on human face, employing efficient neural network detection will be wore on mask-wearing detection as a simple binary classification problem. Under the environment of TensorFlow platform, not only data training, data preprocessing, but also the contrast experiment with AIZOO method are completed. A data set containing with 816 images is collected, marked and trained. During the data pre-processing, images are set as fixed size to reduce the amount of computation and promote the detection speed. Then, image enhancement algorithm is used to conduct distortion processing to improve the robust-ness of this model. On this basis, MTCNN is used to detect the face information in pictures, modify and normalize all data, then put them into neural network and the trained model to detection. The experimental results show that under complex conditions such as complex lighting and face tilt, the accuracy can reach 83% and 91% respectively, which means can accurately detect whether wearing a mask.","PeriodicalId":52442,"journal":{"name":"计算机辅助设计与图形学学报","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46256511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Efficient 3D Object Detection of Indoor Scenes Based on RGB-D Video Stream 基于RGB-D视频流的室内场景三维目标高效检测

计算机辅助设计与图形学学报 Pub Date : 2021-07-01 DOI: 10.3724/sp.j.1089.2021.18630

Miao Yongwei, Jiahui Chen, Xinjie Zhang, Ma Wenjuan, S. Sun

引用次数: 1

Information Hiding Scheme Based on Quantum Generative Adversarial Network 基于量子生成对抗网络的信息隐藏方案

计算机辅助设计与图形学学报 Pub Date : 2021-07-01 DOI: 10.3724/sp.j.1089.2021.18617

Jia Luo, Rigui Zhou, Yaochong Li, Guangzhong Liu

引用次数: 2

Automatic Poetry Generation Based on Ancient Chinese Paintings 基于中国古代绘画的诗歌自动生成

计算机辅助设计与图形学学报 Pub Date : 2021-07-01 DOI: 10.3724/sp.j.1089.2021.18633

Jiazhou Chen, Keyu Huang, Yingchaojie Feng, Wei Zhang, Siwei Tan, Wei Chen

{"title":"Automatic Poetry Generation Based on Ancient Chinese Paintings","authors":"Jiazhou Chen, Keyu Huang, Yingchaojie Feng, Wei Zhang, Siwei Tan, Wei Chen","doi":"10.3724/sp.j.1089.2021.18633","DOIUrl":"https://doi.org/10.3724/sp.j.1089.2021.18633","url":null,"abstract":": The Chinese painting poem is a very special art form in the history of world art. It combines ancient Chinese literature and fine arts, complements each other and blends together. In order to obtain com-puter-based painting poetry, an automatic poetry generation is proposed based on ancient Chinese paintings. It extracts multiple sentences from ancient paintings, which improves the literary expression ability of ancient poems in paintings. Firstly, a multi-sentence annotation data set for ancient paintings is established, and then semantic features of ancient paintings are extracted through an improved image captioning method. Finally, these modern text descriptions are converted into a four-character poem through a two-way LSTM encoding and decoding framework. The experiment on the paintings of the Song Dynasty demonstrates that the coherent and prosodic poems generated by our method are consistent with the original content and con-text of the ancient paintings. User study shows that the content consistency and user satisfaction of our method are better than keyword-based methods, which proves the validity of the proposed method","PeriodicalId":52442,"journal":{"name":"计算机辅助设计与图形学学报","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"69686226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

L0 Optimization Using Laplacian Operator for Image Smoothing 基于拉普拉斯算子的图像平滑L0优化

计算机辅助设计与图形学学报 Pub Date : 2021-07-01 DOI: 10.3724/sp.j.1089.2021.18627

Menghang Li, Shanshan Gao, Huijian Han, Caiming Zhang

{"title":"L0 Optimization Using Laplacian Operator for Image Smoothing","authors":"Menghang Li, Shanshan Gao, Huijian Han, Caiming Zhang","doi":"10.3724/sp.j.1089.2021.18627","DOIUrl":"https://doi.org/10.3724/sp.j.1089.2021.18627","url":null,"abstract":": Image smoothing often leads to the loss of image details and distortion because of over smoothing. An image smoothing method is presented which combines 0 L optimization and the second-order Laplacian operator. Laplacian operator is used to constrain the color change of the image, and 0 L optimization is used to minimize the change of the color gradient, so as to achieve the purpose of smooth color transition of the image. In order to keep the edge features of the image better in the process of smoothing, Sobel operator is introduced as the regular term of energy function, and the alternating solution strategy is adopted to solve the energy function. In the ex-periment, using the classical image in the field of image smoothing and the image obtained through network en-gine, the proposed method is compared qualitatively and quantitatively with 6 smoothing methods and 7 denois-第 ing methods. The experimental results show that the proposed method can reduce the loss of image details while smoothing the image, effectively deal with the phenomenon of stepped edges and color block distribution in the image smoothing, and effectively remove various noises in the image. And the peak signal-to-noise ratio and run-ning time of the proposed method are improved compared with other methods.","PeriodicalId":52442,"journal":{"name":"计算机辅助设计与图形学学报","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42793534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

A Real-Time Semantic Segmentation Approach for Autonomous Driving Scenes 一种自动驾驶场景的实时语义分割方法

计算机辅助设计与图形学学报 Pub Date : 2021-07-01 DOI: 10.3724/sp.j.1089.2021.18631

Feiwei Qin, Xiyue Shen, Yong Peng, Yanli Shao, Wenqiang Yuan, Zhongping Ji, Jing Bai

{"title":"A Real-Time Semantic Segmentation Approach for Autonomous Driving Scenes","authors":"Feiwei Qin, Xiyue Shen, Yong Peng, Yanli Shao, Wenqiang Yuan, Zhongping Ji, Jing Bai","doi":"10.3724/sp.j.1089.2021.18631","DOIUrl":"https://doi.org/10.3724/sp.j.1089.2021.18631","url":null,"abstract":"An important part of autonomous driving is the perception of the driving environment of the car, which has created a strong demand for high precision semantic segmentation algorithms that can be run in real time on low-power mobile devices. However, when analyzing the factors that affect the accuracy and speed of the semantic segmentation network, it can be found that in the structure of the previous semantic segmentation algorithm, spatial information and context features are difficult to take into account at the same time, and using two networks to obtain spatial information and context information separately will increase the amount of calculation and storage. Therefore, a new structure is proposed that divides the spatial path and context path from the network based on the residual structure, and a two-path real-time semantic segmentation network is designed based on this structure. The network contains a feature fusion module and an attention refinement module, which are used to realize the function of fusing the multi-scale features of two 第 7 期秦飞巍, 等: 无人驾驶中的场景实时语义分割方法 1027 paths and optimizing the output results of context path. The network is based on the PyTorch framework and uses NVIDIA 1080Ti graphics cards for experiments. On the road scene data set Cityscapes, mIoU reached 78.8%, and the running speed reached 27.5 fps.","PeriodicalId":52442,"journal":{"name":"计算机辅助设计与图形学学报","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49617302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Dongba Painting Few-Shot Classification Based on Graph Neural Network 基于图神经网络的东巴绘画少镜头分类

计算机辅助设计与图形学学报 Pub Date : 2021-07-01 DOI: 10.3724/sp.j.1089.2021.18618

Ke Li, Wenhua Qian, Chengxue Wang, Dan Xu

引用次数: 0

Semi-Real-Time Bearing Fault Diagnosis Method Combined Image Method 半实时轴承故障诊断方法——组合图像法

计算机辅助设计与图形学学报 Pub Date : 2021-06-01 DOI: 10.3724/sp.j.1089.2021.18579

Pengzhi Wang, Mandun Zhang, Yahong Han, Xu Zhao, Zhengjun Wang

引用次数: 0

Spatial Positioning Method of Vehicle in Cross-Camera Traffic Scene 跨摄像头交通场景中车辆空间定位方法

计算机辅助设计与图形学学报 Pub Date : 2021-06-01 DOI: 10.3724/sp.j.1089.2021.18612

Wen Wang, Xinyao Tang, Chaoyang Zhang, Huansheng Song, Hua Cui

引用次数: 2

Pix2Pix-Based Grayscale Image Coloring Method 基于Pix2Pix的灰度图像着色方法

计算机辅助设计与图形学学报 Pub Date : 2021-06-01 DOI: 10.3724/sp.j.1089.2021.18596

Hong Li, Qiaoxue Zheng, Jing Zhang, Zhuo-Ming Du, Zhanli Li, Baosheng Kang

{"title":"Pix2Pix-Based Grayscale Image Coloring Method","authors":"Hong Li, Qiaoxue Zheng, Jing Zhang, Zhuo-Ming Du, Zhanli Li, Baosheng Kang","doi":"10.3724/sp.j.1089.2021.18596","DOIUrl":"https://doi.org/10.3724/sp.j.1089.2021.18596","url":null,"abstract":": In this study, a grayscale image coloring method combining the Pix2Pix model is proposed to solve the problem of unclear object boundaries and low image coloring quality in colorization neural net-works. First, an improved U-Net structure, using eight down-sampling and up-sampling layers, is adopted to extract features and predict the image color, which improves the network model’s ability to extract deep image features. Second, the coloring image quality is tested under different loss functions, 1 L loss and smooth 1 L loss, to measure the distance between the generated image and ground truth. Finally, gradient penalty is added to improve the network stability of the training process. The gradient of each input data is penalized by constructing a new data distribution between the generated and real image distribution to limit the dis-criminator gradient. In the same experimental environment, the Pix2Pix model and summer2winter data are utilized for comparative analysis. The experiments demonstrate that the improved U-Net using the smooth 1 L loss as generator loss generates better colored images, whereas the 1 L loss better maintains the structural information of the image. Furthermore, the gradient penalty accelerates the model convergence speed, and improves the model stability and image quality. The proposed image coloring method learns deep image features and reduces the image blurs. The model raises the image quality while effectively maintaining the image structure similarity.","PeriodicalId":52442,"journal":{"name":"计算机辅助设计与图形学学报","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47201506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6