2022 7th International Conference on Multimedia and Image Processing最新文献

筛选
英文 中文
Structure design of the shutter with slider-crank mechanism 曲柄滑块机构快门的结构设计
2022 7th International Conference on Multimedia and Image Processing Pub Date : 2022-01-14 DOI: 10.1145/3517077.3517119
F. Jiaqi
{"title":"Structure design of the shutter with slider-crank mechanism","authors":"F. Jiaqi","doi":"10.1145/3517077.3517119","DOIUrl":"https://doi.org/10.1145/3517077.3517119","url":null,"abstract":"In order to realize the miniaturization and lightweight of the infrared nonuniform correction shutter, the crank slider mechanism is used to design. Firstly, the shutter blade is used as the slider and the driving mechanism is used as the crank, and the motion analysis is carried out. The transmission angle of the crank slider mechanism is calculated to be no less than 66.42 °. Then, the slider, connecting rod and crank are analyzed respectively, the force system and motion equation are established, and the driving torque of the shutter is calculated. The design results show that the volume, weight and driving torque of the same target product are reduced by 1 / 3, 1 / 2 and 1 / 2, respectively, compared with the direct motion rotary shutter commonly used in nonuniform correction of infrared camera. The design goal is achieved.","PeriodicalId":233686,"journal":{"name":"2022 7th International Conference on Multimedia and Image Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115343708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Efficient Mixed Bit-width Searching Strategy for CNN Quantization based on BN Scale Factors 基于BN尺度因子的CNN量化混合位宽搜索策略
2022 7th International Conference on Multimedia and Image Processing Pub Date : 2022-01-14 DOI: 10.1145/3517077.3517108
Xuecong Han, Xulin Zhou, Zhongjian Ma
{"title":"An Efficient Mixed Bit-width Searching Strategy for CNN Quantization based on BN Scale Factors","authors":"Xuecong Han, Xulin Zhou, Zhongjian Ma","doi":"10.1145/3517077.3517108","DOIUrl":"https://doi.org/10.1145/3517077.3517108","url":null,"abstract":"In recent years, the rapid development of mixed-precision quantification technology has greatly reduced the scale of the model and the amount of calculation. However, the previous mixed bit-width strategies are too complicated, such as reinforcement learning strategies and Hessian matrix strategies. This paper proposes an efficient mixed bit-width searching strategy, which measures the sensitivity of the convolutional layer by the scale factors of the BN layer. The advantage of this strategy is that the parameters of the pre-trained model are used and no extra computation is introduced, which greatly simplifies the complexity of the bit-width selection strategy. In this paper, Resnet18 and Resnet50 models are used to conduct comparative experiments, and the differences between the proposed strategy and several previous algorithms are compared in terms of accuracy, model size and computation amount. It is verified that the accuracy of quantization in this paper is reduced within 2% compared with FP32 baseline, and the accuracy is reduced with about 0.5% compared with HAWQ. Overall, the performance is similar to that of HAWQ. This paper also compares the calculation complexity of the quantized bit-width of HAWQ-V3 with the calculation complexity of the quantized bit-width of this paper, which proves that the computational complexity of the strategy in this paper is far less than that of HAWQ-V3.","PeriodicalId":233686,"journal":{"name":"2022 7th International Conference on Multimedia and Image Processing","volume":"181 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115991627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Successive statistical iterative reconstruction, 3D-filtering and region growing methods for high-quality 3D visualization of cone-beam CT image 连续统计迭代重建、三维滤波和区域生长方法实现高质量的锥束CT图像三维可视化
2022 7th International Conference on Multimedia and Image Processing Pub Date : 2022-01-14 DOI: 10.1145/3517077.3517081
Jian Dong, Siyuan Zhang, Xiaoxia Yang, Jingyu Zhang
{"title":"Successive statistical iterative reconstruction, 3D-filtering and region growing methods for high-quality 3D visualization of cone-beam CT image","authors":"Jian Dong, Siyuan Zhang, Xiaoxia Yang, Jingyu Zhang","doi":"10.1145/3517077.3517081","DOIUrl":"https://doi.org/10.1145/3517077.3517081","url":null,"abstract":"Our study is to improve the accuracy of bone morphology depicted in dental cone-beam computed tomography (CBCT) images. First, successive statistical iterative reconstruction was used to reduce unavoidable streak artefacts. Then 3-dimentional filtering (Gaussian-Laplacian filter) and region growing methods were examined to present clear bone morphology. CBCT examinations were performed with the following principal exposure parameters: I-mode, FOV 10cm in diameter, 120 kV, 15 mA, 0.2 mm slice thickness, and exposure time of 10s. Ordered subset-expectation maximization (OS-EM) algorithm was applied for unavoidable streak artefact reduction. 3D Laplacian sharpening to images preprocessed by Gaussian smoothing was sequentially tested. Region growing method with dilation and erosion was used to segment maxillofacial tissue. Streak artefact induced by metallic prosthetic appliances was reduced by applying successive iterative OS-EM algorithm. Multi-planar reconstruction (MPR) images at left side molar plane and mid-sagittal plane were presented to validate effect of OS-EM algorithm and 3D Gaussian-Laplacian filter. Maxillofacial tissue was segmented and presented to show the effect of region growing method. Streak artefact reducing method, 3D filtering method for image smoothing and sharpening, and region growing method with dilation and erosion were effective to improve accuracy of bone morphology in dental CBCT images.","PeriodicalId":233686,"journal":{"name":"2022 7th International Conference on Multimedia and Image Processing","volume":"494 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117023357","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-scale Deep Curve Estimation for Low-light Image Enhancement 低光图像增强的多尺度深度曲线估计
2022 7th International Conference on Multimedia and Image Processing Pub Date : 2022-01-14 DOI: 10.1145/3517077.3517087
Xin Zhang, Xia Wang, Gangcheng Jiao, Ye Yang, Hongchang Cheng, Bo Yan
{"title":"Multi-scale Deep Curve Estimation for Low-light Image Enhancement","authors":"Xin Zhang, Xia Wang, Gangcheng Jiao, Ye Yang, Hongchang Cheng, Bo Yan","doi":"10.1145/3517077.3517087","DOIUrl":"https://doi.org/10.1145/3517077.3517087","url":null,"abstract":"Due to the limitation of the device, pictures taken in low-light environment usually consist of unpleasant deterioration, such as low contrast and color distortion. In this paper, we propose a Multi-scale Deep Curve Estimation network (MSDCE) for low-light image enhancement, which formulates the single low-light image enhancement task as a pixel-wise curve estimation by paired learning. To impose more priors of low-light regions, we propose an inverse illuminance map as part of the Curve Estimation network input. The curve estimation network backbone is composed of multi-scale modules which learns information from multi-scale feature streams and ensures the information exchange across different scales. Compared with several state-of-the-art methods, our method is significantly better. From the perspective of visual evaluation, our MSDCE can effectively improve the contrast and illumination of the image, and ensure the color fidelity of the image. CCS CONCEPTS • Computing methodologies • Artificial intelligence • Computer vision • Computer vision problems • Reconstruction","PeriodicalId":233686,"journal":{"name":"2022 7th International Conference on Multimedia and Image Processing","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123582445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Submission Research on an Integrated Service System of Self-service Intelligent itinerary-checking , Registration and Charging (I) 自助智能行程查、登记、收费综合服务系统提交研究(一)
2022 7th International Conference on Multimedia and Image Processing Pub Date : 2022-01-14 DOI: 10.1145/3517077.3517116
Wanpeng Tang
{"title":"Submission Research on an Integrated Service System of Self-service Intelligent itinerary-checking , Registration and Charging (I)","authors":"Wanpeng Tang","doi":"10.1145/3517077.3517116","DOIUrl":"https://doi.org/10.1145/3517077.3517116","url":null,"abstract":"This study provides a self-service intelligent itinerary-checking , Registration and Charging service system, including communication terminal software subsystem and monitoring subsystem. The terminal software subsystem is used to respond to the interface operation event request, transmit the terminal status information and service data information to the monitoring subsystem in real time, and receive the remote maintenance instructions of the monitoring subsystem. The monitoring subsystem is used to receive and save the terminal device status information and service data information sent by the terminal software subsystem, perform fault diagnosis, automatic alarm and remote maintenance according to the terminal status information, and perform statistical analysis and query output according to the service data information. This system can meet the current registration, payment, triage, medical guidance, inquiry and printing and other self-help needs, with comprehensive supervision and maintenance of the terminal equipment monitoring subsystem, optimize the business process, improve the quality of service and the utilization of terminal equipment, better solve the problem of difficult to see a doctor.","PeriodicalId":233686,"journal":{"name":"2022 7th International Conference on Multimedia and Image Processing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131025611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
On-line intelligent visual identification algorithm of power equipment state under the complex environment based on SIFT 基于SIFT的复杂环境下电力设备状态在线智能视觉识别算法
2022 7th International Conference on Multimedia and Image Processing Pub Date : 2022-01-14 DOI: 10.1145/3517077.3517086
Yun-Fo Liu, Qiang Lyu, Yanjie Zhang, Chao Yang, Qifan Yang, Feng Zhou
{"title":"On-line intelligent visual identification algorithm of power equipment state under the complex environment based on SIFT","authors":"Yun-Fo Liu, Qiang Lyu, Yanjie Zhang, Chao Yang, Qifan Yang, Feng Zhou","doi":"10.1145/3517077.3517086","DOIUrl":"https://doi.org/10.1145/3517077.3517086","url":null,"abstract":"At present, the application of computer vision technology in power systems is increasing. The idea of using image processing and machine vision to monitor power equipment is not new. However, the research mainly focuses on the application of computer vision technology in the fields of transmission line environment and insulator detection. Combined the actual conditions of the intelligent grid substation and the need of construction, this paper proposed and studied a kind of identification algorithm based on intelligent computer vision technology, aiming at solving the problem of automatic identification of typical outdoor circuit breakers, disconnectors and indoor switchgear. First, using scale-invariant feature transform (scale invariant feature transform, SIFT) algorithm, the paper accurately positions the area to be detected; second, extracts isolating switch line information and switchgear circle information using randomized Hough transform, and through the k-NN (k-Nearest Neighbour) extracts and ferreting breaker character information; Finally, three kinds of electric power equipment are identified intelligently by threshold setting, and the identification effect and stability of the algorithm are validated in the disconnector and Qing He substation of a 500 kv substation in China.","PeriodicalId":233686,"journal":{"name":"2022 7th International Conference on Multimedia and Image Processing","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132385062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speckle suppression and texture preservation in optical coherence tomography images using variational image decomposition 基于变分图像分解的光学相干断层扫描图像的斑点抑制和纹理保存
2022 7th International Conference on Multimedia and Image Processing Pub Date : 2022-01-14 DOI: 10.1145/3517077.3517078
Biyuan Li, Xin Zhao, Jun Zhang
{"title":"Speckle suppression and texture preservation in optical coherence tomography images using variational image decomposition","authors":"Biyuan Li, Xin Zhao, Jun Zhang","doi":"10.1145/3517077.3517078","DOIUrl":"https://doi.org/10.1145/3517077.3517078","url":null,"abstract":"Filtering off speckle noise while preserving fine details for optical coherence tomography(OCT) is particularly challenging. In this paper, an efficient method based on variational image decomposition(VID) is proposed to suppress speckle from OCT retinal images. A new BL-G-BM3D model based on VID is proposed to decompose one OCT retinal image into the background part, the texture part and noise. Each part is described by suitable function space separately and processed individually. The proposed model is able to preserve structural information while sufficiently suppressing speckle noise. We test the proposed method on two raw OCT retinal images with low contrast and high noise level, and compare with four other related and widely used filtering methods in terms of both quantitative evaluation and visual quality.The experimental results have demonstrated the validity of the proposed method.","PeriodicalId":233686,"journal":{"name":"2022 7th International Conference on Multimedia and Image Processing","volume":"118 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115256866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A PCA-aided EV-EGI Method for Registering Volumetric Datasets 一种pca辅助EV-EGI方法配准体积数据集
2022 7th International Conference on Multimedia and Image Processing Pub Date : 2022-01-14 DOI: 10.1145/3517077.3517095
Chun Dong, Timothy S Newman
{"title":"A PCA-aided EV-EGI Method for Registering Volumetric Datasets","authors":"Chun Dong, Timothy S Newman","doi":"10.1145/3517077.3517095","DOIUrl":"https://doi.org/10.1145/3517077.3517095","url":null,"abstract":"A method for volumetric dataset registration that utilizes principal component analysis (PCA) and volumetric extended Gaussian image (EGI)-based processing is presented. The method uses PCA to determine an initial coarse estimate of orientation difference between two volumetric datasets. The PCA is based on certain automatically selected (i.e., significant) voxels. The coarse estimate then is refined by a three-stage process that utilizes enhanced volumetric extended Gaussian images (EV-EGIs). These final EV-EGI stages also provide the translational component. The method's combination of steps allows for faster processing at roughly similar accuracy versus prior work based solely on EV-EGIs. Experimental comparisons with Globally optimal Iterative Closest Pointset (Go-ICP) registration are also reported and analyzed.","PeriodicalId":233686,"journal":{"name":"2022 7th International Conference on Multimedia and Image Processing","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121808563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Autonomous Foot Movement Recognition Based on SVM 基于支持向量机的自主足部运动识别研究
2022 7th International Conference on Multimedia and Image Processing Pub Date : 2022-01-14 DOI: 10.1145/3517077.3517090
Tongning Meng, Li Zhao, Zhiwen Zhang, Xinglin He
{"title":"Research on Autonomous Foot Movement Recognition Based on SVM","authors":"Tongning Meng, Li Zhao, Zhiwen Zhang, Xinglin He","doi":"10.1145/3517077.3517090","DOIUrl":"https://doi.org/10.1145/3517077.3517090","url":null,"abstract":"In order to improve the effectiveness of rehabilitation of stroke patients, active training can be used to treat and recover the patient's foot dyskinesia. Recognizing the different movement characteristics of the feet is an important part of the active rehabilitation of stroke patients. In this paper, the EMG signals of different movements of the right foot are classified and studied. The EMG signals of three different movement states of the foot resting state, foot stretched 15° and foot stretched 45° are collected, absolute mean and filter common space mode were used for feature extraction of EMG signal, and support vector machine (SVM) was used for classification and recognition after extraction. The experimental results show that the classification accuracy rate of resting state-foot-stretched 45° is 89.9%, which exceeds the classification accuracy rate of resting state-foot-stretched 15° of 86.8%. It shows that when the subjects stretch the foot at 45°, more motion units are activated and the characteristics are more obvious than when the feet are stretched at 15°. Therefore, by classifying the characteristics of EMG signals and identifying different autonomic movements of feet, it can be used as the basis for rehabilitation treatment of stroke patients. At the same time, the average classification accuracy of 15° -45 ° and the resting state -15 ° -45 ° is above 80%, which confirms the feasibility of the signal processing method and support vector machine classification algorithm used in this paper for the study of automatic foot motion recognition.","PeriodicalId":233686,"journal":{"name":"2022 7th International Conference on Multimedia and Image Processing","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130766263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Design of Mobile Robot Path Planning Algorithm Based on Improved Whale Optimization Algorithm 基于改进鲸鱼优化算法的移动机器人路径规划算法设计
2022 7th International Conference on Multimedia and Image Processing Pub Date : 2022-01-14 DOI: 10.1145/3517077.3517115
Jia Liu, Zhikang Chen, Qiang Liu, Rui Shen, Linlin Hou, Yunxi Zhang
{"title":"Design of Mobile Robot Path Planning Algorithm Based on Improved Whale Optimization Algorithm","authors":"Jia Liu, Zhikang Chen, Qiang Liu, Rui Shen, Linlin Hou, Yunxi Zhang","doi":"10.1145/3517077.3517115","DOIUrl":"https://doi.org/10.1145/3517077.3517115","url":null,"abstract":"In this paper, an improved whale optimization algorithm (IWOA) is proposed to solve the problem of independent path planning for mobile robots, which makes mobile robots move along the optimal path. By combining inverse initial coding optimization with Levy flight, The classic whale optimization algorithm (WOA) was improved, the optimization ability of WOA and the solving ability of optimal path point are improved, then the local optimal solutions are maximally avoided. Finally, by comparing the IWOA with the path planning effect diagram of the classical WOA through simulation, the feasibility and efficiency of the IWOA in path planning are verified.","PeriodicalId":233686,"journal":{"name":"2022 7th International Conference on Multimedia and Image Processing","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134057307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信