Proceedings of the 2022 5th International Conference on Machine Vision and Applications最新文献

筛选
英文 中文
Path Planning for Autonomous Cars 自动驾驶汽车的路径规划
Zhou Wu
{"title":"Path Planning for Autonomous Cars","authors":"Zhou Wu","doi":"10.1145/3523111.3523124","DOIUrl":"https://doi.org/10.1145/3523111.3523124","url":null,"abstract":"Abstract— Path planning plays a vital role in autonomous driving. It is the replication of the reasoning and decision-making of a human brain. This paper is about analyzing and optimizing a GitHub project which is related to path planning for autonomous cars. This work has fixed the program to drive a car on the simulated highway while avoiding collision, following traffic, safely changing lanes, and minimizing jerk. Additionally, more critical scenarios have been identified to make driving experiences safer and the present cost functions are optimized to make the car react effectively. Throughout the multiple experiments, a more efficient program has been produced that has reduced the time to finish one lap by roughly 10 seconds. Path planning is one of the extremely fundamental processes within autonomous driving. There are still challenges to make path planning safer and robust, such as behavior modelling on other cars, and more efficient path searching algorithm, etc.","PeriodicalId":185161,"journal":{"name":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131470612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mean-variance Based Risk-sensitive Reinforcement Learning with Interpretable Attention 基于均值方差的可解释注意风险敏感强化学习
Woo Kyung Kim, Youngseok Lee, Hong-Suh Woo
{"title":"Mean-variance Based Risk-sensitive Reinforcement Learning with Interpretable Attention","authors":"Woo Kyung Kim, Youngseok Lee, Hong-Suh Woo","doi":"10.1145/3523111.3523127","DOIUrl":"https://doi.org/10.1145/3523111.3523127","url":null,"abstract":"Risk-sensitive reinforcement learning (RL) has been studied to address the risk and uncertainty in autonomous systems. While a comprehensive understanding for the behaviors of RL agents plays an important role, interpretability was rarely discussed in the context of risk-sensitivity RL. In this paper, we present an interpretable visualization scheme with attention mechanism in which a saliency map represents the relative influence degree of an input state on the decision-making of mean-variance based risk-sensitive RL. Through 2D navigation experiments, we demonstrate that our scheme provides the interpretability with regard to risk-sensitive levels.","PeriodicalId":185161,"journal":{"name":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123145629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Optimizing Train-Test Data for Person Re-Identification in Real-World Applications 优化列车测试数据在现实世界中的再识别应用
Herman G. J. Groot, Tunç Alkanat, E. Bondarev, P. D. With
{"title":"Optimizing Train-Test Data for Person Re-Identification in Real-World Applications","authors":"Herman G. J. Groot, Tunç Alkanat, E. Bondarev, P. D. With","doi":"10.1145/3523111.3523121","DOIUrl":"https://doi.org/10.1145/3523111.3523121","url":null,"abstract":"Person re-identification (re-ID) aims to recognize an identity in non-overlapping camera views. Recently, re-ID received increased attention due to the growth of deep learning and its prominent applications in the field of automated video surveillance. The performance of deep learning-based methods relies heavily on the quality of training datasets and protocols. Particularly, parameters associated to the train and test set construction affect the overall performance. However, public re-ID datasets usually come with a fixed set of parameters, which are partly suitable for optimizing re-ID applications. In this paper, we study dataset construction parameters to improve re-ID performance. To this end, we first experiment on the temporal subsampling rate of the sequence of bounding boxes. Second, an experiment is performed on the effects of bounding-box enlargement under various temporal sampling rates. Thirdly, we analyze how the optimal choice of such dataset design parameters change with the dataset size. The experiments reveal that a performance increase of 2.1% Rank-1 is possible over a state-of-the-art re-ID model when optimizing the dataset construction parameters, thereby increasing the state-of-the-art performance from 91.9% to 94.0% Rank-1 on the popular DukeMTMC-reID dataset. The obtained results are not specific for the applied model and likely generalize to others.","PeriodicalId":185161,"journal":{"name":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124612572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Semantic Segmentation Approach for Road Defect Detection and Quantification 一种基于语义分割的道路缺陷检测与量化方法
Deepak Nagaraj, Marcel Mutz, Nisha George, Prateek Bansal, Dirk Werth
{"title":"A Semantic Segmentation Approach for Road Defect Detection and Quantification","authors":"Deepak Nagaraj, Marcel Mutz, Nisha George, Prateek Bansal, Dirk Werth","doi":"10.1145/3523111.3523113","DOIUrl":"https://doi.org/10.1145/3523111.3523113","url":null,"abstract":"Automated visual detection and quantification of road defects has been a hot research topic for quite a long time due to its practical importance for road maintenance and traffic safety. However, uncertainties associated with the 2D images, such as non-uniformity of defects, insufficient background illumination, and etc., make it a challenging problem. This research work aims to solve the problem by employing a deep learning based approach. Specifically, image segmentation has been carried out, using a convolutional encoder-decoder model, to segment various defects from the non-defect area of the road. The method lead to a reasonable segmentation of different defects. Consequently, the extracted defect areas, in terms of number of pixels, are used to derive road condition indices being followed in Germany. In comparison, the indices derived using deep learning based approach are found to more accurate than those derived using conventional approach.","PeriodicalId":185161,"journal":{"name":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128036038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Research on Human Motion Recognition Based on Computer Vision Technology 基于计算机视觉技术的人体运动识别研究
Jie Sun
{"title":"Research on Human Motion Recognition Based on Computer Vision Technology","authors":"Jie Sun","doi":"10.1145/3523111.3523123","DOIUrl":"https://doi.org/10.1145/3523111.3523123","url":null,"abstract":"In order to improve the accuracy of human motion recognition, computer vision technology is selected as the research tool, and a representation method of human bone and joint is proposed. This method takes the adaptive human motion bone center architecture as the action recognition tool, constructs the human motion action recognition model by calculating the angular acceleration and angular velocity of the neck and hip joints, and uses the model to calculate the action data and generate the recognition results. The experimental results show that the motion recognition accuracy of this algorithm is high, and it can be used as a motion recognition tool.","PeriodicalId":185161,"journal":{"name":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115999791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Modeling and Prediction of Invasive Systolic Blood Pressure after General Anesthesia Based on Fusion Algorithm 基于融合算法的全身麻醉后有创收缩压建模与预测
Ziyi Chen, Lei Zhang, Qianling Wang
{"title":"Modeling and Prediction of Invasive Systolic Blood Pressure after General Anesthesia Based on Fusion Algorithm","authors":"Ziyi Chen, Lei Zhang, Qianling Wang","doi":"10.1145/3523111.3523129","DOIUrl":"https://doi.org/10.1145/3523111.3523129","url":null,"abstract":"During surgery, invasive systolic blood pressure is an important basis for doctors to judge the patient's life state, which will directly affect the security of the surgery. Accurately predict the changes of invasive systolic blood pressure during general anesthesia help to reduce the risk of surgery. In order to cope with the increasing surgical risk by fluctuations of invasive systolic blood pressure, this paper optimized and combined the traditional machine learning algorithm, and put forward a new fusion algorithm to predict the invasive systolic blood pressure after general anesthesia. In the modeling process, the patients’ basic physical conditions, disease status, and intraoperative data collected by monitoring instrument during the surgical preparation stage were used as characteristic variable. In this paper, Linear Regression, Support Vector Machine Regression, Lasso Regression, and Ridge Regression were used to establish the new fusion algorithm. When the absolute error within 15mmHg, the fusion algorithm's predicting accuracy of invasive systolic blood pressure after general anesthesia reached 91.5%. The accurate prediction of invasive systolic blood pressure after general anesthesia in the preparation stage provides sufficient time for doctors to respond and reduces the risk of surgery to a certain extent.","PeriodicalId":185161,"journal":{"name":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128475830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Iterative Pruning-based Model Compression for Pose Estimation on Resource-constrained Devices 基于迭代剪枝的资源受限设备姿态估计模型压缩
Sung Hyun Choi, Wonje Choi, Youngseok Lee, Honguk Woo
{"title":"Iterative Pruning-based Model Compression for Pose Estimation on Resource-constrained Devices","authors":"Sung Hyun Choi, Wonje Choi, Youngseok Lee, Honguk Woo","doi":"10.1145/3523111.3523128","DOIUrl":"https://doi.org/10.1145/3523111.3523128","url":null,"abstract":"In this work, we propose a pruning-based model compression scheme, aiming at achieving an efficient model that has strength in both accuracy and inference time on an embedded device environment with limited resources. The proposed scheme consists of (1) pruning profiling and (2) iterative pruning via knowledge distillation. With the scheme, we develop a resource-efficient 2D pose estimation model using HRNet and evaluate the model on NVIDA JetsonNano with the Microsoft COCO keypoint dataset. Specifically, our compressed model obtains the fast pose estimation of 20.3 FPS on NVIDA JetsonNano, while maintaining a high accuracy of 74.1 AP. Compared to the conventional HRNet model without compression, the proposed compression technique achieves 33 % improvement in FPS with only 0.4 % degradation in AP.","PeriodicalId":185161,"journal":{"name":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116400320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Proceedings of the 2022 5th International Conference on Machine Vision and Applications 2022年第五届机器视觉与应用国际会议论文集
{"title":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","authors":"","doi":"10.1145/3523111","DOIUrl":"https://doi.org/10.1145/3523111","url":null,"abstract":"","PeriodicalId":185161,"journal":{"name":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129396307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信