{"title":"Study on the Headgear and Seat of the Thangka Image based on the Improved YOLOv4 Algorithm","authors":"Guoyuan He, Wenjin Hu, Huiyuan Tang, Panpan Xue","doi":"10.1109/ISCTT51595.2020.00034","DOIUrl":null,"url":null,"abstract":"Aiming at the problems of complex background and low accuracy in Thangka image detection tasks using deep learning algorithms, an improved object detection algorithm based on YOLOv4 is proposed. Considering that the Mish function used by YOLOv4 is static, which performs the same operation on all input samples, hence it is not enough to deal with complex scenarios. We use the DynamicReLU function to encode the global up and down into a hyperfunction, and dynamically adjust the piecewise linear activation function instead of the Mish function accordingly, which solves the defect that the activation function cannot be dynamically adjusted, and realizes the detection of the headgear and seat of the Thangka image. The experimental results in Thangka image detection show that the evaluation protocol mAP of the algorithm reaches 37.9%, which is an increase of 3.8% compared to the YOLOv4 algorithm. The algorithm improves detection accuracy without increasing the depth and width of the network.","PeriodicalId":178054,"journal":{"name":"2020 5th International Conference on Information Science, Computer Technology and Transportation (ISCTT)","volume":"185 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Information Science, Computer Technology and Transportation (ISCTT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCTT51595.2020.00034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Aiming at the problems of complex background and low accuracy in Thangka image detection tasks using deep learning algorithms, an improved object detection algorithm based on YOLOv4 is proposed. Considering that the Mish function used by YOLOv4 is static, which performs the same operation on all input samples, hence it is not enough to deal with complex scenarios. We use the DynamicReLU function to encode the global up and down into a hyperfunction, and dynamically adjust the piecewise linear activation function instead of the Mish function accordingly, which solves the defect that the activation function cannot be dynamically adjusted, and realizes the detection of the headgear and seat of the Thangka image. The experimental results in Thangka image detection show that the evaluation protocol mAP of the algorithm reaches 37.9%, which is an increase of 3.8% compared to the YOLOv4 algorithm. The algorithm improves detection accuracy without increasing the depth and width of the network.