Jin Wang, Xiaobo Sun, Mingze Wang, Changwu Li, Xiaoman Tang
{"title":"YOLOv4-based semantic information extraction for indoor scene targets fetching algorithm research","authors":"Jin Wang, Xiaobo Sun, Mingze Wang, Changwu Li, Xiaoman Tang","doi":"10.1145/3603781.3603864","DOIUrl":null,"url":null,"abstract":"This paper investigates the semantic information extraction algorithm for indoor scene targets. The YOLOv4 algorithm is preferred, and the Leaky ReLU function is preferred as the new activation function scheme through the typical activation function comparison experiments to address the problems of YOLOv4 activation function preference and poor multi-scale representation of indoor targets; the attention fusion mechanism is introduced to improve the classification accuracy of the network. Experiments on the homemade Indoor-COCO indoor scene dataset show that the detection accuracy reaches 42.09%, which improves the accuracy of semantic information.","PeriodicalId":391180,"journal":{"name":"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things","volume":"435 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3603781.3603864","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper investigates the semantic information extraction algorithm for indoor scene targets. The YOLOv4 algorithm is preferred, and the Leaky ReLU function is preferred as the new activation function scheme through the typical activation function comparison experiments to address the problems of YOLOv4 activation function preference and poor multi-scale representation of indoor targets; the attention fusion mechanism is introduced to improve the classification accuracy of the network. Experiments on the homemade Indoor-COCO indoor scene dataset show that the detection accuracy reaches 42.09%, which improves the accuracy of semantic information.