Jin Wang, Xiaobo Sun, Mingze Wang, Changwu Li, Xiaoman Tang
{"title":"基于yolov4语义信息提取的室内场景目标提取算法研究","authors":"Jin Wang, Xiaobo Sun, Mingze Wang, Changwu Li, Xiaoman Tang","doi":"10.1145/3603781.3603864","DOIUrl":null,"url":null,"abstract":"This paper investigates the semantic information extraction algorithm for indoor scene targets. The YOLOv4 algorithm is preferred, and the Leaky ReLU function is preferred as the new activation function scheme through the typical activation function comparison experiments to address the problems of YOLOv4 activation function preference and poor multi-scale representation of indoor targets; the attention fusion mechanism is introduced to improve the classification accuracy of the network. Experiments on the homemade Indoor-COCO indoor scene dataset show that the detection accuracy reaches 42.09%, which improves the accuracy of semantic information.","PeriodicalId":391180,"journal":{"name":"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things","volume":"435 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"YOLOv4-based semantic information extraction for indoor scene targets fetching algorithm research\",\"authors\":\"Jin Wang, Xiaobo Sun, Mingze Wang, Changwu Li, Xiaoman Tang\",\"doi\":\"10.1145/3603781.3603864\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper investigates the semantic information extraction algorithm for indoor scene targets. The YOLOv4 algorithm is preferred, and the Leaky ReLU function is preferred as the new activation function scheme through the typical activation function comparison experiments to address the problems of YOLOv4 activation function preference and poor multi-scale representation of indoor targets; the attention fusion mechanism is introduced to improve the classification accuracy of the network. Experiments on the homemade Indoor-COCO indoor scene dataset show that the detection accuracy reaches 42.09%, which improves the accuracy of semantic information.\",\"PeriodicalId\":391180,\"journal\":{\"name\":\"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things\",\"volume\":\"435 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3603781.3603864\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 4th International Conference on Computing, Networks and Internet of Things","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3603781.3603864","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
YOLOv4-based semantic information extraction for indoor scene targets fetching algorithm research
This paper investigates the semantic information extraction algorithm for indoor scene targets. The YOLOv4 algorithm is preferred, and the Leaky ReLU function is preferred as the new activation function scheme through the typical activation function comparison experiments to address the problems of YOLOv4 activation function preference and poor multi-scale representation of indoor targets; the attention fusion mechanism is introduced to improve the classification accuracy of the network. Experiments on the homemade Indoor-COCO indoor scene dataset show that the detection accuracy reaches 42.09%, which improves the accuracy of semantic information.