{"title":"基于多信息融合的楼梯目标检测算法","authors":"Weifeng Kong, Zhiying Tan, Xu Tao, Wenbo Fan, Yanjun Ji, Meiling Wang, Xiaobin Xu","doi":"10.1145/3598151.3598187","DOIUrl":null,"url":null,"abstract":"In view of the mobile robot's perception of the environment in the process of moving, the method of deep learning is used to detect the target terrain in order to send follow-up instructions to the robot. In this paper, a lightweight stairs detection algorithm based on improved YOLOv5-Lite is proposed, which uses RepVGG as the backbone network to extract features to build YOLOv5-Lite lightweight neural network model, and removes the Focus layer in the backbone network and replaces it with a 6x6 convolution layer to reduce the number of network parameters to improve the speed of model detection. Through the introduction of BasicRFB_s pooling layer, increase the receptive field of the network, and add C3SE attention mechanism in the pooling layer to reduce pooling loss. To decrease the mistakes, we also segment the point cloud of stairs into vertical and horizontal planes by the difference of normal and consider their probability distribution for the further detection. The experimental results show that the accuracy of the improved lightweight algorithm can reach 97.1%, and the frame rate can reach 51 fps. After segmenting the point cloud, the accuracy of detection method can reach 99.9% and the frame rate can reach 46 fps. The results show that the proposed method has the characteristics of real-time, lightweight and high accuracy.","PeriodicalId":398644,"journal":{"name":"Proceedings of the 2023 3rd International Conference on Robotics and Control Engineering","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Object Detection Algorithm of Stairs Based on Multi-information Fusion\",\"authors\":\"Weifeng Kong, Zhiying Tan, Xu Tao, Wenbo Fan, Yanjun Ji, Meiling Wang, Xiaobin Xu\",\"doi\":\"10.1145/3598151.3598187\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In view of the mobile robot's perception of the environment in the process of moving, the method of deep learning is used to detect the target terrain in order to send follow-up instructions to the robot. In this paper, a lightweight stairs detection algorithm based on improved YOLOv5-Lite is proposed, which uses RepVGG as the backbone network to extract features to build YOLOv5-Lite lightweight neural network model, and removes the Focus layer in the backbone network and replaces it with a 6x6 convolution layer to reduce the number of network parameters to improve the speed of model detection. Through the introduction of BasicRFB_s pooling layer, increase the receptive field of the network, and add C3SE attention mechanism in the pooling layer to reduce pooling loss. To decrease the mistakes, we also segment the point cloud of stairs into vertical and horizontal planes by the difference of normal and consider their probability distribution for the further detection. The experimental results show that the accuracy of the improved lightweight algorithm can reach 97.1%, and the frame rate can reach 51 fps. After segmenting the point cloud, the accuracy of detection method can reach 99.9% and the frame rate can reach 46 fps. The results show that the proposed method has the characteristics of real-time, lightweight and high accuracy.\",\"PeriodicalId\":398644,\"journal\":{\"name\":\"Proceedings of the 2023 3rd International Conference on Robotics and Control Engineering\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2023 3rd International Conference on Robotics and Control Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3598151.3598187\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2023 3rd International Conference on Robotics and Control Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3598151.3598187","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Object Detection Algorithm of Stairs Based on Multi-information Fusion
In view of the mobile robot's perception of the environment in the process of moving, the method of deep learning is used to detect the target terrain in order to send follow-up instructions to the robot. In this paper, a lightweight stairs detection algorithm based on improved YOLOv5-Lite is proposed, which uses RepVGG as the backbone network to extract features to build YOLOv5-Lite lightweight neural network model, and removes the Focus layer in the backbone network and replaces it with a 6x6 convolution layer to reduce the number of network parameters to improve the speed of model detection. Through the introduction of BasicRFB_s pooling layer, increase the receptive field of the network, and add C3SE attention mechanism in the pooling layer to reduce pooling loss. To decrease the mistakes, we also segment the point cloud of stairs into vertical and horizontal planes by the difference of normal and consider their probability distribution for the further detection. The experimental results show that the accuracy of the improved lightweight algorithm can reach 97.1%, and the frame rate can reach 51 fps. After segmenting the point cloud, the accuracy of detection method can reach 99.9% and the frame rate can reach 46 fps. The results show that the proposed method has the characteristics of real-time, lightweight and high accuracy.