{"title":"博士:基于深度学习的全景视频人体检测框架","authors":"Jinting Tang, Zhenhui Chen, Yongkai Huo, Peichang Zhang","doi":"10.1109/WCSP.2019.8928143","DOIUrl":null,"url":null,"abstract":"Panoramic video has attracted substantial research attention as the coming video format. It is capable of providing 360 degree immersive experience of omnidirectional visual information. State-of-the-art detection networks may fail to detect humans on spherical images, which are normally represented in deformed rectangular shapes. In this paper, we propose a socalled Panoramic Human Detection (PHD) scheme to address the task of human detection in panoramic videos. Moreover, the PHD method is designed to detect humans by extracting multiple overlapping sub-images from each integral spherical image, where three-dimensional rotation of spherical images is employed to ensure consistency of sub-images. Two detection box filters are designed for removing redundant boxes. Our PHD method is capable of accomplishing the task of human detection in various panoramic video types. Experiments prove that our PHD method outperforms the baseline by 35% and 48.6% in terms of precision and recall, respectively.","PeriodicalId":108635,"journal":{"name":"2019 11th International Conference on Wireless Communications and Signal Processing (WCSP)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"PHD: A Deep Learning Based Human Detection Framework for Panoramic Videos\",\"authors\":\"Jinting Tang, Zhenhui Chen, Yongkai Huo, Peichang Zhang\",\"doi\":\"10.1109/WCSP.2019.8928143\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Panoramic video has attracted substantial research attention as the coming video format. It is capable of providing 360 degree immersive experience of omnidirectional visual information. State-of-the-art detection networks may fail to detect humans on spherical images, which are normally represented in deformed rectangular shapes. In this paper, we propose a socalled Panoramic Human Detection (PHD) scheme to address the task of human detection in panoramic videos. Moreover, the PHD method is designed to detect humans by extracting multiple overlapping sub-images from each integral spherical image, where three-dimensional rotation of spherical images is employed to ensure consistency of sub-images. Two detection box filters are designed for removing redundant boxes. Our PHD method is capable of accomplishing the task of human detection in various panoramic video types. Experiments prove that our PHD method outperforms the baseline by 35% and 48.6% in terms of precision and recall, respectively.\",\"PeriodicalId\":108635,\"journal\":{\"name\":\"2019 11th International Conference on Wireless Communications and Signal Processing (WCSP)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 11th International Conference on Wireless Communications and Signal Processing (WCSP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WCSP.2019.8928143\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 11th International Conference on Wireless Communications and Signal Processing (WCSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WCSP.2019.8928143","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
PHD: A Deep Learning Based Human Detection Framework for Panoramic Videos
Panoramic video has attracted substantial research attention as the coming video format. It is capable of providing 360 degree immersive experience of omnidirectional visual information. State-of-the-art detection networks may fail to detect humans on spherical images, which are normally represented in deformed rectangular shapes. In this paper, we propose a socalled Panoramic Human Detection (PHD) scheme to address the task of human detection in panoramic videos. Moreover, the PHD method is designed to detect humans by extracting multiple overlapping sub-images from each integral spherical image, where three-dimensional rotation of spherical images is employed to ensure consistency of sub-images. Two detection box filters are designed for removing redundant boxes. Our PHD method is capable of accomplishing the task of human detection in various panoramic video types. Experiments prove that our PHD method outperforms the baseline by 35% and 48.6% in terms of precision and recall, respectively.