New approach for human detection in images using histograms of oriented gradients
Authors: Narges Ghaedi Bardeh, M. Palhang
Venue: 2013 21st Iranian Conference on Electrical Engineering (ICEE)
Publication date: 2013-05-14
DOI: 10.1109/IRANIANCEE.2013.6599619
Citations: 13
Abstract
Histograms of Oriented Gradients (HoG) is one of the most widely used descriptors in human detection. Although it performs well compared to other descriptors in the area, the dimension of the descriptor vectors becomes extremely large as the size and number of images increase, which makes the training process computationally expensive. To overcome this, this paper presents a human detection method based on the bag-of-features model. Visual words are image patches described with HoG and then clustered using the K-means algorithm. To highlight the most important visual words, a weighting method can be applied to the descriptor vectors; here we use Term Frequency-Inverse Document Frequency (TF-IDF), which has been used in document classification. In the proposed approach, a Support Vector Machine (SVM) is used as the binary classifier. We applied the proposed method to the MIT and INRIA datasets and compared its performance with a similar method in the literature. The results of our experiments show that our method performs at least as well as other available methods.
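The bag-of-features and weighting steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not say which TF-IDF variant was used, so the common form tf · log(N / df) is assumed here. The codebook is assumed to be already available (for example, as K-means cluster centres), the 2-D vectors are toy stand-ins for real HoG patch descriptors, and the function names `nearest_word`, `word_counts`, and `tf_idf` are hypothetical. The SVM training step is omitted.

```python
import math
from collections import Counter


def nearest_word(descriptor, codebook):
    """Return the index of the codebook centre closest to the descriptor."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)), key=lambda k: sq_dist(descriptor, codebook[k]))


def word_counts(patch_descriptors, codebook):
    """Count how often each visual word occurs among one image's patches."""
    counts = Counter(nearest_word(d, codebook) for d in patch_descriptors)
    return [counts.get(k, 0) for k in range(len(codebook))]


def tf_idf(count_vectors):
    """Weight raw counts by tf * log(N / df) (assumed variant, see lead-in)."""
    n_images = len(count_vectors)
    n_words = len(count_vectors[0])
    # Document frequency: number of images in which each visual word appears.
    df = [sum(1 for v in count_vectors if v[k] > 0) for k in range(n_words)]
    weighted = []
    for v in count_vectors:
        total = sum(v) or 1  # guard against an image with no patches
        weighted.append([
            (v[k] / total) * math.log(n_images / df[k]) if df[k] else 0.0
            for k in range(n_words)
        ])
    return weighted


# Toy codebook of three visual words (assumed to come from K-means).
codebook = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]]

# Two toy "images", each a list of patch descriptors.
images = [
    [[0.1, 0.0], [0.9, 1.1], [1.0, 0.8]],
    [[0.0, 0.2], [5.1, 4.9], [4.8, 5.2]],
]

counts = [word_counts(img, codebook) for img in images]
features = tf_idf(counts)
```

In this toy example the first visual word occurs in both images, so its weight is zero, while the words that occur in only one image keep a positive weight. Each resulting vector has a fixed length equal to the codebook size, regardless of how many patches the image contains, and would serve as the input to the binary classifier.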