{"title":"一种CNN-RNN神经网络结合长短期记忆的人群计数和密度估计","authors":"Jingnan Fu, Hongbo Yang, Ping Liu, Yuzhen Hu","doi":"10.1109/AMCON.2018.8614939","DOIUrl":null,"url":null,"abstract":"Crowd counting and density estimation is a challenging task in the field of computer vision. Most of existing methods of this task are based on convolutional neural network (CNN), which have achieved good results in low-density scene. Usually, people who are far away from the camera appear to be denser and smaller, while those who are close to the camera are more sparse and larger, therefor, structure contains only CNN gives the poor performance in some high-density crowd scene because of the uneven distribution of the crowd through camera. To address this problem, this paper designs a CNN-RNN Crowd Counting Neural Network (CRCCNN), which introduces Long Short-Term Memory (LSTM) structure, we use CNN structure to extract the features of the whole image, and use the LSTM structure to extract the contextual information of crowd region. Since LSTM has a good memory of the input information of sequential samples, it can predict the crowd density very well even for the high density population. We perform our experiments on different datasets and compare with other existing methods, which achieve the outstanding results and demonstrate the effectiveness performance of CRCCNN.","PeriodicalId":438307,"journal":{"name":"2018 IEEE International Conference on Advanced Manufacturing (ICAM)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"A CNN-RNN Neural Network Join Long Short-Term Memory For Crowd Counting and Density Estimation\",\"authors\":\"Jingnan Fu, Hongbo Yang, Ping Liu, Yuzhen Hu\",\"doi\":\"10.1109/AMCON.2018.8614939\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Crowd counting and density estimation is a challenging task in the field of computer vision. Most of existing methods of this task are based on convolutional neural network (CNN), which have achieved good results in low-density scene. Usually, people who are far away from the camera appear to be denser and smaller, while those who are close to the camera are more sparse and larger, therefor, structure contains only CNN gives the poor performance in some high-density crowd scene because of the uneven distribution of the crowd through camera. To address this problem, this paper designs a CNN-RNN Crowd Counting Neural Network (CRCCNN), which introduces Long Short-Term Memory (LSTM) structure, we use CNN structure to extract the features of the whole image, and use the LSTM structure to extract the contextual information of crowd region. Since LSTM has a good memory of the input information of sequential samples, it can predict the crowd density very well even for the high density population. We perform our experiments on different datasets and compare with other existing methods, which achieve the outstanding results and demonstrate the effectiveness performance of CRCCNN.\",\"PeriodicalId\":438307,\"journal\":{\"name\":\"2018 IEEE International Conference on Advanced Manufacturing (ICAM)\",\"volume\":\"62 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE International Conference on Advanced Manufacturing (ICAM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AMCON.2018.8614939\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Advanced Manufacturing (ICAM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AMCON.2018.8614939","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A CNN-RNN Neural Network Join Long Short-Term Memory For Crowd Counting and Density Estimation
Crowd counting and density estimation is a challenging task in the field of computer vision. Most of existing methods of this task are based on convolutional neural network (CNN), which have achieved good results in low-density scene. Usually, people who are far away from the camera appear to be denser and smaller, while those who are close to the camera are more sparse and larger, therefor, structure contains only CNN gives the poor performance in some high-density crowd scene because of the uneven distribution of the crowd through camera. To address this problem, this paper designs a CNN-RNN Crowd Counting Neural Network (CRCCNN), which introduces Long Short-Term Memory (LSTM) structure, we use CNN structure to extract the features of the whole image, and use the LSTM structure to extract the contextual information of crowd region. Since LSTM has a good memory of the input information of sequential samples, it can predict the crowd density very well even for the high density population. We perform our experiments on different datasets and compare with other existing methods, which achieve the outstanding results and demonstrate the effectiveness performance of CRCCNN.