{"title":"人群计数的预训练卷积网络","authors":"Shining, Shaojiayu","doi":"10.1109/CTISC52352.2021.00062","DOIUrl":null,"url":null,"abstract":"Image crowd density estimation is widely used in video surveillance, traffic surveillance and public safety. Recently, the convolutional neural network based approach has shown better results in crowd counting than the traditional approach. However, there are still a lot of limitations and difficulties in practical application :large-range number of people, changeable environment result in the current methods fail work well. In addition, Due to the lack of practical data, many methods suffer from over-fitting in varying degrees. To solve these two problems, first, we built a data collector and annotator, which can generate a large number of synthetic crowd scenes without any labor cost, and annotate them automatically. Secondly, we use pre-training method to pre-train on the synthetic dataset, and then fine-tune with the real data to effectively improve the performance of the model in the real scene. Extensive experiments show that the method is achieves the state-of-the-art performance on four real datasets.","PeriodicalId":268378,"journal":{"name":"2021 3rd International Conference on Advances in Computer Technology, Information Science and Communication (CTISC)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Pre-Training Convolution Network for Crowd Counting\",\"authors\":\"Shining, Shaojiayu\",\"doi\":\"10.1109/CTISC52352.2021.00062\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Image crowd density estimation is widely used in video surveillance, traffic surveillance and public safety. Recently, the convolutional neural network based approach has shown better results in crowd counting than the traditional approach. However, there are still a lot of limitations and difficulties in practical application :large-range number of people, changeable environment result in the current methods fail work well. In addition, Due to the lack of practical data, many methods suffer from over-fitting in varying degrees. To solve these two problems, first, we built a data collector and annotator, which can generate a large number of synthetic crowd scenes without any labor cost, and annotate them automatically. Secondly, we use pre-training method to pre-train on the synthetic dataset, and then fine-tune with the real data to effectively improve the performance of the model in the real scene. Extensive experiments show that the method is achieves the state-of-the-art performance on four real datasets.\",\"PeriodicalId\":268378,\"journal\":{\"name\":\"2021 3rd International Conference on Advances in Computer Technology, Information Science and Communication (CTISC)\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 3rd International Conference on Advances in Computer Technology, Information Science and Communication (CTISC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CTISC52352.2021.00062\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 3rd International Conference on Advances in Computer Technology, Information Science and Communication (CTISC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CTISC52352.2021.00062","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Pre-Training Convolution Network for Crowd Counting
Image crowd density estimation is widely used in video surveillance, traffic surveillance and public safety. Recently, the convolutional neural network based approach has shown better results in crowd counting than the traditional approach. However, there are still a lot of limitations and difficulties in practical application :large-range number of people, changeable environment result in the current methods fail work well. In addition, Due to the lack of practical data, many methods suffer from over-fitting in varying degrees. To solve these two problems, first, we built a data collector and annotator, which can generate a large number of synthetic crowd scenes without any labor cost, and annotate them automatically. Secondly, we use pre-training method to pre-train on the synthetic dataset, and then fine-tune with the real data to effectively improve the performance of the model in the real scene. Extensive experiments show that the method is achieves the state-of-the-art performance on four real datasets.