{"title":"无微调或迁移学习的热像仪行人检测中的域移位问题","authors":"M. Fanfani, Matteo Marulli, P. Nesi","doi":"10.1109/SMARTCOMP58114.2023.00078","DOIUrl":null,"url":null,"abstract":"The use of thermal imaging to detect the presence of people in indoor and outdoor environments is gaining an increasing attention given its wide applicability in the tourism, security, and mobility domains. However, due to the particular characteristics of different contexts, it is necessary to train/finetuning specifically object detectors for each scenario in order to obtain accurate results. This is due to changes in appearance caused by camera position, scene size, environmental factors, etc. In this paper, we present a data augmentation method that can improve both versatility and robustness of pedestrian detection models based on thermal images. Thanks to our solution, the trained model can deal with unseen thermal data from both indoor and outdoor environments, reliably detecting pedestrians regardless of their apparent size and position in the image, without any fine-tuning or transfer learning, therefore avoiding time consuming labeling activities to fine-tune and deploy the system in different scenarios.","PeriodicalId":163556,"journal":{"name":"2023 IEEE International Conference on Smart Computing (SMARTCOMP)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Addressing Domain Shift in Pedestrian Detection from Thermal Cameras without Fine-Tuning or Transfer Learning\",\"authors\":\"M. Fanfani, Matteo Marulli, P. Nesi\",\"doi\":\"10.1109/SMARTCOMP58114.2023.00078\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The use of thermal imaging to detect the presence of people in indoor and outdoor environments is gaining an increasing attention given its wide applicability in the tourism, security, and mobility domains. However, due to the particular characteristics of different contexts, it is necessary to train/finetuning specifically object detectors for each scenario in order to obtain accurate results. This is due to changes in appearance caused by camera position, scene size, environmental factors, etc. In this paper, we present a data augmentation method that can improve both versatility and robustness of pedestrian detection models based on thermal images. Thanks to our solution, the trained model can deal with unseen thermal data from both indoor and outdoor environments, reliably detecting pedestrians regardless of their apparent size and position in the image, without any fine-tuning or transfer learning, therefore avoiding time consuming labeling activities to fine-tune and deploy the system in different scenarios.\",\"PeriodicalId\":163556,\"journal\":{\"name\":\"2023 IEEE International Conference on Smart Computing (SMARTCOMP)\",\"volume\":\"18 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Smart Computing (SMARTCOMP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SMARTCOMP58114.2023.00078\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Smart Computing (SMARTCOMP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SMARTCOMP58114.2023.00078","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Addressing Domain Shift in Pedestrian Detection from Thermal Cameras without Fine-Tuning or Transfer Learning
The use of thermal imaging to detect the presence of people in indoor and outdoor environments is gaining an increasing attention given its wide applicability in the tourism, security, and mobility domains. However, due to the particular characteristics of different contexts, it is necessary to train/finetuning specifically object detectors for each scenario in order to obtain accurate results. This is due to changes in appearance caused by camera position, scene size, environmental factors, etc. In this paper, we present a data augmentation method that can improve both versatility and robustness of pedestrian detection models based on thermal images. Thanks to our solution, the trained model can deal with unseen thermal data from both indoor and outdoor environments, reliably detecting pedestrians regardless of their apparent size and position in the image, without any fine-tuning or transfer learning, therefore avoiding time consuming labeling activities to fine-tune and deploy the system in different scenarios.