Chia-Lin Wu, Chih-Yang Lin, Phanuvich Hirunsirisombut, Hui-Fuang Ng, T. Shih
{"title":"Searching ROI for Object Detection based on CNN","authors":"Chia-Lin Wu, Chih-Yang Lin, Phanuvich Hirunsirisombut, Hui-Fuang Ng, T. Shih","doi":"10.1109/ISPACS48206.2019.8986381","DOIUrl":null,"url":null,"abstract":"Several studies have explored the structural design of CNN to improve the network's performance since a well-designed feature extractor can benefit convolution-based tasks. Although CNNs are able to learn important patterns on raw images, images may contain unpredictable noise that can negatively influence the convolutional stage. Feature extraction cannot always accurately capture the desired features based solely on the input image, but including extra information could improve the result. This paper proposes a fusion input design to generate an additional feature that a CNN can use to provide extra ROI information. Whether a model can utilize the additional information is a determining factor that affects the performance improvement. The proposed method is tested on two public datasets with different structural designs. Overall, the results indicate that additional ROI information can deliver benefits to specific tasks.","PeriodicalId":6765,"journal":{"name":"2019 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","volume":"31 1","pages":"1-2"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPACS48206.2019.8986381","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Several studies have explored the structural design of CNN to improve the network's performance since a well-designed feature extractor can benefit convolution-based tasks. Although CNNs are able to learn important patterns on raw images, images may contain unpredictable noise that can negatively influence the convolutional stage. Feature extraction cannot always accurately capture the desired features based solely on the input image, but including extra information could improve the result. This paper proposes a fusion input design to generate an additional feature that a CNN can use to provide extra ROI information. Whether a model can utilize the additional information is a determining factor that affects the performance improvement. The proposed method is tested on two public datasets with different structural designs. Overall, the results indicate that additional ROI information can deliver benefits to specific tasks.