{"title":"Object detection for rotated and densely arranged objects in aerial images using path aggregated feature pyramid networks","authors":"Xiangyu Liu, Hong Pan, Xinde Li","doi":"10.1117/12.2538090","DOIUrl":"https://doi.org/10.1117/12.2538090","url":null,"abstract":"Object detection based on deep learning algorithms has been an important yet challenging research field in computer vision. The feature pyramid network has become a dominant network architecture in many detection applications because of its powerful feature learning ability for objects with varying scales. To address the challenges in detecting small and densely packed objects, this paper proposes an innovative object detection approach by combining the path aggregation scheme and the feature pyramid network into a unified framework. Specifically, we add a bottom-up branch with lateral connection onto the existing feature pyramid network and apply adaptive feature fusion strategy, which improves the detection performance for small and densely arranged objects in remote sensing images. Experiment results show that our proposed path aggregated feature pyramid network can improve the detection performance for diverse objects in aerial images.","PeriodicalId":384253,"journal":{"name":"International Symposium on Multispectral Image Processing and Pattern Recognition","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133944580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multiscale DEM generation on basis of singular value decomposition","authors":"Caixian Zhang, Jun He, Wenguang Hou","doi":"10.1117/12.2537903","DOIUrl":"https://doi.org/10.1117/12.2537903","url":null,"abstract":"As the fundamental data about the terrains, DEM plays an important role in many fields. The high resolution DEM is increasingly popular. Yet, the multiscale resolution DEMs are still desired for some applications due to the fact that the low resolution DEM can reduce the memory demands with limited computational complexity. Then, how to obtain the multiscale DEMs remains an open question, which demands that the different resolution DEMs should discard the detailed information with maintaining the main information of the high resolution DEM. Moreover, the multiscale DEMs should not cost many memories. Generally, there is a contradiction. As such, this paper proposes a multiscale DEM generation method based on Singular Value Decomposition (SVD) which can establish multiscale DEMs maintaining the different details with a small quantity of memory increasement. The method is simple but effective. Lots of experiment shows its effectiveness.","PeriodicalId":384253,"journal":{"name":"International Symposium on Multispectral Image Processing and Pattern Recognition","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132960148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Comparative study on colorimetric characterization of LCD based on polynomial","authors":"Tong Li, K. Xie, Hongtai Guo, Zixuan Li, Qi Zheng","doi":"10.1117/12.2539459","DOIUrl":"https://doi.org/10.1117/12.2539459","url":null,"abstract":"LCD is a common display device. Due to the device dependence in color space, it is necessary to characterize LCD. In this paper, polynomial regression method is used to establish the color conversion model from RGB to CIEXYZ for colorimetric characterization of LCD, and black point correction is added to solve different polynomial parameters and compare the color difference. In the experiment, 17 groups of training samples were selected to solve the parameters and 200 groups of random test samples were used to verify the accuracy of the model. The experimental results show that the cubic polynomial curve model has the highest accuracy and the maximum chromatic aberration is 4.2962, which achieves better display effect.","PeriodicalId":384253,"journal":{"name":"International Symposium on Multispectral Image Processing and Pattern Recognition","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131083430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Smoke detection in infrared images based on superpixel segmentation","authors":"Min Dai, Peng Gao, Mozhou Sha, J. Tian","doi":"10.1117/12.2538195","DOIUrl":"https://doi.org/10.1117/12.2538195","url":null,"abstract":"Infrared smoke interference technology seriously infected the combat effectiveness of photoelectric guided weapons in modern warfare. As a result of the occlusion caused by smoke screen, the robustness of image matching guidance algorithm will decrease. Thus, to judge whether there is smoke interference in images and smoke screen area extraction are of great importance for the accuracy of image matching guidance algorithm. However, most of the smoke detection methods aimed at fire early warning, so that they focused on whether smoke exists or not. While both of the discrimination of smoke interference and smoke screen area extraction are what we concern. In this paper, a smoke detection method based on superpixel segmentation and region merging is proposed. Firstly, over-segmentation regions of input infrared image with superpixel segmentation are obtained. Then, fusion texture feature of the image is computed. Finally, superpixel regions are merged based on the fusion features of each superpixel block obtained in the previous step and smoke screen area extraction is completed.","PeriodicalId":384253,"journal":{"name":"International Symposium on Multispectral Image Processing and Pattern Recognition","volume":"173 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132741244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xiechang Sun, Hao Jiang, Tongtong Huo, Weidong Yang
{"title":"A fast multi-target detection method based on improved YOLO","authors":"Xiechang Sun, Hao Jiang, Tongtong Huo, Weidong Yang","doi":"10.1117/12.2539386","DOIUrl":"https://doi.org/10.1117/12.2539386","url":null,"abstract":"Detection of sea surface targets in large-scale remote sensing images is one of the important research topics of ocean remote sensing technology. Ocean remote sensing images have the characteristics of wide format, strong interference and small target. This paper adopts the spinning target detection method, and proposes a ship detection model based on YOLO to output the real length, width and axial information. The model can accurately output the position, length and width and axial information of a ship target by predicting the minimum external rectangular area of the ship target, so as to realize multi-target detection and improve the detection performance significantly. To improve the recall rate of the target detection algorithm, this paper adopts the spinning target detection method, and proposes a ship detection model based on YOLO. Through redefining the representation of the rotation matrix and redesigning a new network loss function and the rotated IOU computing method, this model accurately outputs the real length, width and axial information, increases the output feature dimensions, and effectively raises the recall rate and speed of multi-target detection. Lastly, to improve the practicability of the algorithm on mobile devices, the model is processed in a lightweight way. Its parameters are significantly reduced while the detection accuracy is ensured.","PeriodicalId":384253,"journal":{"name":"International Symposium on Multispectral Image Processing and Pattern Recognition","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127389826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Guoping Xu, Hanqiang Cao, J. Udupa, Chunyi Yue, Youli Dong, Li Cao, D. Torigian
{"title":"A novel exponential loss function for pathological lymph node image classification","authors":"Guoping Xu, Hanqiang Cao, J. Udupa, Chunyi Yue, Youli Dong, Li Cao, D. Torigian","doi":"10.1117/12.2537004","DOIUrl":"https://doi.org/10.1117/12.2537004","url":null,"abstract":"Recent progress in deep learning, especially deep convolutional neural networks (DCNNs), has led to significant improvement in natural image classification. However, research is still ongoing in the domain of medical image analysis in part due to the shortage of annotated data sets for training DCNNs, the imbalanced number of positive and negative samples, and the difference between medical images and natural images. In this paper, two strategies are proposed to train a DCNN for pathological lymph node image classification. Firstly, the transfer learning strategy is used to deal with the shortage of training samples. Second, a novel exponential loss function is presented for the imbalance in training samples. Four state-of-the-art DCNNs (GoogleNet, ResNet101, Xception, and MobileNetv2) are tested. The experiments demonstrate that the two strategies are effective to improve the performance of pathological lymph node image classification in terms of accuracy and sensitivity with a mean of 0.13% and 1.50%, respectively, for the four DCNNs. In particular, the proposed exponential loss function improved the sensitivity by 3.9% and 4.0% for Xception and ResNet101, respectively.","PeriodicalId":384253,"journal":{"name":"International Symposium on Multispectral Image Processing and Pattern Recognition","volume":"2016 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127426256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yu Sun, Zhicheng Wang, Jingjing Fei, Ling Chen, Gang Wei
{"title":"ATSGPN: adaptive threshold instance segmentation network in 3D point cloud","authors":"Yu Sun, Zhicheng Wang, Jingjing Fei, Ling Chen, Gang Wei","doi":"10.1117/12.2541582","DOIUrl":"https://doi.org/10.1117/12.2541582","url":null,"abstract":"We introduce an adaptive threshold instance segmentation network in point cloud based on similarity group proposal network(SGPN), named adaptive threshold similarity group proposal network(ATSGPN). SGPN learns the feature of point cloud to process similarity matrix and clusters. In our experiments, we find that we cannot always get the proper threshold by heuristic method to divide the points although the similarity matrix is good enough. Based on this idea, we introduce the Threshold Map to learn segmentation threshold. We also improve the feature extraction using edge convolution(EdgeConv). The point cloud first passes EdgeConv to extract features and learns the similarity matrix in feature space. The semantic label of each point and the segmentation threshold can help to generate groups and then calculates confidence to evaluate the group quality and backpropagation. ATSGPN has higher accuracy on Stanford Large- Scale 3D Indoor Spaces Dataset (S3SID) and fewer steps than SGPN, and there are some experiments can be shown in the paper for its good performance.","PeriodicalId":384253,"journal":{"name":"International Symposium on Multispectral Image Processing and Pattern Recognition","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116341487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Study on sophisticated vegetation classification for AHSI/GF-5 remote sensing data","authors":"K. Shang, Yisong Xie, Hongyan Wei","doi":"10.1117/12.2539369","DOIUrl":"https://doi.org/10.1117/12.2539369","url":null,"abstract":"A detailed distribution map of different vegetation classes is of great importance for us to analyze the global ecosystem. Compared with traditional remote sensing data, hyperspectral remote sensing (HRS) data have hundreds of spectral bands and continuous spectral curves, showing great potential in sophisticated vegetation classification. And the AHSI (Advance Hyper-Spectral Imager) on-board GF-5 satellite has addressed the problem of lacking in satellite HRS data. According to the characteristics of AHSI data, we propose a modified sophisticated vegetation classification method by constructing and optimizing a vegetation feature set (FBS). This method takes the band quality, vegetation biochemical parameters, and neighborhood pixels’ spectral angle distance into consideration. The results show that our method can obtain better classification results than traditional methods with higher overall accuracy and less salt and pepper noise, indicating that it is feasible to distinguish different kinds of vegetation using the AHSI/GF-5 data.","PeriodicalId":384253,"journal":{"name":"International Symposium on Multispectral Image Processing and Pattern Recognition","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116351816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shihong Qin, Ruixing Wang, Hongcheng Zhou, Zhao Xiong, Jun Jian, Ziyi Mei
{"title":"Outdoor cycle three-dimensional intelligent parking lot system","authors":"Shihong Qin, Ruixing Wang, Hongcheng Zhou, Zhao Xiong, Jun Jian, Ziyi Mei","doi":"10.1117/12.2557558","DOIUrl":"https://doi.org/10.1117/12.2557558","url":null,"abstract":"In view of the current problem that there are too many social vehicles, and the parking spaces can not meet the needs, an outdoor cycle three-dimensional intelligent parking lot was designed. The parking lot can double the number of parking spaces in the original land area and can provide more intelligent services than the ordinary parking lot. In addition, it can make parking and fetching more convenient for the owners. PLC control and Zigbee wireless communication is used to combine the rotating parking space with the three-dimensional garage, which supports smart card, two-dimensional code reservation, and other convenient services. This design can intelligently manage the vehicles and parking spaces in the entire parking lot.","PeriodicalId":384253,"journal":{"name":"International Symposium on Multispectral Image Processing and Pattern Recognition","volume":"15 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120824591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A novel self-relearning approach for Landsat image change detection","authors":"Xiaoyu Chang, Xin Huang, Jiayi Li","doi":"10.1117/12.2538192","DOIUrl":"https://doi.org/10.1117/12.2538192","url":null,"abstract":"Land cover composition and change are important aspects for many scientific research and socioeconomic assessments. The multi-date land cover change detection is generally more difficult and time-consuming to select enough training samples when considering multi-date image labels at the same location. To improve the accuracy for multi-date change detection, this study proposed a new algorithm framework, combining self-learning and relearning algorithm. Wuhan was selected as the experimental area, and Landsat images in 2005 and 2016 were used to extract six main types of change classes. Firstly, PCM (primitive co-occurrence matrix) and the minimum class certainty are used to ensure the high confidence of selected candidate set samples, and then the most informative samples are identified for classification from the candidate samples. To save computing costs, we adopt clustering method to reduce the self-relearning samples. Based on our experimental results, the self-relearning algorithm increases the final classification accuracy by approximately 2.5% (from 92.64% to 95.09%) in the case of using few initial training samples, providing a feasible solution for the multi-date change detection.","PeriodicalId":384253,"journal":{"name":"International Symposium on Multispectral Image Processing and Pattern Recognition","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124788623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}