Ju-Chan Kim, D. Le, S. Song, C. Son, Hyunseung Choo
{"title":"Multi-modal Fundus Image Registration with Deep Feature Matching and Image Scaling","authors":"Ju-Chan Kim, D. Le, S. Song, C. Son, Hyunseung Choo","doi":"10.1109/IMCOM53663.2022.9721768","DOIUrl":null,"url":null,"abstract":"Multi-modal image registration is a technology that converts heterogeneous spatial coordinate systems of different images into one unified coordinate system. This is a fundamental task of medical image analysis as well as computer vision domain because it facilitates a comprehensive understanding of images captured by aligning two or more images. In the field of ophthalmology, the registration is beneficial for ophthalmologists in clinical trial diagnosis, plan treatment, and image-guided surgery. However, registering for two types of fundus image, including conventional and ultra-wide-field fundus images, is a challenging task due to the highly difference in scales of the images. The paper proposes a method of scaling and register images based on common features (e.g., optic disc) of the two fundus image types taken from the same patient. This method improves the performance of multi-modal image registration by reducing the distance in the homogeneous coordinate system of two images through image scaling. The proposed method improves about 13% correct keypoints compared to the conventional deep learning method.","PeriodicalId":367038,"journal":{"name":"2022 16th International Conference on Ubiquitous Information Management and Communication (IMCOM)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 16th International Conference on Ubiquitous Information Management and Communication (IMCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMCOM53663.2022.9721768","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Multi-modal image registration is a technology that converts heterogeneous spatial coordinate systems of different images into one unified coordinate system. This is a fundamental task of medical image analysis as well as computer vision domain because it facilitates a comprehensive understanding of images captured by aligning two or more images. In the field of ophthalmology, the registration is beneficial for ophthalmologists in clinical trial diagnosis, plan treatment, and image-guided surgery. However, registering for two types of fundus image, including conventional and ultra-wide-field fundus images, is a challenging task due to the highly difference in scales of the images. The paper proposes a method of scaling and register images based on common features (e.g., optic disc) of the two fundus image types taken from the same patient. This method improves the performance of multi-modal image registration by reducing the distance in the homogeneous coordinate system of two images through image scaling. The proposed method improves about 13% correct keypoints compared to the conventional deep learning method.