A Display Method of Large Underwater Photo-Mosaics Based on Pyramid Model of Tiled Images

Nannan Liu, Xiaoming Li
{"title":"A Display Method of Large Underwater Photo-Mosaics Based on Pyramid Model of Tiled Images","authors":"Nannan Liu, Xiaoming Li","doi":"10.1109/CCAI50917.2021.9447465","DOIUrl":null,"url":null,"abstract":"In deep seafloor exploration, optical imaging provides short-range, high resolution visual information. In order to obtain a wide range of detailed visual information, it is common to stitch multiple images into a photo-mosaic which could reach tens of billions of pixels in size. Due to hardware and software constraints, it is very difficult even impossible to browse and display such a large image with existing image viewers. In this paper, we propose a display method based on pyramid model and develop a super large image display system of tiled images dedicated to underwater photo-mosaics display. Before the image can be displayed, the first step is to construct and store the multi-resolution hierarchical model of tiles, in which sub-mosaics of the same size are original tiles with the highest resolution. Similar to quadtree coding, each tile is encoded based on its location in the pyramid. Then all tile data are stored in MongoDB database. Each record in MongoDB is a key-value pair structure corresponding to a specific tile, in which key is the encoding of a tile and value is the tiled image data stored in binary stream format. Image display is implemented based on graphical user interface. By mouse operation, the system can display images at different resolutions and browse different part of the image. Based on the pyramid model and key-value storage structure, there are more than 60,000 high-definition tiled images in our application. The full panorama resolution is 16.2 billion pixels, about 45GB in RAM, but only the tiles displayed in the current window need to be loaded. Hence, our system reduces the memory requirement greatly and makes image browsing more smoothly.","PeriodicalId":121785,"journal":{"name":"2021 International Conference on Computer Communication and Artificial Intelligence (CCAI)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computer Communication and Artificial Intelligence (CCAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCAI50917.2021.9447465","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In deep seafloor exploration, optical imaging provides short-range, high resolution visual information. In order to obtain a wide range of detailed visual information, it is common to stitch multiple images into a photo-mosaic which could reach tens of billions of pixels in size. Due to hardware and software constraints, it is very difficult even impossible to browse and display such a large image with existing image viewers. In this paper, we propose a display method based on pyramid model and develop a super large image display system of tiled images dedicated to underwater photo-mosaics display. Before the image can be displayed, the first step is to construct and store the multi-resolution hierarchical model of tiles, in which sub-mosaics of the same size are original tiles with the highest resolution. Similar to quadtree coding, each tile is encoded based on its location in the pyramid. Then all tile data are stored in MongoDB database. Each record in MongoDB is a key-value pair structure corresponding to a specific tile, in which key is the encoding of a tile and value is the tiled image data stored in binary stream format. Image display is implemented based on graphical user interface. By mouse operation, the system can display images at different resolutions and browse different part of the image. Based on the pyramid model and key-value storage structure, there are more than 60,000 high-definition tiled images in our application. The full panorama resolution is 16.2 billion pixels, about 45GB in RAM, but only the tiles displayed in the current window need to be loaded. Hence, our system reduces the memory requirement greatly and makes image browsing more smoothly.
一种基于图像金字塔模型的大型水下照片拼接显示方法
在深海勘探中,光学成像提供了近距离、高分辨率的视觉信息。为了获得广泛的详细视觉信息,通常将多幅图像拼接成一张照片马赛克,其大小可达到数百亿像素。由于硬件和软件的限制,现有的图像查看器很难甚至不可能浏览和显示如此大的图像。本文提出了一种基于金字塔模型的显示方法,并开发了一种专用于水下照片拼接显示的超大图像拼接图像显示系统。在图像显示之前,首先要构建并存储瓷砖的多分辨率分层模型,其中相同大小的子马赛克是分辨率最高的原始瓷砖。与四叉树编码类似,每个贴图都是根据其在金字塔中的位置进行编码的。然后将所有tile数据存储在MongoDB数据库中。MongoDB中的每条记录都是对应于特定tile的键值对结构,其中key是tile的编码,value是以二进制流格式存储的tile图像数据。图像显示是基于图形用户界面实现的。通过鼠标操作,系统可以显示不同分辨率的图像,并浏览图像的不同部分。基于金字塔模型和键值存储结构,我们的应用中有6万多张高清平铺图像。全全景分辨率为162亿像素,内存约为45GB,但只需要加载当前窗口中显示的磁贴。因此,我们的系统大大降低了内存需求,使图像浏览更加顺畅。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信