A Compact Tri-Modal Camera Unit for RGBDT Vision

Julian Strohmayer, M. Kampel
{"title":"A Compact Tri-Modal Camera Unit for RGBDT Vision","authors":"Julian Strohmayer, M. Kampel","doi":"10.1145/3523111.3523116","DOIUrl":null,"url":null,"abstract":"The combination of RGBD and thermal cameras in multi-modal person-centric vision applications has great potential. As a complementary modality, thermal cameras can compensate for weaknesses such as the inability to operate in absolute darkness of conventional RGB cameras or the range limitations associated with consumer depth cameras, resulting in a more robust computer vision system. In addition, the high contrast between persons and their surroundings in thermal images can ease fundamental detection and segmentation tasks. Unfortunately, the market supply of low-cost consumer RGBDT vision systems is non-existent at the moment, which slows down progress in the field of person-centric vision. We address this problem by proposing a Compact Tri-modal CAmera uniT (CTCAT) for RGBDT vision, which can be manufactured from off-the-shelf components and 3D printed parts. CTCAT features a 1280 × 720 RGB camera, a 640 × 480 structured light depth camera with an operating range of 0.6 − 8m, and a 160 × 120 uncooled radiometric thermal camera. RGB, depth, and thermal images can be captured simultaneously at frame rates up to 9 fps. In this work, we describe the components, fabrication, and calibration of CTCAT. In addition, a new multi-modal calibration target suitable for the geometric calibration of RGB, depth, and thermal cameras is presented, which offers advantages over the state of the art in terms of contrast and practicality. Moreover, radiometric calibration of CTCAT is performed to evaluate the applicability to person-centric vision applications requiring radiometry.","PeriodicalId":185161,"journal":{"name":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Machine Vision and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3523111.3523116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The combination of RGBD and thermal cameras in multi-modal person-centric vision applications has great potential. As a complementary modality, thermal cameras can compensate for weaknesses such as the inability to operate in absolute darkness of conventional RGB cameras or the range limitations associated with consumer depth cameras, resulting in a more robust computer vision system. In addition, the high contrast between persons and their surroundings in thermal images can ease fundamental detection and segmentation tasks. Unfortunately, the market supply of low-cost consumer RGBDT vision systems is non-existent at the moment, which slows down progress in the field of person-centric vision. We address this problem by proposing a Compact Tri-modal CAmera uniT (CTCAT) for RGBDT vision, which can be manufactured from off-the-shelf components and 3D printed parts. CTCAT features a 1280 × 720 RGB camera, a 640 × 480 structured light depth camera with an operating range of 0.6 − 8m, and a 160 × 120 uncooled radiometric thermal camera. RGB, depth, and thermal images can be captured simultaneously at frame rates up to 9 fps. In this work, we describe the components, fabrication, and calibration of CTCAT. In addition, a new multi-modal calibration target suitable for the geometric calibration of RGB, depth, and thermal cameras is presented, which offers advantages over the state of the art in terms of contrast and practicality. Moreover, radiometric calibration of CTCAT is performed to evaluate the applicability to person-centric vision applications requiring radiometry.
一种用于RGBDT视觉的紧凑型三模态相机单元
RGBD与热像仪的结合在多模态以人为中心的视觉应用中具有很大的潜力。作为一种补充方式,热像仪可以弥补传统RGB相机无法在绝对黑暗中操作或与消费者深度相机相关的范围限制等缺点,从而产生更强大的计算机视觉系统。此外,热图像中人与周围环境的高对比度可以简化基本的检测和分割任务。遗憾的是,目前低成本消费RGBDT视觉系统的市场供应并不存在,这减缓了以人为本的视觉领域的进展。我们通过提出用于RGBDT视觉的紧凑型三模态相机单元(CTCAT)来解决这个问题,该单元可以由现成的组件和3D打印部件制造。CTCAT具有一台1280 × 720 RGB相机,一台640 × 480结构光深度相机,工作范围为0.6 - 8米,以及一台160 × 120非冷却辐射热像仪。RGB、深度和热图像可以以高达9 fps的帧率同时捕获。在这项工作中,我们描述了CTCAT的组成,制造和校准。此外,提出了一种适用于RGB相机、深度相机和热像仪几何标定的新型多模态标定目标,该目标在对比度和实用性方面具有领先于现有标定目标的优势。此外,对CTCAT进行了辐射校准,以评估需要辐射测量的以人为中心的视觉应用的适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信