Sunwoo Kim, Yongjun Ahn, Daeyoung Park, Byonghyo Shim
{"title":"VOMTC:毫米波和太赫兹通信视觉对象","authors":"Sunwoo Kim, Yongjun Ahn, Daeyoung Park, Byonghyo Shim","doi":"arxiv-2409.09330","DOIUrl":null,"url":null,"abstract":"Recent advances in sensing and computer vision (CV) technologies have opened\nthe door for the application of deep learning (DL)-based CV technologies in the\nrealm of 6G wireless communications. For the successful application of this\nemerging technology, it is crucial to have a qualified vision dataset tailored\nfor wireless applications (e.g., RGB images containing wireless devices such as\nlaptops and cell phones). An aim of this paper is to propose a large-scale\nvision dataset referred to as Vision Objects for Millimeter and Terahertz\nCommunications (VOMTC). The VOMTC dataset consists of 20,232 pairs of RGB and\ndepth images obtained from a camera attached to the base station (BS), with\neach pair labeled with three representative object categories (person, cell\nphone, and laptop) and bounding boxes of the objects. Through experimental\nstudies of the VOMTC datasets, we show that the beamforming technique\nexploiting the VOMTC-trained object detector outperforms conventional\nbeamforming techniques.","PeriodicalId":501280,"journal":{"name":"arXiv - CS - Networking and Internet Architecture","volume":"17 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"VOMTC: Vision Objects for Millimeter and Terahertz Communications\",\"authors\":\"Sunwoo Kim, Yongjun Ahn, Daeyoung Park, Byonghyo Shim\",\"doi\":\"arxiv-2409.09330\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recent advances in sensing and computer vision (CV) technologies have opened\\nthe door for the application of deep learning (DL)-based CV technologies in the\\nrealm of 6G wireless communications. For the successful application of this\\nemerging technology, it is crucial to have a qualified vision dataset tailored\\nfor wireless applications (e.g., RGB images containing wireless devices such as\\nlaptops and cell phones). An aim of this paper is to propose a large-scale\\nvision dataset referred to as Vision Objects for Millimeter and Terahertz\\nCommunications (VOMTC). The VOMTC dataset consists of 20,232 pairs of RGB and\\ndepth images obtained from a camera attached to the base station (BS), with\\neach pair labeled with three representative object categories (person, cell\\nphone, and laptop) and bounding boxes of the objects. Through experimental\\nstudies of the VOMTC datasets, we show that the beamforming technique\\nexploiting the VOMTC-trained object detector outperforms conventional\\nbeamforming techniques.\",\"PeriodicalId\":501280,\"journal\":{\"name\":\"arXiv - CS - Networking and Internet Architecture\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Networking and Internet Architecture\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.09330\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Networking and Internet Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09330","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
VOMTC: Vision Objects for Millimeter and Terahertz Communications
Recent advances in sensing and computer vision (CV) technologies have opened
the door for the application of deep learning (DL)-based CV technologies in the
realm of 6G wireless communications. For the successful application of this
emerging technology, it is crucial to have a qualified vision dataset tailored
for wireless applications (e.g., RGB images containing wireless devices such as
laptops and cell phones). An aim of this paper is to propose a large-scale
vision dataset referred to as Vision Objects for Millimeter and Terahertz
Communications (VOMTC). The VOMTC dataset consists of 20,232 pairs of RGB and
depth images obtained from a camera attached to the base station (BS), with
each pair labeled with three representative object categories (person, cell
phone, and laptop) and bounding boxes of the objects. Through experimental
studies of the VOMTC datasets, we show that the beamforming technique
exploiting the VOMTC-trained object detector outperforms conventional
beamforming techniques.