{"title":"为高动态范围设备恢复标准动态范围视频:具有混合注意力机制的学习范式","authors":"Peilin Chen, Wenhan Yang, Shiqi Wang","doi":"10.1109/MMUL.2023.3270035","DOIUrl":null,"url":null,"abstract":"With the prevalence of high-dynamic-range (HDR) display devices, the demand to convert existing standard-dynamic-range television (SDRTV) video content to its corresponding HDR television (HDRTV) counterpart is growing exponentially. Herein, we propose a two-stage learning paradigm with hybrid attention mechanisms to fully exploit spatial, channelwise, and regional correlations for faithfully driving such conversion. Specifically, in the first domain-mapping stage, the depthwise self-attention and global calibration layer are proposed, which adaptively leverage feature intrarelationships to construct better scene representation and achieve engaging SDRTV-to-HDRTV transformation. In the second highlight-generation stage, considering that the overexposed regions potentially lead to detail loss, which brings enormous challenges to the conversion, we propose a regional self-attention module to specifically restore missing highlights. Extensive experimental results on public databases show that our method outperforms state-of-the-art approaches in terms of different quality evaluation measures.","PeriodicalId":13240,"journal":{"name":"IEEE MultiMedia","volume":"30 1","pages":"110-118"},"PeriodicalIF":2.3000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Reviving Standard-Dynamic-Range Videos for High-Dynamic-Range Devices: A Learning Paradigm With Hybrid Attention Mechanisms\",\"authors\":\"Peilin Chen, Wenhan Yang, Shiqi Wang\",\"doi\":\"10.1109/MMUL.2023.3270035\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the prevalence of high-dynamic-range (HDR) display devices, the demand to convert existing standard-dynamic-range television (SDRTV) video content to its corresponding HDR television (HDRTV) counterpart is growing exponentially. Herein, we propose a two-stage learning paradigm with hybrid attention mechanisms to fully exploit spatial, channelwise, and regional correlations for faithfully driving such conversion. Specifically, in the first domain-mapping stage, the depthwise self-attention and global calibration layer are proposed, which adaptively leverage feature intrarelationships to construct better scene representation and achieve engaging SDRTV-to-HDRTV transformation. In the second highlight-generation stage, considering that the overexposed regions potentially lead to detail loss, which brings enormous challenges to the conversion, we propose a regional self-attention module to specifically restore missing highlights. Extensive experimental results on public databases show that our method outperforms state-of-the-art approaches in terms of different quality evaluation measures.\",\"PeriodicalId\":13240,\"journal\":{\"name\":\"IEEE MultiMedia\",\"volume\":\"30 1\",\"pages\":\"110-118\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2023-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE MultiMedia\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1109/MMUL.2023.3270035\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE MultiMedia","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1109/MMUL.2023.3270035","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
Reviving Standard-Dynamic-Range Videos for High-Dynamic-Range Devices: A Learning Paradigm With Hybrid Attention Mechanisms
With the prevalence of high-dynamic-range (HDR) display devices, the demand to convert existing standard-dynamic-range television (SDRTV) video content to its corresponding HDR television (HDRTV) counterpart is growing exponentially. Herein, we propose a two-stage learning paradigm with hybrid attention mechanisms to fully exploit spatial, channelwise, and regional correlations for faithfully driving such conversion. Specifically, in the first domain-mapping stage, the depthwise self-attention and global calibration layer are proposed, which adaptively leverage feature intrarelationships to construct better scene representation and achieve engaging SDRTV-to-HDRTV transformation. In the second highlight-generation stage, considering that the overexposed regions potentially lead to detail loss, which brings enormous challenges to the conversion, we propose a regional self-attention module to specifically restore missing highlights. Extensive experimental results on public databases show that our method outperforms state-of-the-art approaches in terms of different quality evaluation measures.
期刊介绍:
The magazine contains technical information covering a broad range of issues in multimedia systems and applications. Articles discuss research as well as advanced practice in hardware/software and are expected to span the range from theory to working systems. Especially encouraged are papers discussing experiences with new or advanced systems and subsystems. To avoid unnecessary overlap with existing publications, acceptable papers must have a significant focus on aspects unique to multimedia systems and applications. These aspects are likely to be related to the special needs of multimedia information compared to other electronic data, for example, the size requirements of digital media and the importance of time in the representation of such media. The following list is not exhaustive, but is representative of the topics that are covered: Hardware and software for media compression, coding & processing; Media representations & standards for storage, editing, interchange, transmission & presentation; Hardware platforms supporting multimedia applications; Operating systems suitable for multimedia applications; Storage devices & technologies for multimedia information; Network technologies, protocols, architectures & delivery techniques intended for multimedia; Synchronization issues; Multimedia databases; Formalisms for multimedia information systems & applications; Programming paradigms & languages for multimedia; Multimedia user interfaces; Media creation integration editing & management; Creation & modification of multimedia applications.