{"title":"VQALS: A Video Question Answering Method in Low-Light Scenes Based on Illumination Correction and Feature Enhancement","authors":"Jie Yang;Miao Ma;Yutong Li;Zhao Pei","doi":"10.23919/cje.2023.00.403","DOIUrl":null,"url":null,"abstract":"In low-light scenes, videos often exhibit low brightness, leading to less evident details in regional features. The current video question answering models have made significant progress in the fusion and reasoning of cross-modal information. However, they perform poorly in effectively extracting useful information and salient features in low-light scenes. To tackle this challenge, we propose a video question answering method in low-light scenes, in which two modules are developed: illumination correction module and feature enhancement module. The illumination correction module enhances visual quality by applying adaptive enhancement to the video with a variational threshold, thereby extracting more feature information. The feature enhancement module further enriches and strengthens important information in the features by introducing a dynamic learning strategy to enhance spatial features by two branches, providing reasonable evidence for inferring the correct answer. Finally, the enhanced visual features are fused with question features to infer and generate proper answers. We perform extensive experiments on public datasets. The experimental results manifest the advantages and effectiveness compared with state-of-the-art methods in terms of accuracy in video question answering task.","PeriodicalId":50701,"journal":{"name":"Chinese Journal of Electronics","volume":"34 4","pages":"1300-1308"},"PeriodicalIF":3.0000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11151179","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chinese Journal of Electronics","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11151179/","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
In low-light scenes, videos often exhibit low brightness, leading to less evident details in regional features. The current video question answering models have made significant progress in the fusion and reasoning of cross-modal information. However, they perform poorly in effectively extracting useful information and salient features in low-light scenes. To tackle this challenge, we propose a video question answering method in low-light scenes, in which two modules are developed: illumination correction module and feature enhancement module. The illumination correction module enhances visual quality by applying adaptive enhancement to the video with a variational threshold, thereby extracting more feature information. The feature enhancement module further enriches and strengthens important information in the features by introducing a dynamic learning strategy to enhance spatial features by two branches, providing reasonable evidence for inferring the correct answer. Finally, the enhanced visual features are fused with question features to infer and generate proper answers. We perform extensive experiments on public datasets. The experimental results manifest the advantages and effectiveness compared with state-of-the-art methods in terms of accuracy in video question answering task.
期刊介绍:
CJE focuses on the emerging fields of electronics, publishing innovative and transformative research papers. Most of the papers published in CJE are from universities and research institutes, presenting their innovative research results. Both theoretical and practical contributions are encouraged, and original research papers reporting novel solutions to the hot topics in electronics are strongly recommended.