利用机场闭路电视录像，通过视频理解技术进行能见度预测

IF 1.1 4区综合性期刊 Q3 MULTIDISCIPLINARY SCIENCES

Kuwait Journal of Science Pub Date : 2025-07-18 DOI:10.1016/j.kjs.2025.100470

Zeonlung Pun , Xinyu Tian , Shan Gao

{"title":"利用机场闭路电视录像，通过视频理解技术进行能见度预测","authors":"Zeonlung Pun , Xinyu Tian , Shan Gao","doi":"10.1016/j.kjs.2025.100470","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate atmospheric visibility prediction is critical for enhancing airport safety, especially under adverse weather conditions. However, existing visibility prediction methods predominantly rely on single-image analysis, using either traditional image processing techniques or deep learning models, which often fail to fully capture the dynamic and temporal characteristics inherent in video data. In this study, we explore various video understanding models for visibility prediction, achieving promising results and pioneering the use of video understanding techniques in this domain. Unlike traditional static image-based methods, our proposed three-stream network model integrates spatial information from individual frames, motion dynamics through optical flow, and key reference points extracted via SIFT (scale-invariant feature transform) descriptors. This enables the model to capture both short-term and long-term environmental changes. Experimental results show that our three-stream network significantly outperforms single-frame and image-based models in predicting <span><math><mrow><mi>R</mi><mi>V</mi><mi>R</mi><mtext>_</mtext><mn>1</mn><mi>A</mi></mrow></math></span> (average runway visibility range of one minute). Our model achieves an <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> mean of 0.896 and an accuracy mean of 0.860 on the test set, substantially outperforming traditional methods. These results not only demonstrate the superior performance of our approach in real-world scenarios but also highlight the potential of video understanding techniques for airport safety monitoring applications.</div></div>","PeriodicalId":17848,"journal":{"name":"Kuwait Journal of Science","volume":"53 1","pages":"Article 100470"},"PeriodicalIF":1.1000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Leveraging airport CCTV footage through video understanding techniques for visibility prediction\",\"authors\":\"Zeonlung Pun , Xinyu Tian , Shan Gao\",\"doi\":\"10.1016/j.kjs.2025.100470\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate atmospheric visibility prediction is critical for enhancing airport safety, especially under adverse weather conditions. However, existing visibility prediction methods predominantly rely on single-image analysis, using either traditional image processing techniques or deep learning models, which often fail to fully capture the dynamic and temporal characteristics inherent in video data. In this study, we explore various video understanding models for visibility prediction, achieving promising results and pioneering the use of video understanding techniques in this domain. Unlike traditional static image-based methods, our proposed three-stream network model integrates spatial information from individual frames, motion dynamics through optical flow, and key reference points extracted via SIFT (scale-invariant feature transform) descriptors. This enables the model to capture both short-term and long-term environmental changes. Experimental results show that our three-stream network significantly outperforms single-frame and image-based models in predicting <span><math><mrow><mi>R</mi><mi>V</mi><mi>R</mi><mtext>_</mtext><mn>1</mn><mi>A</mi></mrow></math></span> (average runway visibility range of one minute). Our model achieves an <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> mean of 0.896 and an accuracy mean of 0.860 on the test set, substantially outperforming traditional methods. These results not only demonstrate the superior performance of our approach in real-world scenarios but also highlight the potential of video understanding techniques for airport safety monitoring applications.</div></div>\",\"PeriodicalId\":17848,\"journal\":{\"name\":\"Kuwait Journal of Science\",\"volume\":\"53 1\",\"pages\":\"Article 100470\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2025-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Kuwait Journal of Science\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2307410825001142\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Kuwait Journal of Science","FirstCategoryId":"103","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2307410825001142","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

准确的大气能见度预测对加强机场安全至关重要，特别是在恶劣天气条件下。然而，现有的可见性预测方法主要依赖于单图像分析，使用传统的图像处理技术或深度学习模型，往往无法充分捕捉视频数据固有的动态和时间特征。在本研究中，我们探索了用于可见性预测的各种视频理解模型，取得了令人鼓舞的结果，并开创了视频理解技术在该领域的应用。与传统的基于静态图像的方法不同，我们提出的三流网络模型集成了来自单个帧的空间信息，通过光流的运动动态以及通过SIFT（尺度不变特征变换）描述符提取的关键参考点。这使模型能够捕捉短期和长期的环境变化。实验结果表明，我们的三流网络在预测RVR_1A（一分钟跑道平均能见度范围）方面明显优于单帧模型和基于图像的模型。我们的模型在测试集上的R2均值为0.896，准确率均值为0.860，大大优于传统方法。这些结果不仅证明了我们的方法在现实场景中的优越性能，而且突出了视频理解技术在机场安全监控应用中的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Leveraging airport CCTV footage through video understanding techniques for visibility prediction

Accurate atmospheric visibility prediction is critical for enhancing airport safety, especially under adverse weather conditions. However, existing visibility prediction methods predominantly rely on single-image analysis, using either traditional image processing techniques or deep learning models, which often fail to fully capture the dynamic and temporal characteristics inherent in video data. In this study, we explore various video understanding models for visibility prediction, achieving promising results and pioneering the use of video understanding techniques in this domain. Unlike traditional static image-based methods, our proposed three-stream network model integrates spatial information from individual frames, motion dynamics through optical flow, and key reference points extracted via SIFT (scale-invariant feature transform) descriptors. This enables the model to capture both short-term and long-term environmental changes. Experimental results show that our three-stream network significantly outperforms single-frame and image-based models in predicting

R V R_1 A

(average runway visibility range of one minute). Our model achieves an

R^{2}

mean of 0.896 and an accuracy mean of 0.860 on the test set, substantially outperforming traditional methods. These results not only demonstrate the superior performance of our approach in real-world scenarios but also highlight the potential of video understanding techniques for airport safety monitoring applications.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Kuwait Journal of Science MULTIDISCIPLINARY SCIENCES-

CiteScore

1.60

自引率

28.60%

发文量

132

期刊介绍： Kuwait Journal of Science (KJS) is indexed and abstracted by major publishing houses such as Chemical Abstract, Science Citation Index, Current contents, Mathematics Abstract, Micribiological Abstracts etc. KJS publishes peer-review articles in various fields of Science including Mathematics, Computer Science, Physics, Statistics, Biology, Chemistry and Earth & Environmental Sciences. In addition, it also aims to bring the results of scientific research carried out under a variety of intellectual traditions and organizations to the attention of specialized scholarly readership. As such, the publisher expects the submission of original manuscripts which contain analysis and solutions about important theoretical, empirical and normative issues.