{"title":"利用机场闭路电视录像,通过视频理解技术进行能见度预测","authors":"Zeonlung Pun , Xinyu Tian , Shan Gao","doi":"10.1016/j.kjs.2025.100470","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate atmospheric visibility prediction is critical for enhancing airport safety, especially under adverse weather conditions. However, existing visibility prediction methods predominantly rely on single-image analysis, using either traditional image processing techniques or deep learning models, which often fail to fully capture the dynamic and temporal characteristics inherent in video data. In this study, we explore various video understanding models for visibility prediction, achieving promising results and pioneering the use of video understanding techniques in this domain. Unlike traditional static image-based methods, our proposed three-stream network model integrates spatial information from individual frames, motion dynamics through optical flow, and key reference points extracted via SIFT (scale-invariant feature transform) descriptors. This enables the model to capture both short-term and long-term environmental changes. Experimental results show that our three-stream network significantly outperforms single-frame and image-based models in predicting <span><math><mrow><mi>R</mi><mi>V</mi><mi>R</mi><mtext>_</mtext><mn>1</mn><mi>A</mi></mrow></math></span> (average runway visibility range of one minute). Our model achieves an <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> mean of 0.896 and an accuracy mean of 0.860 on the test set, substantially outperforming traditional methods. These results not only demonstrate the superior performance of our approach in real-world scenarios but also highlight the potential of video understanding techniques for airport safety monitoring applications.</div></div>","PeriodicalId":17848,"journal":{"name":"Kuwait Journal of Science","volume":"53 1","pages":"Article 100470"},"PeriodicalIF":1.1000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Leveraging airport CCTV footage through video understanding techniques for visibility prediction\",\"authors\":\"Zeonlung Pun , Xinyu Tian , Shan Gao\",\"doi\":\"10.1016/j.kjs.2025.100470\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate atmospheric visibility prediction is critical for enhancing airport safety, especially under adverse weather conditions. However, existing visibility prediction methods predominantly rely on single-image analysis, using either traditional image processing techniques or deep learning models, which often fail to fully capture the dynamic and temporal characteristics inherent in video data. In this study, we explore various video understanding models for visibility prediction, achieving promising results and pioneering the use of video understanding techniques in this domain. Unlike traditional static image-based methods, our proposed three-stream network model integrates spatial information from individual frames, motion dynamics through optical flow, and key reference points extracted via SIFT (scale-invariant feature transform) descriptors. This enables the model to capture both short-term and long-term environmental changes. Experimental results show that our three-stream network significantly outperforms single-frame and image-based models in predicting <span><math><mrow><mi>R</mi><mi>V</mi><mi>R</mi><mtext>_</mtext><mn>1</mn><mi>A</mi></mrow></math></span> (average runway visibility range of one minute). Our model achieves an <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span> mean of 0.896 and an accuracy mean of 0.860 on the test set, substantially outperforming traditional methods. These results not only demonstrate the superior performance of our approach in real-world scenarios but also highlight the potential of video understanding techniques for airport safety monitoring applications.</div></div>\",\"PeriodicalId\":17848,\"journal\":{\"name\":\"Kuwait Journal of Science\",\"volume\":\"53 1\",\"pages\":\"Article 100470\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2025-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Kuwait Journal of Science\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2307410825001142\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Kuwait Journal of Science","FirstCategoryId":"103","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2307410825001142","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Leveraging airport CCTV footage through video understanding techniques for visibility prediction
Accurate atmospheric visibility prediction is critical for enhancing airport safety, especially under adverse weather conditions. However, existing visibility prediction methods predominantly rely on single-image analysis, using either traditional image processing techniques or deep learning models, which often fail to fully capture the dynamic and temporal characteristics inherent in video data. In this study, we explore various video understanding models for visibility prediction, achieving promising results and pioneering the use of video understanding techniques in this domain. Unlike traditional static image-based methods, our proposed three-stream network model integrates spatial information from individual frames, motion dynamics through optical flow, and key reference points extracted via SIFT (scale-invariant feature transform) descriptors. This enables the model to capture both short-term and long-term environmental changes. Experimental results show that our three-stream network significantly outperforms single-frame and image-based models in predicting (average runway visibility range of one minute). Our model achieves an mean of 0.896 and an accuracy mean of 0.860 on the test set, substantially outperforming traditional methods. These results not only demonstrate the superior performance of our approach in real-world scenarios but also highlight the potential of video understanding techniques for airport safety monitoring applications.
期刊介绍:
Kuwait Journal of Science (KJS) is indexed and abstracted by major publishing houses such as Chemical Abstract, Science Citation Index, Current contents, Mathematics Abstract, Micribiological Abstracts etc. KJS publishes peer-review articles in various fields of Science including Mathematics, Computer Science, Physics, Statistics, Biology, Chemistry and Earth & Environmental Sciences. In addition, it also aims to bring the results of scientific research carried out under a variety of intellectual traditions and organizations to the attention of specialized scholarly readership. As such, the publisher expects the submission of original manuscripts which contain analysis and solutions about important theoretical, empirical and normative issues.