{"title":"基于知识选择的不确定性感知弱监督时间动作定位","authors":"Huanqing Yan , Bo Sun , Jun He","doi":"10.1016/j.displa.2025.103215","DOIUrl":null,"url":null,"abstract":"<div><div>Weakly-Supervised Temporal Action Localization (WS-TAL) aims to localize actions in untrimmed videos using only video-level labels. The core challenge is the lack of fine-grained annotations, which leads to high prediction uncertainty and confusion between actions and background. To address this, we propose an <em>Uncertainty-Aware and Knowledge-Selection</em> (UAKS) approach. Specifically, we integrate two uncertainty estimation strategies to cooperatively optimize the model and leverage uncertainty to guide external knowledge selection. First, evidential learning estimates model uncertainty, generating more confident predictions via regularization. Second, probabilistic distribution learning captures data uncertainty. Both uncertainties jointly guide model optimization. Additionally, uncertainty-driven knowledge selection enables the efficient utilization of external knowledge under weak supervision. Experiments show that our method improves accuracy and robustness, with 12.9% and 2% accuracy improvements on THUMOS and ActivityNet v1.3 datasets respectively, demonstrating the potential of uncertainty modeling in WS-TAL.</div></div>","PeriodicalId":50570,"journal":{"name":"Displays","volume":"91 ","pages":"Article 103215"},"PeriodicalIF":3.4000,"publicationDate":"2025-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Uncertainty-aware weakly supervised temporal action localization with knowledge selection\",\"authors\":\"Huanqing Yan , Bo Sun , Jun He\",\"doi\":\"10.1016/j.displa.2025.103215\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Weakly-Supervised Temporal Action Localization (WS-TAL) aims to localize actions in untrimmed videos using only video-level labels. The core challenge is the lack of fine-grained annotations, which leads to high prediction uncertainty and confusion between actions and background. To address this, we propose an <em>Uncertainty-Aware and Knowledge-Selection</em> (UAKS) approach. Specifically, we integrate two uncertainty estimation strategies to cooperatively optimize the model and leverage uncertainty to guide external knowledge selection. First, evidential learning estimates model uncertainty, generating more confident predictions via regularization. Second, probabilistic distribution learning captures data uncertainty. Both uncertainties jointly guide model optimization. Additionally, uncertainty-driven knowledge selection enables the efficient utilization of external knowledge under weak supervision. Experiments show that our method improves accuracy and robustness, with 12.9% and 2% accuracy improvements on THUMOS and ActivityNet v1.3 datasets respectively, demonstrating the potential of uncertainty modeling in WS-TAL.</div></div>\",\"PeriodicalId\":50570,\"journal\":{\"name\":\"Displays\",\"volume\":\"91 \",\"pages\":\"Article 103215\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Displays\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0141938225002525\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Displays","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141938225002525","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
Uncertainty-aware weakly supervised temporal action localization with knowledge selection
Weakly-Supervised Temporal Action Localization (WS-TAL) aims to localize actions in untrimmed videos using only video-level labels. The core challenge is the lack of fine-grained annotations, which leads to high prediction uncertainty and confusion between actions and background. To address this, we propose an Uncertainty-Aware and Knowledge-Selection (UAKS) approach. Specifically, we integrate two uncertainty estimation strategies to cooperatively optimize the model and leverage uncertainty to guide external knowledge selection. First, evidential learning estimates model uncertainty, generating more confident predictions via regularization. Second, probabilistic distribution learning captures data uncertainty. Both uncertainties jointly guide model optimization. Additionally, uncertainty-driven knowledge selection enables the efficient utilization of external knowledge under weak supervision. Experiments show that our method improves accuracy and robustness, with 12.9% and 2% accuracy improvements on THUMOS and ActivityNet v1.3 datasets respectively, demonstrating the potential of uncertainty modeling in WS-TAL.
期刊介绍:
Displays is the international journal covering the research and development of display technology, its effective presentation and perception of information, and applications and systems including display-human interface.
Technical papers on practical developments in Displays technology provide an effective channel to promote greater understanding and cross-fertilization across the diverse disciplines of the Displays community. Original research papers solving ergonomics issues at the display-human interface advance effective presentation of information. Tutorial papers covering fundamentals intended for display technologies and human factor engineers new to the field will also occasionally featured.