{"title":"Pulse discrete cosine transform for saliency-based visual attention","authors":"Ying-jia Yu, Bin Wang, Liming Zhang","doi":"10.1109/DEVLRN.2009.5175512","DOIUrl":null,"url":null,"abstract":"This paper proposes a saliency-based attention model based on pulsed cosine transform that simulates the lateral surround inhibition of neurons with similar visual features. The model can be extended to Hebbian-based neural networks. The visual saliency can be represented in binary codes, which agrees with the firing pulse of neurons in human brain. In addition, motion saliency can be directly generated by these pulse codes. Due to its good performance in eye fixation prediction and low computational complexity, our model can be used in real-time system such as robot navigation, virtual human system, and intelligent auto-focus system embedded in digital camera.","PeriodicalId":192225,"journal":{"name":"2009 IEEE 8th International Conference on Development and Learning","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"37","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE 8th International Conference on Development and Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEVLRN.2009.5175512","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 37
Abstract
This paper proposes a saliency-based attention model based on pulsed cosine transform that simulates the lateral surround inhibition of neurons with similar visual features. The model can be extended to Hebbian-based neural networks. The visual saliency can be represented in binary codes, which agrees with the firing pulse of neurons in human brain. In addition, motion saliency can be directly generated by these pulse codes. Due to its good performance in eye fixation prediction and low computational complexity, our model can be used in real-time system such as robot navigation, virtual human system, and intelligent auto-focus system embedded in digital camera.