Emily J A-Izzeddin, Thomas S A Wallis, Jason B Mattingley, William J Harrison
{"title":"Low-level features predict perceived similarity for naturalistic images.","authors":"Emily J A-Izzeddin, Thomas S A Wallis, Jason B Mattingley, William J Harrison","doi":"10.1167/jov.25.12.11","DOIUrl":null,"url":null,"abstract":"<p><p>The mechanisms by which humans perceptually organize individual regions of a visual scene to generate a coherent scene representation remain largely unknown. Our perception of statistical regularities has been relatively well-studied in simple stimuli, and explicit computational mechanisms that use low-level image features (e.g., luminance, contrast energy) to explain these perceptions have been described. Here, we investigate to what extent observers can effectively use such low-level information present in isolated naturalistic scene regions to facilitate associations between said regions. Across two experiments, participants were shown an isolated reference patch, then required to select which of two subsequently presented patches came from the same scene as the reference (two-alternative forced choice method). In Experiment 1, participants made their judgments based on unaltered image patches, and were consistently above chance when performing such association judgments. Additionally, participants' responses were well-predicted by a generalized linear multilevel model using predictors based on low-level feature similarity metrics (specifically, pixel-wise luminance and phase-invariant structure correlations). In Experiment 2, participants were presented with unaltered image regions, thresholded image regions, or regions reduced to only their edge content. Performance for thresholded and edge regions was significantly poorer than for unaltered image regions. Nonetheless, the model still correlated well with participants' judgments. Our findings suggest that image region associations can be accounted for using low-level feature correlations, suggesting such basic features are strongly associated with those underlying judgments made for complex visual stimuli.</p>","PeriodicalId":49955,"journal":{"name":"Journal of Vision","volume":"25 12","pages":"11"},"PeriodicalIF":2.3000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12514980/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Vision","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1167/jov.25.12.11","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The mechanisms by which humans perceptually organize individual regions of a visual scene to generate a coherent scene representation remain largely unknown. Our perception of statistical regularities has been relatively well-studied in simple stimuli, and explicit computational mechanisms that use low-level image features (e.g., luminance, contrast energy) to explain these perceptions have been described. Here, we investigate to what extent observers can effectively use such low-level information present in isolated naturalistic scene regions to facilitate associations between said regions. Across two experiments, participants were shown an isolated reference patch, then required to select which of two subsequently presented patches came from the same scene as the reference (two-alternative forced choice method). In Experiment 1, participants made their judgments based on unaltered image patches, and were consistently above chance when performing such association judgments. Additionally, participants' responses were well-predicted by a generalized linear multilevel model using predictors based on low-level feature similarity metrics (specifically, pixel-wise luminance and phase-invariant structure correlations). In Experiment 2, participants were presented with unaltered image regions, thresholded image regions, or regions reduced to only their edge content. Performance for thresholded and edge regions was significantly poorer than for unaltered image regions. Nonetheless, the model still correlated well with participants' judgments. Our findings suggest that image region associations can be accounted for using low-level feature correlations, suggesting such basic features are strongly associated with those underlying judgments made for complex visual stimuli.
期刊介绍:
Exploring all aspects of biological visual function, including spatial vision, perception,
low vision, color vision and more, spanning the fields of neuroscience, psychology and psychophysics.