{"title":"cnn提取的特征生成对未见图像的合成fMRI响应","authors":"Parsa Delavari , Leonid Sigal , Ipek Oruc","doi":"10.1016/j.visres.2025.108641","DOIUrl":null,"url":null,"abstract":"<div><div>Inspired by biological vision, convolutional neural networks (CNNs) have tackled challenging image recognition problems once considered the sole purview of human expertise. In turn, CNNs are now widely used as a framework for studying human vision. The organizational similarity between the layers of CNNs and cortical regions along the visual pathway has been shown in studies using human fMRI data, such that early visual areas’ activities are better predicted by the first layers of CNNs while their last layers better predict the response of higher-level visual areas. However, there is a lack of agreement on how well CNN features can predict fMRI responses, particularly in the presence of fMRI noise, which can result in varying brain responses to the repetitions of the same image. Additionally, the utility of these predicted responses to previously unseen images as synthetic fMRI data has not yet been explored. Here we use the BOLD5000 dataset and the AlexNet architecture initialized with the model weights pre-trained on ImageNet to show that features extracted by CNNs can g enerate highly accurate synthetic fMRI responses to images. We demonstrate that synthetic fMRI responses show higher correlations with repetitions of real responses than the real responses themselves, surpassing the quality of real data in the presence of noise. Moreover, we train a decoder with synthetic fMRI data to classify real fMRI data for unseen images and even unseen object categories. Our decoding experiments revealed that the synthetic data outperformed real data, particularly due to the ability to generate larger synthetic datasets. Our findings showcase the high quality of generated synthetic fMRI responses to images based on CNN features, exhibiting both similarities to real data and practical utility in empirical applications.</div></div>","PeriodicalId":23670,"journal":{"name":"Vision Research","volume":"234 ","pages":"Article 108641"},"PeriodicalIF":1.4000,"publicationDate":"2025-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CNN-extracted features generate synthetic fMRI responses to unseen images\",\"authors\":\"Parsa Delavari , Leonid Sigal , Ipek Oruc\",\"doi\":\"10.1016/j.visres.2025.108641\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Inspired by biological vision, convolutional neural networks (CNNs) have tackled challenging image recognition problems once considered the sole purview of human expertise. In turn, CNNs are now widely used as a framework for studying human vision. The organizational similarity between the layers of CNNs and cortical regions along the visual pathway has been shown in studies using human fMRI data, such that early visual areas’ activities are better predicted by the first layers of CNNs while their last layers better predict the response of higher-level visual areas. However, there is a lack of agreement on how well CNN features can predict fMRI responses, particularly in the presence of fMRI noise, which can result in varying brain responses to the repetitions of the same image. Additionally, the utility of these predicted responses to previously unseen images as synthetic fMRI data has not yet been explored. Here we use the BOLD5000 dataset and the AlexNet architecture initialized with the model weights pre-trained on ImageNet to show that features extracted by CNNs can g enerate highly accurate synthetic fMRI responses to images. We demonstrate that synthetic fMRI responses show higher correlations with repetitions of real responses than the real responses themselves, surpassing the quality of real data in the presence of noise. Moreover, we train a decoder with synthetic fMRI data to classify real fMRI data for unseen images and even unseen object categories. Our decoding experiments revealed that the synthetic data outperformed real data, particularly due to the ability to generate larger synthetic datasets. Our findings showcase the high quality of generated synthetic fMRI responses to images based on CNN features, exhibiting both similarities to real data and practical utility in empirical applications.</div></div>\",\"PeriodicalId\":23670,\"journal\":{\"name\":\"Vision Research\",\"volume\":\"234 \",\"pages\":\"Article 108641\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-06-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Vision Research\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0042698925001026\",\"RegionNum\":4,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vision Research","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0042698925001026","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
CNN-extracted features generate synthetic fMRI responses to unseen images
Inspired by biological vision, convolutional neural networks (CNNs) have tackled challenging image recognition problems once considered the sole purview of human expertise. In turn, CNNs are now widely used as a framework for studying human vision. The organizational similarity between the layers of CNNs and cortical regions along the visual pathway has been shown in studies using human fMRI data, such that early visual areas’ activities are better predicted by the first layers of CNNs while their last layers better predict the response of higher-level visual areas. However, there is a lack of agreement on how well CNN features can predict fMRI responses, particularly in the presence of fMRI noise, which can result in varying brain responses to the repetitions of the same image. Additionally, the utility of these predicted responses to previously unseen images as synthetic fMRI data has not yet been explored. Here we use the BOLD5000 dataset and the AlexNet architecture initialized with the model weights pre-trained on ImageNet to show that features extracted by CNNs can g enerate highly accurate synthetic fMRI responses to images. We demonstrate that synthetic fMRI responses show higher correlations with repetitions of real responses than the real responses themselves, surpassing the quality of real data in the presence of noise. Moreover, we train a decoder with synthetic fMRI data to classify real fMRI data for unseen images and even unseen object categories. Our decoding experiments revealed that the synthetic data outperformed real data, particularly due to the ability to generate larger synthetic datasets. Our findings showcase the high quality of generated synthetic fMRI responses to images based on CNN features, exhibiting both similarities to real data and practical utility in empirical applications.
期刊介绍:
Vision Research is a journal devoted to the functional aspects of human, vertebrate and invertebrate vision and publishes experimental and observational studies, reviews, and theoretical and computational analyses. Vision Research also publishes clinical studies relevant to normal visual function and basic research relevant to visual dysfunction or its clinical investigation. Functional aspects of vision is interpreted broadly, ranging from molecular and cellular function to perception and behavior. Detailed descriptions are encouraged but enough introductory background should be included for non-specialists. Theoretical and computational papers should give a sense of order to the facts or point to new verifiable observations. Papers dealing with questions in the history of vision science should stress the development of ideas in the field.