用于自然刺激合成的双向生成对抗表征学习。

IF 2.1 3区医学 Q3 NEUROSCIENCES

Journal of neurophysiology Pub Date : 2024-10-01 Epub Date: 2024-08-28 DOI:10.1152/jn.00421.2023

Johnny Reilly, John D Goodwin, Sihao Lu, Andriy S Kozlov

{"title":"用于自然刺激合成的双向生成对抗表征学习。","authors":"Johnny Reilly, John D Goodwin, Sihao Lu, Andriy S Kozlov","doi":"10.1152/jn.00421.2023","DOIUrl":null,"url":null,"abstract":"Thousands of species use vocal signals to communicate with one another. Vocalizations carry rich information, yet characterizing and analyzing these complex, high-dimensional signals is difficult and prone to human bias. Moreover, animal vocalizations are ethologically relevant stimuli whose representation by auditory neurons is an important subject of research in sensory neuroscience. A method that can efficiently generate naturalistic vocalization waveforms would offer an unlimited supply of stimuli with which to probe neuronal computations. Although unsupervised learning methods allow for the projection of vocalizations into low-dimensional latent spaces learned from the waveforms themselves, and generative modeling allows for the synthesis of novel vocalizations for use in downstream tasks, we are not aware of any model that combines these tasks to synthesize naturalistic vocalizations in the waveform domain for stimulus playback. In this paper, we demonstrate BiWaveGAN: a bidirectional generative adversarial network (GAN) capable of learning a latent representation of ultrasonic vocalizations (USVs) from mice. We show that BiWaveGAN can be used to generate, and interpolate between, realistic vocalization waveforms. We then use these synthesized stimuli along with natural USVs to probe the sensory input space of mouse auditory cortical neurons. We show that stimuli generated from our method evoke neuronal responses as effectively as real vocalizations, and produce receptive fields with the same predictive power. BiWaveGAN is not restricted to mouse USVs but can be used to synthesize naturalistic vocalizations of any animal species and interpolate between vocalizations of the same or different species, which could be useful for probing categorical boundaries in representations of ethologically relevant auditory signals.NEW & NOTEWORTHY A new type of artificial neural network is presented that can be used to generate animal vocalization waveforms and interpolate between them to create new vocalizations. We find that our synthetic naturalistic stimuli drive auditory cortical neurons in the mouse equally well and produce receptive field features with the same predictive power as those obtained with natural mouse vocalizations, confirming the quality of the stimuli produced by the neural network.","PeriodicalId":16563,"journal":{"name":"Journal of neurophysiology","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Bidirectional generative adversarial representation learning for natural stimulus synthesis.\",\"authors\":\"Johnny Reilly, John D Goodwin, Sihao Lu, Andriy S Kozlov\",\"doi\":\"10.1152/jn.00421.2023\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Thousands of species use vocal signals to communicate with one another. Vocalizations carry rich information, yet characterizing and analyzing these complex, high-dimensional signals is difficult and prone to human bias. Moreover, animal vocalizations are ethologically relevant stimuli whose representation by auditory neurons is an important subject of research in sensory neuroscience. A method that can efficiently generate naturalistic vocalization waveforms would offer an unlimited supply of stimuli with which to probe neuronal computations. Although unsupervised learning methods allow for the projection of vocalizations into low-dimensional latent spaces learned from the waveforms themselves, and generative modeling allows for the synthesis of novel vocalizations for use in downstream tasks, we are not aware of any model that combines these tasks to synthesize naturalistic vocalizations in the waveform domain for stimulus playback. In this paper, we demonstrate BiWaveGAN: a bidirectional generative adversarial network (GAN) capable of learning a latent representation of ultrasonic vocalizations (USVs) from mice. We show that BiWaveGAN can be used to generate, and interpolate between, realistic vocalization waveforms. We then use these synthesized stimuli along with natural USVs to probe the sensory input space of mouse auditory cortical neurons. We show that stimuli generated from our method evoke neuronal responses as effectively as real vocalizations, and produce receptive fields with the same predictive power. BiWaveGAN is not restricted to mouse USVs but can be used to synthesize naturalistic vocalizations of any animal species and interpolate between vocalizations of the same or different species, which could be useful for probing categorical boundaries in representations of ethologically relevant auditory signals.NEW & NOTEWORTHY A new type of artificial neural network is presented that can be used to generate animal vocalization waveforms and interpolate between them to create new vocalizations. We find that our synthetic naturalistic stimuli drive auditory cortical neurons in the mouse equally well and produce receptive field features with the same predictive power as those obtained with natural mouse vocalizations, confirming the quality of the stimuli produced by the neural network.\",\"PeriodicalId\":16563,\"journal\":{\"name\":\"Journal of neurophysiology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of neurophysiology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1152/jn.00421.2023\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/8/28 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of neurophysiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1152/jn.00421.2023","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/28 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"NEUROSCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

发声蕴含着丰富的信息，但描述和分析这些高维信号十分困难，而且容易受到人类偏见的影响。此外，动物发声是与人种学相关的刺激，其听觉神经元的表征是感觉神经科学研究的重要课题。一种能有效生成自然发声波形的方法将为探测神经元计算提供无限的刺激。虽然无监督学习方法可以将发声投射到从波形本身学习到的低维潜在空间，生成模型可以合成新颖的发声用于下游任务，但目前还没有一种方法可以将这些任务结合起来，生成自然的发声波形用于刺激回放。在这里，我们展示了 BiWaveGAN：一种双向生成对抗网络（GAN），能够学习小鼠超声发声（USV）的潜在表示。我们展示了 BiWaveGAN 可用于生成逼真的发声波形，并在这些波形之间进行插值。然后，我们使用这些合成刺激和自然 USV 来探测小鼠听觉皮层神经元的感觉输入空间。我们的研究表明，用我们的方法生成的刺激能像真实发声一样有效地唤起神经元的反应，并产生具有相同预测能力的感受野。BiWaveGAN 并不局限于小鼠的 USV，它可以用来合成任何动物物种的自然发声，并在同一物种或不同物种的发声之间进行插值，这对于探测与人种学相关的听觉信号表征中的分类界限非常有用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Bidirectional generative adversarial representation learning for natural stimulus synthesis.

Thousands of species use vocal signals to communicate with one another. Vocalizations carry rich information, yet characterizing and analyzing these complex, high-dimensional signals is difficult and prone to human bias. Moreover, animal vocalizations are ethologically relevant stimuli whose representation by auditory neurons is an important subject of research in sensory neuroscience. A method that can efficiently generate naturalistic vocalization waveforms would offer an unlimited supply of stimuli with which to probe neuronal computations. Although unsupervised learning methods allow for the projection of vocalizations into low-dimensional latent spaces learned from the waveforms themselves, and generative modeling allows for the synthesis of novel vocalizations for use in downstream tasks, we are not aware of any model that combines these tasks to synthesize naturalistic vocalizations in the waveform domain for stimulus playback. In this paper, we demonstrate BiWaveGAN: a bidirectional generative adversarial network (GAN) capable of learning a latent representation of ultrasonic vocalizations (USVs) from mice. We show that BiWaveGAN can be used to generate, and interpolate between, realistic vocalization waveforms. We then use these synthesized stimuli along with natural USVs to probe the sensory input space of mouse auditory cortical neurons. We show that stimuli generated from our method evoke neuronal responses as effectively as real vocalizations, and produce receptive fields with the same predictive power. BiWaveGAN is not restricted to mouse USVs but can be used to synthesize naturalistic vocalizations of any animal species and interpolate between vocalizations of the same or different species, which could be useful for probing categorical boundaries in representations of ethologically relevant auditory signals.NEW & NOTEWORTHY A new type of artificial neural network is presented that can be used to generate animal vocalization waveforms and interpolate between them to create new vocalizations. We find that our synthetic naturalistic stimuli drive auditory cortical neurons in the mouse equally well and produce receptive field features with the same predictive power as those obtained with natural mouse vocalizations, confirming the quality of the stimuli produced by the neural network.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of neurophysiology 医学-神经科学

CiteScore

4.80

自引率

8.00%

发文量

255

审稿时长

2-3 weeks

期刊介绍： The Journal of Neurophysiology publishes original articles on the function of the nervous system. All levels of function are included, from the membrane and cell to systems and behavior. Experimental approaches include molecular neurobiology, cell culture and slice preparations, membrane physiology, developmental neurobiology, functional neuroanatomy, neurochemistry, neuropharmacology, systems electrophysiology, imaging and mapping techniques, and behavioral analysis. Experimental preparations may be invertebrate or vertebrate species, including humans. Theoretical studies are acceptable if they are tied closely to the interpretation of experimental data and elucidate principles of broad interest.