Retinomorphic Channel Design and Considerations

Jonah P. Sengupta, A. Andreou
{"title":"Retinomorphic Channel Design and Considerations","authors":"Jonah P. Sengupta, A. Andreou","doi":"10.1109/CISS56502.2023.10089617","DOIUrl":null,"url":null,"abstract":"By extrapolating functionality from the retina, retinomorphic engineering has yielded devices that have shown promise to alleviate the challenges presented in modern computer vision tasks. An incredible amount of work has been devoted in recent years to the development and deployment of these event-based vision sensors in applications requiring low-latency, energy-efficient, high dynamic range sensing solutions. However, not much work has been devoted to the area, energy, and speed analysis of the various encoding and decoding mechanisms necessary for sensory pipelines. This paper outlines an empirical framework that presents a clear tradeoff between the various methodologies to transduce physical information in to spikes (encoding) and reconstruct said stimuli from the incident events (decoding). Software-based models of these methodologies were constructed to evaluate the accuracy of stimuli reconstruction for a variety of input profiles. As a result, it is shown that an optimized retinomorphic architecture for a specific set of system-driven cost metrics requires a heterogenous fabric of encoders with a composition of 95% temporal contrast pixels and 5% intensity encoder assuming a temporal jitter of 1μs. Much like the composition of ganglion cells in magno- and parvo-cellular pathway, this multi-modal solution provides the most time, area, and power efficient method to convey visual data.","PeriodicalId":243775,"journal":{"name":"2023 57th Annual Conference on Information Sciences and Systems (CISS)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 57th Annual Conference on Information Sciences and Systems (CISS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISS56502.2023.10089617","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

By extrapolating functionality from the retina, retinomorphic engineering has yielded devices that have shown promise to alleviate the challenges presented in modern computer vision tasks. An incredible amount of work has been devoted in recent years to the development and deployment of these event-based vision sensors in applications requiring low-latency, energy-efficient, high dynamic range sensing solutions. However, not much work has been devoted to the area, energy, and speed analysis of the various encoding and decoding mechanisms necessary for sensory pipelines. This paper outlines an empirical framework that presents a clear tradeoff between the various methodologies to transduce physical information in to spikes (encoding) and reconstruct said stimuli from the incident events (decoding). Software-based models of these methodologies were constructed to evaluate the accuracy of stimuli reconstruction for a variety of input profiles. As a result, it is shown that an optimized retinomorphic architecture for a specific set of system-driven cost metrics requires a heterogenous fabric of encoders with a composition of 95% temporal contrast pixels and 5% intensity encoder assuming a temporal jitter of 1μs. Much like the composition of ganglion cells in magno- and parvo-cellular pathway, this multi-modal solution provides the most time, area, and power efficient method to convey visual data.
视胚通道设计与考虑
通过推断视网膜的功能,视网膜形态工程已经产生了有望缓解现代计算机视觉任务中出现的挑战的设备。近年来,在需要低延迟、高能效、高动态范围传感解决方案的应用中,已经投入了大量的工作来开发和部署这些基于事件的视觉传感器。然而,对于感官管道所需的各种编码和解码机制的面积、能量和速度分析,并没有太多的研究。本文概述了一个经验框架,在各种方法之间进行了明确的权衡,将物理信息转换为峰值(编码),并从事件事件(解码)中重建所述刺激。这些方法的基于软件的模型被构建来评估各种输入配置文件的刺激重建的准确性。结果表明,针对特定的一组系统驱动的成本指标,优化的视胚结构需要一个由95%时间对比度像素和5%强度编码器组成的异构结构,假设时间抖动为1μs。就像神经节细胞在磁细胞和细小细胞通路中的组成一样,这种多模式解决方案提供了最节省时间、面积和功率的方法来传递视觉数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信