{"title":"Compact signed-digit adder using multiple-valued logic","authors":"Alejandro F. González, P. Mazumder","doi":"10.1109/ARVLSI.1997.634849","DOIUrl":"https://doi.org/10.1109/ARVLSI.1997.634849","url":null,"abstract":"As minimum feature sizes shrink and the number of transistors integrated in a single chip grow, interconnect complexity is one of the most important issues to be solved in future VLSI chips. The use of multivalued logic is one way to effectively solve this problem because multiple-valued signals convey more information than binary signals and thus, require a lower number of interconnecting wires to achieve similar bandwidths. This paper describes a new signed-digit adder design, which uses multiple-valued logic. The circuit is composed of resonant-tunneling diodes (RTDs) and MOS transistors. The negative differential-resistance (NDR) characteristics of RTDs help to achieve very compact circuits for implementing the multiple-valued functions found in signed-digit adders, MOS transistors are useful for implementing current-mode logic, in which addition of two or more signals is performed by simple wire interconnection. Since a redundant arithmetic is being used the selected transfer functions allow the proposed circuit to perform addition where no ripple-carry effect is present. The design was verified using circuit simulation. To demonstrate the validity of the principles being used, a modified prototype of the circuit was built. In the prototype, a standard 2-micron CMOS process was used to fabricate the MOS-based circuitry while RTDs were connected externally. Even though no fabrication processes which integrate RTDs and MOS devices are currently available, there are efforts on the development of such technologies so that the advantages of these devices can be combined.","PeriodicalId":201675,"journal":{"name":"Proceedings Seventeenth Conference on Advanced Research in VLSI","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116789625","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Image edge enhancement, dynamic compression and noise suppression using analog circuit processing","authors":"T. Hinck, A. Hubbard","doi":"10.1109/ARVLSI.1997.634850","DOIUrl":"https://doi.org/10.1109/ARVLSI.1997.634850","url":null,"abstract":"We designed circuits that have potential use as preprocessors of noisy image data, which vary in intensity over as much as four orders of magnitude. The circuits are based on equations representing a stage of the Boundary Contour System/Feature Contour System model. We compared the model performance (equations) using Gaussian and exponential filters with the simulated circuit performance. Our circuits achieved compression of up to four orders of input range, while maintaining contrast ratios in local regions. Edge enhancement is maintained down to a SNR of -20 dB, and at that noise level the circuit's RMS error performance is comparable to that of the mathematical formulation. For quantitative display purposes, we present responses to test stimulation a linear array of circuits. A 15/spl times/15 array design is also presented.","PeriodicalId":201675,"journal":{"name":"Proceedings Seventeenth Conference on Advanced Research in VLSI","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133193239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Signal delay in coupled, distributed RC lines in the presence of temporal proximity","authors":"V. Chandramouli, K. Sakallah, A. Kayssi","doi":"10.1109/ARVLSI.1997.634844","DOIUrl":"https://doi.org/10.1109/ARVLSI.1997.634844","url":null,"abstract":"With improvements in technology, accurate delay modeling of interconnects is becoming increasingly important. Due to decreasing feature sizes, the spacing between the signal lines is also decreasing. Consequently, the switching activities on the neighboring lines can have a significant impact on the delay of the line of interest, and can no longer be ignored. Accurate modeling of this phenomenon, which we call the proximity effect, is the subject of this paper. This is similar to the state-dependency of logic gate delays, where signal delay can be affected by the switching activities on the side inputs of a gate. We describe an efficient and accurate delay computation method using precomputed interconnect moments that treats the coupled lines as uniform, distributed RC lines and does not make any lumped approximations. This allows the proposed delay model to be used in a timing analysis tool operating over both gate and interconnect domains while accounting for state-dependency.","PeriodicalId":201675,"journal":{"name":"Proceedings Seventeenth Conference on Advanced Research in VLSI","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116296900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Next-generation RF circuits and systems","authors":"B. Razavi","doi":"10.1109/ARVLSI.1997.634859","DOIUrl":"https://doi.org/10.1109/ARVLSI.1997.634859","url":null,"abstract":"This paper describes developments foreseen to occur in the near future in the RF industry. Applications such as wireless local loops, wireless local area networks, RF identification devices, multi-standard transceivers, and cable modems are considered. The choice of IC technologies and CAD tools is also discussed.","PeriodicalId":201675,"journal":{"name":"Proceedings Seventeenth Conference on Advanced Research in VLSI","volume":"135 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127064083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Interfacing synchronous and asynchronous modules within a high-speed pipeline","authors":"Allen E. Sjogren, C. Myers","doi":"10.1109/ARVLSI.1997.634845","DOIUrl":"https://doi.org/10.1109/ARVLSI.1997.634845","url":null,"abstract":"This paper describes a new technique for integrating asynchronous modules within a high-speed synchronous pipeline. Our design eliminates potential metastability problems by using a clock generated by a stoppable ring oscillator, which is capable of driving the large clock load found in present day microprocessors. Using the ATACS design tool, we designed highly optimized transistor-level circuits to control the ring oscillator and generate the clock and handshake signals with minimal overhead. Our interface architecture requires no redesign of the synchronous circuitry. Incorporating asynchronous modules in a high-speed pipeline improves performance by exploiting data-dependent delay variations. Since the speed of the synchronous circuitry tracks the speed of the ring oscillator under different processes, temperatures, and voltages, the entire chip operates at the speed dictated by the current operating conditions, rather than being governed by the worst-case conditions. These two factors together can lead to a significant improvement in average-case performance. The interface design is tested using the 0.6 /spl mu/m HP CMOS14B process in HSPICE.","PeriodicalId":201675,"journal":{"name":"Proceedings Seventeenth Conference on Advanced Research in VLSI","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127911862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Design implementation of intrinsic area array ICs","authors":"C. Tan, D. Bouldin, P. Dehkordi","doi":"10.1109/ARVLSI.1997.634848","DOIUrl":"https://doi.org/10.1109/ARVLSI.1997.634848","url":null,"abstract":"Arranging I/O in a matrix array over the core circuitry of an IC generally provides 5-10 times more I/O than the traditional method of restricting pads to the periphery. This approach also minimizes overall die size. This method was pioneered by IBM over thirty years ago and has recently become attractive for new designs requiring several hundred I/O. In this paper we describe the development of a new area-array pad router which differs from other approaches in that no additional metal layer is added (unless needed) and no redistribution is required. We describe the design guideline definition, data preparation, pad placement, pad assignment, pad routing, and output padframe generation of this technique. The results of applying this router are shown for sample designs requiring 112, 298 and 485 I/Os.","PeriodicalId":201675,"journal":{"name":"Proceedings Seventeenth Conference on Advanced Research in VLSI","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114215444","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
David M. Dahle, Jeffrey D. Hirschberg, K. Karplus, H. Keller, Eric Rice, D. Speck, Douglas H. Williams, R. Hughey
{"title":"Kestrel: design of an 8-bit SIMD parallel processor","authors":"David M. Dahle, Jeffrey D. Hirschberg, K. Karplus, H. Keller, Eric Rice, D. Speck, Douglas H. Williams, R. Hughey","doi":"10.1109/ARVLSI.1997.634852","DOIUrl":"https://doi.org/10.1109/ARVLSI.1997.634852","url":null,"abstract":"Kestrel is a high-performance programmable parallel co-processor. Its design is the result of examination and reexamination of algorithmic, architectural, packaging, and silicon design issues, and the interrelations between them. The final system features a linear array of 8-bit processing elements, each with local memory, an arithmetic logic unit (ALU), a multiplier, and other functional units. Sixty-four Kestrel processing elements fit in a 1.4 million transistor, 60 mm/sup 2/, 0.5 /spl mu/m CMOS chip with just 84 pins. The planned single-board, 8-chip system will, for some applications, provide supercomputer performance at a fraction of the cost. This paper surveys four of our applications (sequence analysis, neural networks, image compression, and floating-point arithmetic), and discusses the philosophy behind many of the design comparator's compact instruction encoding and design, the architecture's facility with nested conditionals, and the multiplier's flexibility in performing multiprecision operations. Finally, we discuss the implementation and performance of the Kestrel test chips.","PeriodicalId":201675,"journal":{"name":"Proceedings Seventeenth Conference on Advanced Research in VLSI","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117218346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Clock distribution using cooperative ring oscillators","authors":"L. Hall, M. Clements, Wentai Liu, G. Bilbro","doi":"10.1109/ARVLSI.1997.634846","DOIUrl":"https://doi.org/10.1109/ARVLSI.1997.634846","url":null,"abstract":"This paper presents a new form of integrated ring oscillator, the Cooperative Ring Oscillator (CRO), in which the controllable delay elements are distributed throughout a VLSI chip. Specifically, each stage of the CRO consists of many electrically parallel delay elements that are spatially distributed. The high degree of parallelism in the CRO provides strong signal aggregation that significantly reduces the skew within each clock phase. The CRO performs both clock generation and clock delivery, thus unifying the tasks of the oscillator, clock buffers, and distribution network into a single circuit. The strength of the CRO technique is that it can deliver multiple, low-skew clock phases to all areas of a large VLSI device at a cost in chip resources comparable to that of current single-phase clock distribution techniques. This strength creates the opportunity for system designers to make extensive use of multi-phase logic techniques to improve system performance.","PeriodicalId":201675,"journal":{"name":"Proceedings Seventeenth Conference on Advanced Research in VLSI","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116884440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. DeWeerth, G. Patel, M. Simoni, D. Schimmel, R. Calabrese
{"title":"A VLSI architecture for modeling intersegmental coordination","authors":"S. DeWeerth, G. Patel, M. Simoni, D. Schimmel, R. Calabrese","doi":"10.1109/ARVLSI.1997.634854","DOIUrl":"https://doi.org/10.1109/ARVLSI.1997.634854","url":null,"abstract":"A hybrid analog/digital VLSI architecture that models the coordination of neurobiological segmental oscillators is presented. This architecture facilitates the modeling of systems such as those that produce swimming in vertebrates and fish, as well as motivates the creation of a class of biologically inspired, \"intelligent\" motion-control systems. The two primary components of the architecture are described: intrasegmental oscillators that consist of neurons and synapses that are implemented primarily rising analog VLSI circuits; and an intersegmental communication network that is implemented, primarily rising asynchronous digital circuits. Preliminary test results are presented to demonstrate the operation of the system and its components.","PeriodicalId":201675,"journal":{"name":"Proceedings Seventeenth Conference on Advanced Research in VLSI","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116071619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Asynchronous microengines for efficient high-level control","authors":"H. Jacobson, G. Gopalakrishnan","doi":"10.1109/ARVLSI.1997.634855","DOIUrl":"https://doi.org/10.1109/ARVLSI.1997.634855","url":null,"abstract":"Asynchronous (self-timed) circuits are quite natural for realizing control-intensive designs. Many such designs are of reactive nature and inherently complex due to complicated communication protocols. In these situations programmable controllers are preferable over hardwired controllers to allow design decisions to be bound late, help connect errors that may slip through the verification process, and even permit run-time modification of control algorithms to best suit the current situation. Virtually all recent work in asynchronous controller design focuses on generating hardwired controllers. In this paper, we propose an architecture for programmable asynchronous controllers in the form m of microprogrammed asynchronous \"microengine\". Architectures utilizing both two-phase and four-phase handshaking are proposed. The datapath structure of the asynchronous microengine is modular and easily extensible, facilitating changes during the design phase. We ensure high performance of the asynchronous microengine by exploiting concurrency between operations and employ efficient control structures. Initial results show that the proposed microengine can yield performance close to that offered by automated high-level synthesis tools targeting custom hardwired burst-mode machines for control.","PeriodicalId":201675,"journal":{"name":"Proceedings Seventeenth Conference on Advanced Research in VLSI","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1997-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115474801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}