Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/17407acmconferencesBook PagePublication PagesiscaConference Proceedingsconference-collections
ISCA '86: Proceedings of the 13th annual international symposium on Computer architecture
1986 Proceeding
Publisher:
  • IEEE Computer Society Press
  • Washington
  • DC
  • United States
Conference:
Tokyo Japan June 2 - 5, 1986
ISBN:
978-0-8186-0719-6
Published:
01 June 1986
Sponsors:
Next Conference
Reflects downloads up to 03 Nov 2024Bibliometrics
Abstract

No abstract available.

Article
Free
A model and an architecture for a relational knowledge base
Pages 2–9

A relational knowledge base model and an architecture which manipulates the model are presented. An item stored in the relational knowledge base is called a term. A unification operation on terms in the relational knowledge base is used as the retrieval ...

Article
Free
Implementation and evaluation of a list-processing-oriented data flow machine
Pages 10–19

The architecture of a data flow machine, called DFM, is developed for parallel list processing. The DFM can maximally exploit parallelism inherent in list processing, due to its ultra-multi-processing mechanism, packet communication-based parallel and ...

Article
Free
A new string search hardware architecture for VLSI
Pages 20–27

This paper presents a new architecture for practical string search hardware design. This architecture is based on the finite state automaton design concept using a character control charge transfer model. The resultant hardware is a set of programmable ...

Article
Free
Parallel algorithms and architectures for rule-based systems
Pages 28–37

Rule-based systems, on the surface, appear to be capable of exploiting large amounts of parallelism—it is possible to match each rule to the data memory in parallel. In practice, however, we show that the speed-up from parallelism is quite limited, less ...

Article
Free
Concert: design of a multiprocessor development system
Pages 40–48

Concert is a shared-memory multiprocessor testbed intended to facilitate experimentation with parallel programs and programming languages. It consists of up to eight clusters, with 4-8 processors in each cluster. The processors in each cluster ...

Article
Free
Memory requirements for balanced computer architectures
Pages 49–54

One particular result is that to balance an array of p linearly connected PEs for performing matrix computations such as matrix multiplication and matrix triangularization, the size of each PE's local memory must grow linearly with p. Thus, the larger ...

Article
Free
Graph allocation in static dataflow systems
Pages 55–64

One of the most important considerations for a dataflow multiprocessor is the algorithm by which the nodes of a program graph are allocated for execution to its processors. In the case of the static type of architecture one must consider pipelining as ...

Article
Free
Software implementation of a recursive fault tolerance algorithm on a network of computers
Pages 65–72

RAFT is a recursive algorithm for fault tolerance that uses a combination of dynamic space and time redundancy techniques for detecting faulty processors and recovering from errors. U* is a multicomputer testbed consisting of a network of AT&T 3B2 ...

Article
Free
Microprogrammable processor for object-oriented architecture
Pages 74–81

An advanced microprocessor has been developed for the high performance execution of object oriented language programs. In object oriented languages, improvement of frequent or complex operations such as dynamic type checking, procedure calls, and ...

Article
Free
An instruction fetch unit for a graph reduction machine
Pages 82–91

The G-machine provides architecture support for the evaluation of functional programming languages by graph reduction. This paper describes an instruction fetch unit for such an architecture that provides a high throughput of instructions, low latency ...

Article
Free
Fast object-oriented procedure calls: lessons from the Intel 432
Pages 92–101

As modular programming grows in importance, the efficiency of procedure calls assumes an ever more critical role in system performance. Meanwhile, software designers are becoming more aware of the benefits of object-oriented programming in structuring ...

Article
Free
On coupling many small systems for transaction processing
Pages 104–110

The prospect of coupling a large number of small inexpensive microprocessor based systems to deliver the performance of a large transaction processing system at lower cost has not been realized, to date. Inter-system interference, multi-system coupling ...

Article
Free
Performance measurement of paging behavior in multiprogramming systems
Pages 111–118

This paper presents empirical results on the performance of CD, a compiler directed memory management policy, and the Working Set policy in a multiprogramming system. A description of the multiprogramming model used in the experiments is also presented. ...

Article
Free
ATUM: a new technique for capturing address traces using microcode
Pages 119–127

Trace-driven simulation is often used in the design of computer systems, especially caches and translation lookaside buffers. Capturing address traces to drive such simulations has been problematic, often involving 1000:1 software overhead to trace a ...

Article
Experimenting with EPILOG: some results and preliminary conclusions
Pages 119–127

The EPILOG language and model of computation are briefly described, together with four multiprocessor architectures on which it is proposed to run the model: a form of chordal network and a slight variant of Wu and Feng's Baseline and Reverse Baseline ...

Article
A unification processor based on a uniformly structured cellular hardware
Pages 128–139

In this paper, an implementation of unification using a systolic-like method is presented for a VLSI-oriented Prolog machine. Not pointers but a line of symbols and the arity of each symbol are used to express the structure of terms on a uniformly ...

Article
Free
The architecture and preliminary evaluation results of the experimental parallel inference machine PIM-D
Pages 149–156

A parallel inference machine based on the dataflow model and the mechanisms to support two types of logic programming languages are presented. The machine is constructed from multiple processing elements and structure memories interconnected through a ...

Article
Free
An efficient routing control for the SIGMA network Σ(4)
Pages 158–168

When processing vectors on SIMD computers, the interconnection network may become the bottleneck for performances if it lacks an efficient routing control unit. In the pass, many multistage networks have been designed, but general algorithms to control ...

Article
Free
REYSM, a high performance, low power multi-processor bus
Pages 169–174

In order to build lower cost multimicroprocessor systems, a narrow synchronous bus (15 active lines) is proposed. It multiplexes address and data on 8 bits, and arbitrates in two pipe-lined cycles on four lines. Due to the 20 to 40 MHz bus clock, and ...

Article
Free
The extra stage gamma network
Pages 175–182

The augmented data manipulator (ADM), inverse augmented data manipulator (IADM), and the gamma network are based on the Plus-Minus-2i connection patterns. In such a network there exist multiple paths to connect a source S to a destination D except when ...

Article
Free
Evaluation of the FACOM ALPHA Lisp machine
Pages 184–190

The FACOM ALPHA is the first and only commercially dedicated processor for Lisp and Prolog manufactured in Japan. This paper discusses the evaluation of the FACOM ALPHA for Lisp execution when compared with a general-purpose computer. The CPU use rate ...

Article
Free
An architecture for efficient Lisp list access
Pages 191–198

In this paper, we present a Lisp machine architecture that supports efficient list manipulation. This Lisp architecture is organized as two processing units: a List Processor (LP), that performs all list related operations and manages the list memory, ...

Article
Free
A functional level simulation engine of MAN-YO: a special purpose parallel machine for logic design automation
Pages 202–208

The architecture of a proto-type functional level simulator element of a massively parallel machine (MAN-YO) designed for logic design automation is presented. At functional level, hardware systems are described in a hardware description language, FDL. ...

Article
Free
Exploiting parallelism in a switch-level simulation machine
Pages 209–215

The parallelism inherent in actual circuits suggests that this parallelism might be exploited in a switch-level simulation machine, in order to reduce total simulation time. This paper explores the extent to which this parallelism exists and the extent ...

Article
Free
A hardware accelerator for speech recognition algorithms
Pages 216–223

This paper describes two custom architectures tailored to a speech recognition beam search algorithm. Both architectures have been simulated using real data and the results of the simulation are presented. The paper also describes the design process of ...

Article
Free
Evaluation of a prototype data flow processor of the SIGMA-1 for scientific computations
Pages 226–234

A processing element and a structure element of data flow computer SIGMA-1 for scientific computations is now operational. The elements are evaluated for several benchmark programs. For efficient execution of loop constructs, the sticky token mechanism ...

Article
Free
Stored data structures on the Manchester dataflow machine
Pages 235–242

Experience with the Manchester Dataflow Machine has highlighted the importance of efficient handling of stored data structures in a practical parallel machine. It has proved necessary to add a special-purpose structure store to the machine, and this ...

Article
A scalable dataflow structure store
Pages 243–250

A design for a highly parallel data structure store for the prototype Manchester Dataflow Computer is presented. The main design objective is to allow all storage functions to be performed concurrently. The functions include space allocation and garbage ...

Article
Free
AT2 = O(N log4 N), T = O(log N) fast Fourier transform in a light connected 3-dimensional VLSI
Pages 252–260

We can perform a N-point FFT with time performance T=Ο(log N) and area-time performance AT2=Ο(N log4 N), by using the 3-dimensional VLSI system which is optically interconnected. This performance exceeds the theoretical lower bound of the area-time ...

Article
Free
Modular architecture for high performance implementation of FFT algorithm
Pages 261–270

The paper presents two new versions of the FFT algorithm. Based on these versions a new VLSI oriented architecture for implementing of the FFT algorithm is introduced. It consists of a homogenous structure of processing elements. The structure has a ...

Contributors
  • Keio University
Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Acceptance Rates

Overall Acceptance Rate 543 of 3,203 submissions, 17%
YearSubmittedAcceptedRate
ISCA '224006717%
ISCA '193656217%
ISCA '173225417%
ISCA '132885619%
ISCA '122624718%
ISCA '082593714%
ISCA '062343113%
ISCA '051944523%
ISCA '042173114%
ISCA '031843620%
ISCA '021802715%
ISCA '011632415%
ISCA '991352619%
Overall3,20354317%