ARCHITECTURE AND APPLICATIONS OF THE HIPAR VIDEO SIGNAL PROCESSOR

Authors
Citation
K. Ronner et J. Kneip, ARCHITECTURE AND APPLICATIONS OF THE HIPAR VIDEO SIGNAL PROCESSOR, IEEE transactions on circuits and systems for video technology, 6(1), 1996, pp. 56-66
Citations number
33
Categorie Soggetti
Engineering, Eletrical & Electronic
ISSN journal
10518215
Volume
6
Issue
1
Year of publication
1996
Pages
56 - 66
Database
ISI
SICI code
1051-8215(1996)6:1<56:AAAOTH>2.0.ZU;2-F
Abstract
In this paper, we propose the architecture of a highly parallel DSP (H iPAR-DSP) as a flexible and programmable processor for image and video processing. The design of this processor is based on an analysis of c haracteristic properties of image processing algorithms in terms of av ailable parallelization resources, demands on program control, and req uired data access mechanisms. This led to a very long instruction word (VLIW)-controlled ASIMD RISC-architecture with four or sixteen data p aths, employing data-level parallelism, parallel instructions, micro-i nstruction pipelining, and data transfer concurrently to data processi ng. Common data access patterns for image processing algorithms are su pported by use of a shared on-chip memory with parallel matrix type ac cess patterns and a separate data-cache per data path, By properly bal ancing processing and controlling capabilities as internal and externa l memory bandwidth, this approach is optimized to make best use of cur rently available silicon resources. A high clock frequency is achieved by implementation of classic RISC features. The architecture fully su pports high level language programming. With the 16 data path version at 100 MHz clock, a sustained performance of more than 2 billion arith metic operations per second (GOPS) is achieved for a wide range of alg orithms. Given examples show the parallel implementation of image proc essing algorithms like histogramming, Hough transform, or search in a sorted list with efficient use of the processor resources. A prototype of the architecture with four parallel data paths will be available i n the second quarter of 1996, using a 0.6 mu m CMOS technology.