N. Ide et al., 2.44-GFLOPS 300-MHz floating-point vector-processing unit for high-performance 3-D graphics computing, IEEE J SOLI, 35(7), 2000, pp. 1025-1033
A vector unit for high-performance three-dimensional graphics computing has
been developed, We implement four floating-point multiply-accumulate units
, which execute multiply-add operations with one throughput; one floating-p
oint divide/square root unit, which executes division and square-root opera
tions with six cycles at 300 MHz; and one vector general-purpose register f
ile, which has 128 bits x 32 words, The parallel execution of all units del
ivers a peak performance of 2.44 GFLOPS at 300 MHz.