We propose an image parallel ray tracing with a static load balancing
and data prefetching scheme on distributed memory multicomputers. We u
se a scattered decomposition of which the allocation unit is one pixel
and each processor lakes part in pixels scattered around the whole im
age space, which can make a load balance efficiently. In a scattered d
ecomposition scheme, remote data accesses may be frequent since it can
not utilize ray coherence sufficiently. To reduce the communication o
verhead due to this, we propose data prefetching by multicasting (DPM)
where requested object data are also transferred to the processors de
aling with the adjacent pixels if a processor requests a remote data.
We implemented our approach on the Intel's Paragon and showed the line
ar speedup. However, the advantages of data prefetching did not materi
alize due to the machine's inefficient multicast operation.