Due to the large word lengths involved, communication and buffering are pot
entially the major problems in implementing the modular arithmetic used in
several cryptosystems. It is shown here how a single: linear systolic array
eliminates much of the associated overheads, thereby improving throughput
and the ratio of speed to area for modular exponentiation. Alternative form
s produce simpler processing elements and make fuller use of the hardware,
malting it more easily implemented in current technology. Such designs are
regarded as much safer for use in smartcards and embedded systems as they o
ffer greater protection against attacks using differential power analysis.
A 1024-bit array can be built in an area comparable to a 64-bit multiplier.