Asynchronous transfer mode and other network technologies for wide-area and high-performance cluster computing

Citation
Ka. Hawick et Ha. James, Asynchronous transfer mode and other network technologies for wide-area and high-performance cluster computing, J SUPERCOMP, 19(3), 2001, pp. 285-297
Citations number
29
Categorie Soggetti
Computer Science & Engineering
Journal title
JOURNAL OF SUPERCOMPUTING
ISSN journal
09208542 → ACNP
Volume
19
Issue
3
Year of publication
2001
Pages
285 - 297
Database
ISI
SICI code
0920-8542(2001)19:3<285:ATMAON>2.0.ZU;2-X
Abstract
We review fast networking technologies for both wide-area and high performa nce cluster computer systems. We describe our experiences in constructing a synchronous transfer mode (ATM)-based local- and wide-area clusters and the tools and technologies this experience led us to develop. We discuss our e xperiences using Internet Protocol on such systems as well as native ATM pr otocols and the problems facing wide-area integration of cluster systems. W e are presently constructing Beowulf-class computer clusters using a mix of Fast Ethernet and Gigabit Ethernet technology and we anticipate how such s ystems will integrate into a new local-area Gigabit Ethernet network and wh at technologies will be used for connecting shared HPC resources across wid e-areas. High latencies on wide-area cluster systems led us to develop a me tacomputing problem-solving environment known as distributed information sy stems control world (DISCWorld). We summarize our main developments in this project as well as the key features and research directions for software t o exploit computational services running on fast networked cluster systems.