Communication in parallel systems

Friedhelm  Meyer auf der Heide


1996, Lecture Notes in Computer Science

Abstract

Efficient communication in networks is a prerequisite for exploiting the performance of large parallel systems. For this reason, much effort has been made in recent years to develop efficient communication mechanisms. In this paper we survey the foundations and recent developments in designing and analyzing efficient packet routing algorithms.

Organization of the Paper

In the following chapter we introduce the basic notation for networks, messages, and routing protocols. In Chapter 3 we introduce the routing number of a network and relate it to the dilation and congestion of path systems. Chapter 4 contains an overview of oblivious routing protocols, and Chapter 5 describes efficient adaptive routing protocols.

2 Networks, Messages, Protocols

In this chapter we introduce the basic notions used in routing theory. In particular, we describe a commonly used hardware model and message passing model, define the routing problem, and describe different classes of strategies for solving routing problems.

2.1 The Hardware Model

We model the topology of a network as an undirected graph G = (V, E). V represents the computers or processors, and E represents the communication links. We assume that the communication links are bidirectional, that is, each edge represents two links, one in each direction. The bandwidth of a link is defined as the number of messages it can forward in one time step. Unless explicitly mentioned we assume that the bandwidth
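The hardware model and the path-system measures named above can be illustrated with a minimal sketch. The definitions used here are assumptions, since the excerpt names dilation and congestion without defining them: dilation is taken as the length of the longest path in the system, and congestion as the largest number of paths crossing a single directed link. The function names `make_links`, `dilation`, and `congestion` are hypothetical.

```python
from collections import Counter


def make_links(edges):
    """Expand undirected edges into directed links, two per edge."""
    links = set()
    for u, v in edges:
        links.add((u, v))
        links.add((v, u))
    return links


def dilation(paths):
    """Assumed definition: number of links on the longest path."""
    return max((len(p) - 1 for p in paths), default=0)


def congestion(paths):
    """Assumed definition: most paths sharing one directed link."""
    load = Counter()
    for p in paths:
        for link in zip(p, p[1:]):
            load[link] += 1
    return max(load.values(), default=0)


# Example: a ring on four nodes, 0-1-2-3-0.
ring_edges = [(0, 1), (1, 2), (2, 3), (3, 0)]
ring_links = make_links(ring_edges)

# A small path system; each path is a list of nodes.
ring_paths = [[0, 1, 2], [1, 2, 3], [3, 0]]

# Every hop of every path must be a link of the network.
assert all(l in ring_links for p in ring_paths for l in zip(p, p[1:]))

print(dilation(ring_paths), congestion(ring_paths))
```

Here the longest path has two links, and the link (1, 2) is shared by two paths, so both measures equal 2 for this path system.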

Efficient routing of messages is critical to the performance of direct network systems. The popular wormhole routing technique faces several challenges, particularly flow control and deadlock avoidance. Massively parallel computers with thousands of processors are considered the most promising technology to achieve teraflops computational power. Such large-scale multiprocessors are usually organized as ensembles of nodes, where each node has its own processor, local memory, and other supporting devices. These nodes may have different functional capabilities. For example, the set of nodes may include vector processors, graphics processors, I/O processors, and symbolic processors. The way the nodes are connected to one another varies among machines. In a direct network architecture, each node has a point-to-point, or direct, connection to some number of other nodes, called neighboring nodes. Direct networks have become a popular architecture for constructing massively parallel computers because they scale well; that is, as the number of nodes in the system increases, the total communication bandwidth, memory bandwidth, and processing capability of the system also increase. Figure 1 shows a generic multiprocessor with a set of nodes interconnected through a direct network. Because they do not physically share memory, nodes must communicate by passing messages through the network. Message size may vary, depending on the application. For efficient and fair use of network resources, a message is often divided into packets prior to transmission. A packet is the smallest unit of communication that contains routing and sequencing information; this information is carried in the packet header. Neighboring nodes may send packets to one another directly, while nodes that are not directly connected must rely on other nodes in the network to relay packets from source to destination. In many systems, each node contains a separate router to handle such communication-related tasks.
Although a router's function could be performed by the corresponding local processor, dedicated routers are used to allow overlapped computation and communication within each node. Figure 2 shows the architecture of a generic node. Each router supports some number of input and output channels. Normally, every input channel is paired with a corresponding output channel. Internal channels connect the local processor/memory to the router. Although it is common to provide only one pair of internal channels, some systems use more internal channels to avoid a communication bottleneck between the local processor/memory and the router. External channels are used for communication between routers and, therefore, between nodes. In
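The division of a message into packets, each carrying routing and sequencing information in its header, can be sketched as follows. This is an illustration under stated assumptions, not the format of any particular machine: the header fields (`source`, `destination`, `seq`, `total`) and the names `Packet`, `packetize`, and `reassemble` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Packet:
    # Header: routing information (assumed fields).
    source: int
    destination: int
    # Header: sequencing information (assumed fields).
    seq: int
    total: int
    # Body: one slice of the original message.
    payload: bytes


def packetize(message, source, destination, max_payload):
    """Split a message into packets of at most max_payload bytes."""
    if max_payload <= 0:
        raise ValueError("max_payload must be positive")
    chunks = [message[i:i + max_payload]
              for i in range(0, len(message), max_payload)] or [b""]
    return [Packet(source, destination, seq, len(chunks), chunk)
            for seq, chunk in enumerate(chunks)]


def reassemble(packets):
    """Rebuild the message from packets received in any order."""
    ordered = sorted(packets, key=lambda p: p.seq)
    if not ordered or len(ordered) != ordered[0].total:
        raise ValueError("missing packets")
    return b"".join(p.payload for p in ordered)


packets = packetize(b"hello world", source=0, destination=3, max_payload=4)
# Packets may arrive out of order; the sequence numbers restore the order.
restored = reassemble(list(reversed(packets)))
print(len(packets), restored)
```

Because each packet carries its own destination and sequence number, intermediate nodes can relay packets independently and the receiver can restore the original order.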

The joint problem of selecting a primary route for each communicating pair and a capacity value for each link in computer communication networks is considered. The network topology and traffic characteristics are given; a set of candidate routes and a set of candidate capacities for each link are also available. The goal is to obtain the least costly feasible design, where the costs include both capacity and queuing components. Lagrangean relaxation and subgradient optimization techniques were used in order to obtain verifiably good solutions to the problem. The method was tested on several topologies, and in all cases good feasible solutions, as well as tight lower bounds, were obtained.

I. INTRODUCTION

As a result of the important advantages they offer, both the number and the range of applications supported by communication-based computer systems have significantly increased. A variety of computer networks, such as SNA [17], BNA [18], and DECNET [7] architectures, TELENET [25], TYMNET [26], TRANSPAC [6], and DATAPAC [4] are currently available. This paper deals with the following problem faced by the network designer whenever a new network is set up or when an existing network is to be expanded: how to simultaneously select the link capacities and the routes to be used by the communicating nodes in the network, so as to ensure an acceptable performance level at a minimum cost. The topology of the network and estimates of the external traffic requirements are given. Messages in the network follow static, nonbifurcated routes, a routing strategy adopted by many operational networks. The effectiveness of fixed routing methods is also supported by the simulation results presented in [15], suggesting that at steady state there is no significant difference between the delays induced in a network by good static and adaptive routing strategies.
Static routing policies are implemented by providing each pair of communicating nodes in the network with an ordered set of routes, out of which the first available route is chosen whenever a session is initiated. Such is, for instance, the general framework for routing in SNA-based networks (see [1]). Recently, the model presented in [13] has been implemented by IBM in a commercial product, NETDA [23]. Consistent with this approach, we concentrate here on the choice of the primary route, i.e., the recommended one in the candidate set. Though some attempts at a formal treatment of the backbone network design problem in a general setting exist (see [3], [SI, [16], [19], and more recently, [9], [10], and [24]), much of the

Paper approved by the Editor for Wide Area Networks of the IEEE Communications Society.
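The static routing policy described above, an ordered set of candidate routes from which the first available one is chosen at session start, can be sketched as below. This is a minimal illustration, not the SNA or NETDA mechanism: the availability test (a route is usable when none of its links is in a set of failed links) and the names `select_route` and `route_is_up` are assumptions.

```python
def route_is_up(route, failed_links):
    """Assumed availability test: no hop of the route has failed."""
    return all(link not in failed_links for link in zip(route, route[1:]))


def select_route(candidate_routes, failed_links):
    """Return the first available route in priority order, else None.

    candidate_routes[0] plays the role of the primary route.
    """
    for route in candidate_routes:
        if route_is_up(route, failed_links):
            return route
    return None


# Ordered candidate set for the pair (A, D); the first entry is primary.
candidates = [["A", "B", "D"], ["A", "C", "D"]]

no_failures = select_route(candidates, failed_links=set())
with_failure = select_route(candidates, failed_links={("A", "B")})
all_down = select_route(candidates, failed_links={("A", "B"), ("C", "D")})

print(no_failures, with_failure, all_down)
```

With no failures the primary route is selected; when a link on it fails, the session falls back to the next route in the ordered set, and if every candidate is down no route is returned.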
