‘THE USE OF 4x4 SWITCHING ELEMENTS
IN THE MULTISTAGE CUBE NETWORK
George B.
Adams ot
Howard Jay Siegel
School of Electrical Engineering
Purdue University
West Lafayette, IN. 47007, USA
“Abstract — Multistage cubetype networks such as
the baseline, delta, Extra Stage Cube, Generalized Cube,
hdirect binary n-cube, omega, STARAN flip, and SW-
Panyan (S=F=2, L=n) have been proposed for use in
parallel and distributed computer systems, This paper
Examines one way to compare the relative merits of
Gaing 2x2 versus 4c4 crossbar switching elements 3s
building blocks for this type of network. In particular,
the probability and amount of delay through switching
Clement and expected time to transverse a switching ele-
nent, a8 a funetion of the number of messages entering
the switching element, will be quantified. ‘The results
presented give designers of parallel and distributed com-
Pater systems additional information to aid in intercon
nection network design.
1. Introduction
Multistage cube-type networks such as the baseline
ls}, delta [15], Extra Stage Cube [1], Generalized Cube
bal’ indirect binary n-cube {16}, omega [12], STARAN
fp’, and SW-banyan (S=F'=2, L=n) [9] have been
proposed for use in_parallel and’ distributed computer
Systems [20]. These include PASM [22], PUMPS {3}, the
Ballistic Missile Defense Agency distributed processing
test bed [13], Ultracomputer [10], the Flow Model Pro-
cessor of the Numerical Aerodynamic Simulator (4), and
fata flow machines [8]. ‘The Generalized Cube network is
fa network topology which was presented in [21, 23] 9s a
Standard for analyzing multistage cubetype networks,
‘This network has N input porte and N output ports,
where N=2", It is shown in Fig, 1 for N=8. ‘The net-
‘work ports are numbered from 0 to N-1. Input and out-
pat ports are network interfaces to external devices.
‘The Generalized Cube topology has n =log:N’
stages, where each stage consists of a set of N lines con
nected to N/2 interchange boxes. Each interchange box
is a two-input, two-output device and is individually
controlled. An interchange box cen be set to one of four
legitimate states, shown in Fig. 1. ‘The upper input and
output lines of a box are labeled identically, as are the
lower input and output lines.
"The network connections are based on the cube
interconnection functions [18, 10]. Stage i of the Gen-
eralized Cube topology pairs 1/0 lines (links) whose
‘This work was supported by the Nationsl Seience Foundation under
Grant ECS 016500, and the Air Force Oflce of Scientific
Research, Air Force’ Systems Command, USAF, under Grant
SFOSR-T8-3581,
G. B. Adams Il is now with the Researeh Institute for Advanced
‘Computer Science NASA Ames Research Center Moffett Feld, CA
‘94085, USA
‘€¥12039-6/84/000010585S01.0001984 IEEE,
z
&
?
stratcat EXCHANGE,
BROADEAST BROADCAST
‘The Generalized Cube network with N=8, and
Fig. 1
the four states of an interchange box.
addresses differ in the i** bit position. ‘Thus, when an
interchange box in stage i is set to exchange, the data
items input to that interchange box are transferred
between I/O lines which differ in the i bit position
When one is set to straight, there is no exchange of data,
"A more general form of interchange box is an
input/a-output (ax a) crossbar. A network related to
the Generalized Cube can be constructed from a x a
switching elements using cube-type connections between
Sages. Many payers in the literature diseuss using)
Switching elements larger than 2x2 interchange boxes for
implementing multistage cube-type networks [5-7, 11, 12,
4417, 24)
‘This paper examines one way to compare the rela
tive merits of 2x2 and 4x4 crossbar switching elements,
or nodes, with regard to performance. The work was
motivated by the study of interconnection networks for
the PASM and PUMPS systems. In particular, the pro-
ability. and amount of delay through » node and
expected time to transverse a node, a8 a function of the
umber of messages entering the node, will be quantified.
‘This is an extension of the work in [ld]. The results
presented give designers of parallel and distributed com-
puter systems additional information to aid in intercon
Rection network design. Theoretical interest and the
advent of VLSI alo provide motivation for investigating,
alternative switebing clement structures for cube-type
585networks. A 4x4 crossbar switching element is a logical
successor to the interchange box and has higher perfor-
mance potential, yet it is not of too great a complexity
to preelude implementation on a single VLSI chip [14}
‘The switching clement model used for the perfor:
mance analysis is given in Section 2. Section 3 presents
the performance of 4x4 crossbar nodes. ‘The analysis of
4x4 composite nodes, composed of four 2x2 switches, is
in Section 4.
2. Switching Element Model
‘The Generalized Cube operation model assumed for
the analysis includes routing fogs [12, 21] and. packet.
switched message handling [14]. "A packet consists of a
routing tag and a number of data items that makes ite
way from stage to stage, releasing lines and switching
nodes immediately after using them. ‘The size of each
input queue in a switching node is assumed to be an
intogral multiple of the packet sie. ‘Thus, packet size is
not restricted to any particular number of words
‘The purpose here is to investigate the performance
of Axed crossbars. versus 2x2 crossbars (interchange
boxes). Since a single interchange box is not function
ally comparable to a 4x4 erossbar (Le, it can only han
dle two items at a time instead of four), the 4x4
crossbar is compared to a Linput/4-output composition
of four interchange boxes. ‘This configuration is called a
4x4 composite node and is shown in Fig. 2. Level 1 of
composite node is the two interchange boxes connected
to the node inputs. Level 2 consists of the two inter-
change boxes connected to the outputs. A’ Generalized
Cube network constructed from properly connected (to
be specified later) 4x4 composite nodes is identical to
‘ne constructed from interchange boxes. Examination of
the 44 crossbar node in Fig. 3a shows that its external
connections are identical to those of the 434 comp.
node, so it ean be directly substituted for @ dyed compo.
site node. Fig. 3b depicts the crosspoint, or switch
between an input and an output, Unles otherwise
stated all nodes, both eressbar and composite, will be
assumed to be 454,
When building a multistage eube network with 4x4
nodes (ether eomposite or crossbar), each 4x4 node will
replace four 2x2 nodes. ‘Thus, n/2 columns of N/A dd
nodes are used instead of n columns of N/2 2x3 nodes
(to simplify the diseussion, assume n even; for n odd, one
column of 4x4 nodes vill be only half used). For
OSi