Compact network reconfiguration in fat-trees

Autor: Feroz Zahid, Tor Skeie, Ernst Gunnar Gran, Evangelos Tasoulas, Bjorn Dag Johnsen, Bartosz Bogdanski
Rok vydání: 2016
Předmět:
Routing protocol
Dynamic Source Routing
Computer science
Equal-cost multi-path routing
Routing table
Distributed computing
Enhanced Interior Gateway Routing Protocol
Wireless Routing Protocol
Geographic routing
02 engineering and technology
Network topology
Theoretical Computer Science
Routing Information Protocol
0202 electrical engineering
electronic engineering
information engineering

Destination-Sequenced Distance Vector routing
Hierarchical routing
Triangular routing
020203 distributed computing
Interconnection
Zone Routing Protocol
Static routing
business.industry
ComputerSystemsOrganization_COMPUTER-COMMUNICATIONNETWORKS
Policy-based routing
Path vector protocol
DSRFLOW
Control reconfiguration
020202 computer hardware & architecture
Link-state routing protocol
Routing domain
Hardware and Architecture
Multipath routing
business
Software
Information Systems
Computer network
Zdroj: The Journal of Supercomputing. 72:4438-4467
ISSN: 1573-0484
0920-8542
Popis: In large high-performance computing systems, the probability of component failure is high. At the same time, for a sustained system performance, reconfiguration is often needed to ensure high utilization of available resources. Reconfiguration in interconnection networks, like InfiniBand (IB), typically involves computation and distribution of a new set of routes in order to maintain connectivity and performance. In general, current routing algorithms do not consider the existing routes in a network when calculating new ones. Such configuration-oblivious routing might result in substantial modifications to the existing paths, and the reconfiguration becomes costly as it potentially involves a large number of source–destination pairs. In this paper, we propose a novel routing algorithm for IB-based fat-tree topologies, SlimUpdate. SlimUpdate employs path preservation techniques to achieve a decrease of up to 80 % in the number of total path modifications, as compared to the OpenSM’s fat-tree routing algorithm, in most reconfiguration scenarios. Furthermore, we present a metabase-aided re-routing method for fat-trees, based on destination leaf-switch multipathing. Our proposed method significantly reduces network reconfiguration overhead, while providing greater routing flexibility. On successive runs, our proposed method saves up to 85 % of the total routing time over the traditional re-routing scheme. Based on the metabase-aided routing, we also present a modified SlimUpdate routing algorithm to dynamically optimize routes for a given MPI node order.
Databáze: OpenAIRE