A dynamic congestion management system for InfiniBand networks

Mizero, F., Veeraraghavan, M., Liu, Q., Russell, R. D., Dennis, J. M. (2016). A dynamic congestion management system for InfiniBand networks. Supercomputing Frontiers and Innovations, doi:10.14529/jsfi160201

Title A dynamic congestion management system for InfiniBand networks
Author(s) Fabrice Mizero, Malathi Veeraraghavan, Qian Liu, Robert D. Russell, John M. Dennis
Abstract While the InfiniBand link-by-link flow control helps avoid packet loss, it unfortunately causes the effects of congestion to spread through a network. Flows whose paths do not even pass through congested ports could suffer from reduced throughput. We propose a Dynamic Congestion Management System (DCMS) to address this problem. Without per-flow information, the DCMS leverages performance counters of switch ports to detect the onset of congestion and determines whether or not victim flows are present. The DCMS then takes actions to cause an aggressive reduction in the sending rates of congestion-causing (contributor) flows if victim flows are present. On the other hand, in the absence of victim flows, the DCMS allows the contributor flows to maintain high sending rates and finish as quickly as possible. Our results show that dynamic congestion management can enable a network to serve both contributor flows and victim flows effectively. The DCMS solution operates within the constraints of the InfiniBand Standard.
Publication Title Supercomputing Frontiers and Innovations
Publication Date Sep 1, 2016
Publisher's Version of Record https://dx.doi.org/10.14529/jsfi160201
OpenSky Citable URL https://n2t.net/ark:/85065/d7jw8hf4
CISL Affiliations TDD, ASAP
