A dynamic congestion management system for InfiniBand networks
Mizero, F., Veeraraghavan, M., Liu, Q., Russell, R. D., Dennis, J. M.. (2016). A dynamic congestion management system for InfiniBand networks. Supercomputing Frontiers and Innovations, doi:10.14529/jsfi160201
Title | A dynamic congestion management system for InfiniBand networks |
---|---|
Author(s) | Fabrice Mizero, Melathy Veeraraghavan, Qian Liu, Robert D. Russell, John M. Dennis |
Abstract | While the InfiniBand link-by-link flow control helps avoid packet loss, it unfortunately causes the effects of congestion to spread through a network. Flows whose paths do not even pass through congested ports could suffer from reduced throughput. We propose a Dynamic Congestion Management System (DCMS) to address this problem. Without per-flow information, the DCMS leverages performance counters of switch ports to detect onset of congestion, and determines whether-or-not victim flows are present. The DCMS then takes actions to cause an aggressive reduction in the sending rates of congestion-causing (contributor) flows if victim flows are present. On the other hand, in the absence of victim flows, the DCMS allows the contributor flows to maintain high sending rates and finish as quickly as possible.Our results show that dynamic congestion management can enable a network to serve both contributor flows and victim flows effectively. The DCMS solution operates within the constraints of the InfiniBand Standard. |
Publication Title | Supercomputing Frontiers and Innovations |
Publication Date | Sep 1, 2016 |
Publisher's Version of Record | https://dx.doi.org/10.14529/jsfi160201 |
OpenSky Citable URL | https://n2t.net/ark:/85065/d7jw8hf4 |
OpenSky Listing | View on OpenSky |
CISL Affiliations | TDD, ASAP |