Abstract
uDAPL is a portable and platform independent communication library that provides RDMA as well as send/recv operations. Some well-known software has attempted to take advantage of uDAPL's portability, such as Open MPI, MVAPICH2, Intel MPI, and Cluster OpenMP. However, network bandwidth limitations can still be a bottleneck for applications using these software. Engaging a "Multirail" network is a method to by-pass this. In this paper, we design a non-threaded and a threaded approach to improve the performance of uDAPL over multirail configured clusters. The two approaches are evaluated on an InfiniBand cluster with different multirail configurations. The results show that the threaded approach improves by 33% and 148% the uni-directional bandwidth on the multi-port and the multi-HCA configured network respectively, and the nonthreaded approach improves ∼90% of the uni-directional bandwidth on the multi-HCA configured network. A similar improvement is achieved for the bi-directional bandwidth.
Original language | English |
---|---|
Title of host publication | NPC 2009 - 6th International Conference on Network and Parallel Computing |
Pages | 233-239 |
Number of pages | 7 |
DOIs | |
Publication status | Published - 1 Dec 2009 |
Externally published | Yes |
Event | NPC 2009 - 6th International Conference on Network and Parallel Computing - Gold Coast, QLD, Australia Duration: 19 Oct 2009 → 21 Oct 2009 |
Conference
Conference | NPC 2009 - 6th International Conference on Network and Parallel Computing |
---|---|
Country/Territory | Australia |
City | Gold Coast, QLD |
Period | 19/10/09 → 21/10/09 |