LibMP

Introduction

LibMP is a lightweight messaging library built on top of LibGDSync APIs, developed as a technology demonstrator to easily deploy the GPUDirect Async technology in applications. Main LibMP features are:

Thin layer on top of IB Verbs, LibGDSync
MPI out-of-band mechanism to distribute the process info order to establish IB connections
MPI is never used during actual communications
Point-to-point and one-sided communications, no collectives
No tags, no wildcards, no data types
Can easily combine GPUDirect Async with GPUDirect RDMA

Requirements

Basic LibMP requirements are:

OpenMPI v1.10 or newer
Mellanox OFED (MOFED) 4.0 or newer
Mellanox Connect-IB, ConnectX-4 HCAs or newer
LibGDSync

To use GPUDirect Async in combination with GPUDirect RDMA:

OpenMPI with CUDA support
A recent CUDA Toolkit is required, minimally 8.0
A recent display driver, i.e. r361, r367 or later, is required
The Mellanox OFED GPUDirect RDMA kernel module, https://github.com/Mellanox/nv_peer_memory, is required to allow the HCA to access the GPU memory.

Build

Use the scripts/env_setup.sh file to specify MPI_PATH, CUDA_PATH, LIBGDSYNC_PATH and LIBMP_PATH env vars useful for both LibMP and LibGDSync.

Use the build.sh script to build LibMP.

Run

In scripts folder:

wrapper.sh: sample script with some topology example
test.sh: sample script to test all libmp examples and benchmarks

You need to create your own hostfile inside scripts directory

COMM library

COMM is an additional library built on top of LibMP. With COMM you can easily deploy LibMP in you applications; the pingpong is an example of COMM usage.

GPUDirect Async suite

We created a new repository here in order to collect in a single project all the components of the GPUDirect Async technology. In this repo you can find several scripts useful to configure, build and run all the GPUDirect Async libraries, tests, benchmarks and examples.

Acknowledging LibMP and GPUDirect Async

If you find this software useful in your work, please cite:

"GPUDirect Async: exploring GPU synchronous communication techniques for InfiniBand clusters", E. Agostini, D. Rossetti, S. Potluri. Journal of Parallel and Distributed Computing, Vol. 114, Pages 28-45, April 2018

"Offloading communication control logic in GPU accelerated applications", E. Agostini, D. Rossetti, S. Potluri. Proceedings of the 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid’ 17), IEEE Conference Publications, Pages 248-257, Nov 2016

Name		Name	Last commit message	Last commit date
Latest commit History 94 Commits
benchmarks		benchmarks
comm_library		comm_library
examples		examples
include		include
scripts		scripts
src		src
.gitmodules		.gitmodules
Makefile.am		Makefile.am
README.md		README.md
autogen.sh		autogen.sh
build.sh		build.sh
configure.ac		configure.ac
libmp.spec.in		libmp.spec.in

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LibMP

Introduction

Requirements

Build

Run

COMM library

GPUDirect Async suite

Acknowledging LibMP and GPUDirect Async

About

Releases

Packages

Contributors 2

Languages

gpudirect/libmp

Folders and files

Latest commit

History

Repository files navigation

LibMP

Introduction

Requirements

Build

Run

COMM library

GPUDirect Async suite

Acknowledging LibMP and GPUDirect Async

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages