Domain Decomposition for neXtSIM_DG

Coding conventions

We follow the same criteria as those used in the neXtSIM_DG project.

A dedicated clang format file has been designed for the code. You may run it locally and manually with the command clang-format -i $yourfile or have a plugin with your favorite code editor or implement a git pre-commit hook locally by putting this pre-commit file in your .git/hooks/. An example pre-commit file can be found at .pre-commit. This clang formatting will also be run each time a pull request is done as part of the continuous integration.

Problem Statement

We address the problem of domain decomposition of numerical ocean and sea-ice models. Such models typically use a sea-land mask to omit unnecessary computations on land. A typical approach is to use a cartesian (or rectilinear or general block distribution) domain decomposition among processors, where the domain is divided in equally sized sub-domains, ignoring the sea-land mask. This, however, may lead to significant load imbalance which impedes scalability.

We have identified the following requirements for the domain decomposition algorithm:

produce balanced sub-domains in terms of work and communication
produce rectangular sub-domains as this will help in handling communication during halo exchanges between neighbouring processes
be static (computed either offline or during the initialiasation phase)
be scalable

The proposed approach is based on the Recursive Coordinate Bisection (RCB) geometric partitioning algorithm ¹. Geometric coordinates are first partitioned into two balanced parts. Partitioning continues recursively in each part until the desired number of balanced parts has been created. The algorithm can be tuned to build rectilinear partitions. We are using the implementation of the RCB algorithm available in the Zoltan library developed by Sandia National Laboratories.

Getting Started

Requirements

A Unix-like operating system (e.g., Linux or Mac OS X)
ANSI C++ compiler
MPI library for message passing (e.g., MPICH or OpenMPI)
CMake >= 3.10
netCDF-4 C, built with parallel I/O support to netCDF-4 files through HDF5 and to classic files through PnetCDF
Zoltan, built with CMake from the Trilinos package
Catch2 for unit testing
Boost program_options library

Building Zoltan from source

git clone https://github.com/trilinos/Trilinos.git
cd Trilinos
mkdir build && cd build
cmake \
 -DTPL_ENABLE_MPI:BOOL=ON \
 -DTrilinos_ENABLE_ALL_PACKAGES:BOOL=OFF \
 -DTrilinos_ENABLE_Zoltan:BOOL=ON \
 -DTrilinos_ENABLE_Fortran:BOOL=OFF \
 -DZoltan_ENABLE_EXAMPLES:BOOL=ON \
 -DZoltan_ENABLE_TESTS:BOOL=ON \
 -DBUILD_SHARED_LIBS=ON \
 -DCMAKE_INSTALL_PREFIX:FILEPATH=<path> \
 ..
make
make test
make install

If you want to use an MPI library installed in a non-standard location you will need to additionally set:

 -DCMAKE_CXX_COMPILER=<mpicxx> -DCMAKE_C_COMPILER=<mpicc>

Building PnetCDF from source

Download v1.12.3 of PnetCDF.

cd PnetCDF
autoreconf -i
CXX=mpicxx CC=mpicc ./configure --disable-fortran --enable-shared --prefix=<installation_path>
make
make install

Building netCDF from source with parallel I/O

Download v4.8.1 of netCDF-4 C. Parallel I/O is implemented through HDF5 (built with parallel I/O).

For netCDF-C (for more details see here for requirements):

cd netcdf-c-4.8.1
mkdir build && cd build
cmake \
 -DENABLE_PNETCDF=ON \
 -DCMAKE_C_COMPILER=mpicc \
 -DCMAKE_CXX_COMPILER=mpicxx \
 -DCMAKE_BUILD_TYPE=Release \
 -DCMAKE_PREFIX_PATH=<path_to_hdf5> \
 -DCMAKE_INSTALL_PREFIX=<installation_path> \
 ..
make
make test
make install

Installation

It is recommended to build the code in a separate directory from the source directory. The basic steps for building with CMake are:

Create a build directory, outside of the source directory.
In your build directory run cmake <path-to-src>
It is recommended to set options by calling ccmake in your build directory. Alternatively you can use the -DVARIABLE=value syntax in the previous step.
Run make to build.
Run make test to run tests.
Run make install to install.

The project installs a shared library that can be imported by other CMake projects as neXtSIMutils::domain_decomp, as well as a binary decomp that be used to partition grids directly.

How to run

The binary decomp can be used to partition a 2D grid with an optional land mask represented as a netCDF file. By default, the name of the dimensions in the netCDF file are x and y with the y dimension increasing the fastest, and the name of the variable representing the land mask is mask. For netCDF files using the enhanced model, we assume the group that contains the information of interest is named data. These can be overridden using command-line options. For example:

mpirun -n 2 ./decomp --grid grid.nc --dim0 y --dim1 x --mask land_mask

The decomp tool produces two netCDF-4 files (using the classic data model) named partition_mask_<num_mpi_processes>.nc and partition_metadata_<num_mpi_processes>.nc with the following layout:

netcdf partition_mask_2 {
dimensions:
	x = 30 ;
	y = 30 ;
variables:
	short pid(x, y) ;

// global attributes:
		:num_processes = 2s ;
data:

 pid = ...
}

The netCDF variable pid is defined as the process ID of each point in the grid.

netcdf partition_metadata_2 {
dimensions:
	P = 2 ;
	T = 1 ;
	B = 1 ;
	L = UNLIMITED ; // (0 currently)
	R = UNLIMITED ; // (0 currently)

group: bounding_boxes {
  variables:
	int domain_x(P) ;
	int domain_y(P) ;
	int domain_extent_x(P) ;
	int domain_extent_y(P) ;
  data:
	domain_x = 0, 16 ;
	domain_y = 0, 0 ;
	domain_extent_x = 16, 14 ;
	domain_extent_y = 30, 30 ;
  } // group bounding_boxes

group: connectivity {
  variables:
	int top_neighbours(P) ;
	int top_neighbour_ids(T) ;
	int top_neighbour_halos(T) ;
	int bottom_neighbours(P) ;
	int bottom_neighbour_ids(B) ;
	int bottom_neighbour_halos(B) ;
	int left_neighbours(P) ;
	int left_neighbour_ids(L) ;
	int left_neighbour_halos(L) ;
	int right_neighbours(P) ;
	int right_neighbour_ids(R) ;
	int right_neighbour_halos(R) ;
  data:
	top_neighbours = 0, 1 ;
	top_neighbour_ids = 0 ;
	top_neighbour_halos = 30 ;
	bottom_neighbours = 1, 0 ;
	bottom_neighbour_ids = 1 ;
	bottom_neighbour_halos = 30 ;
	left_neighbours = 0, 0 ;
	right_neighbours = 0, 0 ;
  } // group connectivity
}

The netCDF variables domain_x/y are defined as the coordinates of the upper left corner of the bounding box for each MPI process using zero-based indexing and domain_extent_x/y are the extents in the corresponding dimensions of the bounding box for each MPI process. The file also defines the variables X_neighbours(P), X_neighbour_ids(X_dim) and X_neighbour_halos(X_dim), where X is top/bottom/left/right, which correspond to the number of neighbours per process, the neighbour IDs and halo sizes of each process sorted from lower to higher MPI rank.

Layout

The original version of decomp used a TBLR naming convention (top, down, left, right), but this was confusing in how it related to x and y co-ordinates.

To remove ambiguity we renamed the outputs produced in the partition_metadata_<num_mpi_processes>.nc file.

netcdf partition_metadata_3 {
dimensions:
 NX = 30 ;
 NY = 24 ;
 P = 3 ;
 T = 2 ;                                              0            12        24
 B = 2 ;                                               ┌──────────────────────►  y
 L = 1 ;                                               │
 R = 1 ;                                               │  ┌─────────┬─────────┐
group: bounding_boxes {                                │  │         │         │
   domain_x = 0, 0, 20 ;                               │  │         │         │
   domain_y = 0, 12, 0 ;                               │  │         │         │
   domain_extent_x = 20, 20, 10 ;                      │  │    0    │    1    │
   domain_extent_y = 12, 12, 24 ;                      │  │         │         │
  } // group bounding_boxes                            │  │         │         │
group: connectivity {                                  │  │         │         │
   top_neighbours = 0, 0, 2 ;                       20 │  ├─────────┴─────────┤
   top_neighbour_ids = 0, 1 ;                          │  │                   │
   top_neighbour_halos = 12, 12 ;                      │  │                   │
   bottom_neighbours = 1, 1, 0 ;                       │  │         2         │
   bottom_neighbour_ids = 2, 2 ;                       │  │                   │
   bottom_neighbour_halos = 12, 12 ;                   │  │                   │
   left_neighbours = 0, 1, 0 ;                      30 ▼  └───────────────────┘
   left_neighbour_ids = 0 ;
   left_neighbour_halos = 20 ;                         x
   right_neighbours = 1, 0, 0 ;
   right_neighbour_ids = 1 ;
   right_neighbour_halos = 20 ;
  } // group connectivity
}

M. Berger and S. Bokhari. "A partitioning strategy for nonuniform problems on multiprocessors." IEEE Trans. Computers, C-36 (1987) 570-580. ↩

Name		Name	Last commit message	Last commit date
Latest commit History 116 Commits
.github/workflows		.github/workflows
examples		examples
grids		grids
img		img
test		test
.clang-format		.clang-format
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
Grid.cpp		Grid.cpp
Grid.hpp		Grid.hpp
LICENSE		LICENSE
Partitioner.cpp		Partitioner.cpp
Partitioner.hpp		Partitioner.hpp
README.md		README.md
Utils.hpp		Utils.hpp
ZoltanPartitioner.cpp		ZoltanPartitioner.cpp
ZoltanPartitioner.hpp		ZoltanPartitioner.hpp
main.cpp		main.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Domain Decomposition for neXtSIM_DG

Coding conventions

Problem Statement

Getting Started

Requirements

Building Zoltan from source

Building PnetCDF from source

Building netCDF from source with parallel I/O

Installation

How to run

Layout

About

Releases

Packages

Contributors 4

Languages

License

nextsimhub/domain_decomp

Folders and files

Latest commit

History

Repository files navigation

Domain Decomposition for neXtSIM_DG

Coding conventions

Problem Statement

Getting Started

Requirements

Building Zoltan from source

Building PnetCDF from source

Building netCDF from source with parallel I/O

Installation

How to run

Layout

Footnotes

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages