Releases: GridTools/gridtools
GridTools version 2.3.8
GridTools version 2.3.7
GridTools version 2.3.6
changes since v2.3.5
Minor feature
- fn: introduce fn::index as alias to positional #1806
Performance improvements
- Loop Blocking for fn GPU Backend #1787
- Use ‘const’ in Neighbor Table Value Types #1796
- Remove ldg_ptr and Replace Functionality by const_ptr_deref, possible improvement for sid::composite #1810
- Add Const to SID Neighbor Table Element Type #1808
Bug fixes
- Respect ReadOnly Property in Nanobind Adapter #1809
GridTools version 2.3.5
GridTools version 2.3.4
changes since v2.3.2 (v2.3.3 removed because it introduced a breaking change in the nanobind adapter)
Performance improvements
- Introduce GT_PROMISE for __builtin_assume by @havogt, @iomaganaris in #1785, #1788
Bug fixes
- Bug: storage/gpu.h functions within CUDA_ARCH by @havogt in #1778
- Update nanobind to v2 by @havogt in #1777, #1790
- Update minimum required boost to 1.73 by @havogt in #1772
Tests
- Add Missing fn_unstructured_nabla_fused_tuple_of_fields to Regression & Performance Tests by @fthaler in #1783
- Improved Data Layout for Neighbor Tables by @fthaler in #1782
CI / Deployment
- test NVHPC 23.9 by @havogt in #1769
- build: Update deployment action with trusted publisher by @havogt in #1770
- build: Add download wheel step in deployment action by @havogt in #1771
- CI: test CUDA 12.4 by @havogt in #1773
- GitHub actions: Update compiler versions in configure test by @havogt in #1760
- Move to CUDA 11.2 for daint nvcc gcc by @havogt in #1786
GridTools version 2.3.2
GridTools version 2.3.1
GridTools version 2.3.0
Support for NVHPC (#1747)
GridTools now supports NVHPC starting from release 23.3!
Parallel fn::backend::naive (#1746)
Naive (just parallel for
, no blocking and other optimizations) OpenMP parallelization of the naive backend.
SID util to transform a dimension to a tuple_like element type (#1750)
Translates a SID with dimension D
and element type T
to a SID with D
removed and type is tuple<T>
-like, with tuple_size N
for sid::dimension_to_tuple_like<D, N>(sid)
.
Bug fixes and smaller features
Build fixes
- Support for Clang 16 (#1751)
and other changes already included in v2.2.3
fn: SID neighbor table wrapper (#1730)
Adds a simple class that wraps a SID and implements the neighbour table concept. (Picked for convenience into 2.2.2.)
Support for Python packaging (#1720)
Starting with this release we will publish GridTools C++ on pypi.org to make it easier to consume GridTools C++ from GT4Py.
Bug fixes
- Fix CUDA 12.0 compilation (#1741)
- Improvements to Python packaging (#1742, #1743, #1744)
- Fix get_keys of empty hymap (#1728)
- fn: CUDA early exit on empty grid - an empty domain skips execution instead of erroring (#1729)
- fn: prefer qualified names over ADL for fn builtins (they are not customization points for the user) (#1731, #1732)
- Enable workarounds for CUDA 11.8 (#1734)
- Enable workarounds for Clang 15 (#1735)
- Update pybind11 version to fix wrong C++ standard (#1723)
- Fix perfect forwarding in sid::composite::make_values (#1722)
- Workaround for NVCC bug in gcl (present in 11.6, 11.7 and most likely in 11.8) (#1726)
Performance fixes
- Alternative skip value check in fn, which improves CUDA performance (#1721)
Build fixes
- Fix perftests CMake target when no tests are added (#1724)
Cleanup
- Replace boost::variant by std::variant (#1718)
CI
Contributions
This release contains contributions from
@DropD, @egparedes, @fthaler, @havogt, @petiaccja
GridTools version 2.0.1
Bug fixes
- Fix: storage_gpu for HIPCC-AMDGPU (#1540)
- Performance fix for C++17 (#1618)
- Enable several CUDA workarounds for recent compilers (#1681 and others)
- Some declarations to definitions
- Workaround gtest incompatibilities in recent compilers
CI
- Compile (and run) all tests on GitHub actions (Jenkins doesn't run on v2.0.x anymore)