Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Array4: __cuda_array_interface__ v3 #30

Merged
merged 9 commits into from
Oct 17, 2022

Commits on Oct 17, 2022

  1. Array4: __cuda_array_interface__ v2

    Start implementing the `__cuda_array_interface__` for zero-copy
    data exchange on Nvidia CUDA GPUs.
    ax3l committed Oct 17, 2022
    Configuration menu
    Copy the full SHA
    62f2340 View commit details
    Browse the repository at this point in the history
  2. MultiFab: CuPy Test

    ax3l committed Oct 17, 2022
    Configuration menu
    Copy the full SHA
    271a021 View commit details
    Browse the repository at this point in the history
  3. MFIter: Finalize() on StopIteration

    Since `for` loops create no scope in Python, we need to trigger
    finalize logic, including stream syncs, before the destructor of
    `MultiFab` iterators are called.
    ax3l committed Oct 17, 2022
    Configuration menu
    Copy the full SHA
    5acd36a View commit details
    Browse the repository at this point in the history
  4. Add numba test

    incl. 3D kernel launch
    RemiLehe authored and ax3l committed Oct 17, 2022
    Configuration menu
    Copy the full SHA
    6965f9a View commit details
    Browse the repository at this point in the history
  5. Add pytorch

    RemiLehe authored and ax3l committed Oct 17, 2022
    Configuration menu
    Copy the full SHA
    9f539f9 View commit details
    Browse the repository at this point in the history
  6. CuPy Fuse: Avoid Extra Memset

    ax3l committed Oct 17, 2022
    Configuration menu
    Copy the full SHA
    4175194 View commit details
    Browse the repository at this point in the history
  7. MultiFab Device Test: Fixes

    ax3l committed Oct 17, 2022
    Configuration menu
    Copy the full SHA
    6eb2da4 View commit details
    Browse the repository at this point in the history
  8. Update to v3

    ax3l authored Oct 17, 2022
    Configuration menu
    Copy the full SHA
    7f6d80b View commit details
    Browse the repository at this point in the history
  9. Array4: TODO from CUDA

    A bit tricky to implement this caster as new constructor.
    Not currently needed, but adds comments where to do this.
    ax3l committed Oct 17, 2022
    Configuration menu
    Copy the full SHA
    e65cd41 View commit details
    Browse the repository at this point in the history