Rearranged slides around exercise block III
CFrauen authored Sep 1, 2024
1 parent a44a394 commit 804910f
Showing 1 changed file with 7 additions and 6 deletions.
13 changes: 7 additions & 6 deletions B3/L9/docs/gpu_lecture.md
@@ -1012,11 +1012,6 @@ end do
* synccheck : Synchronization checking
* initcheck : Global memory initialization checking

# Hands-on exercises III

* See https://github.com/csc-training/esiwace-summerschool-2024/tree/main/B3/L9/exercises
* Profiling and performance optimisation

# Interoperability with libraries (BONUS)
- Often it may be useful to integrate the accelerated OpenACC code with other accelerated libraries or CUDA code.
- MPI, CUBLAS, CUFFT, MAGMA, CULA...
@@ -1033,7 +1028,6 @@ istat = curandGenerateUniform(g, y, n)
!$acc end host_data
```


# OpenACC - OpenMP

* Most OpenACC directives can be directly translated to OpenMP target directives:
@@ -1046,6 +1040,13 @@ istat = curandGenerateUniform(g, y, n)
| `!$ACC DATA COPYIN` | `!$OMP TARGET DATA MAP(TO:)` |
| `!$ACC UPDATE HOST` | `!$OMP TARGET UPDATE FROM()` |

# Hands-on exercises III

* See https://github.com/csc-training/esiwace-summerschool-2024/tree/main/B3/L9/exercises
* Profiling and performance optimisation
* Debugging (Exercise 6)
* Interoperability (Exercise 7, BONUS)

# Any questions or comments?

# Further reading