Remove broken links in tuning guide (#2636)
A couple of "for more details" links to external sites are no longer finding
details. Better to remove them.

Signed-off-by: markstur <mark.sturdevant@ibm.com>
Co-authored-by: Svetlana Karslioglu <svekars@meta.com>
markstur and svekars authored Nov 1, 2023
1 parent 5d69b49 commit 53e4142
Showing 1 changed file with 1 addition and 4 deletions.
5 changes: 1 addition & 4 deletions recipes_source/recipes/tuning_guide.py
@@ -193,15 +193,12 @@ def fused_gelu(x):
#
# numactl --cpunodebind=N --membind=N python <pytorch_script>

-###############################################################################
-# More detailed descriptions can be found `here <https://software.intel.com/content/www/us/en/develop/articles/how-to-get-better-performance-on-pytorchcaffe2-with-intel-acceleration.html>`_.
-
###############################################################################
# Utilize OpenMP
# ~~~~~~~~~~~~~~
# OpenMP is utilized to bring better performance for parallel computation tasks.
# ``OMP_NUM_THREADS`` is the easiest switch that can be used to accelerate computations. It determines number of threads used for OpenMP computations.
-# CPU affinity setting controls how workloads are distributed over multiple cores. It affects communication overhead, cache line invalidation overhead, or page thrashing, thus proper setting of CPU affinity brings performance benefits. ``GOMP_CPU_AFFINITY`` or ``KMP_AFFINITY`` determines how to bind OpenMP* threads to physical processing units. Detailed information can be found `here <https://software.intel.com/content/www/us/en/develop/articles/how-to-get-better-performance-on-pytorchcaffe2-with-intel-acceleration.html>`_.
+# CPU affinity setting controls how workloads are distributed over multiple cores. It affects communication overhead, cache line invalidation overhead, or page thrashing, thus proper setting of CPU affinity brings performance benefits. ``GOMP_CPU_AFFINITY`` or ``KMP_AFFINITY`` determines how to bind OpenMP* threads to physical processing units.

###############################################################################
# With the following command, PyTorch run the task on N OpenMP threads.

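Illustration only, not part of this commit: the comments kept by the hunk describe ``OMP_NUM_THREADS`` as the switch that sets the OpenMP thread count. A minimal sketch of passing that variable to a script from Python follows. The helper name `run_with_omp_threads` is hypothetical, and the child process is a stand-in for `<pytorch_script>` that only echoes the value it received, so the sketch runs without PyTorch installed.

```python
import os
import subprocess
import sys


def run_with_omp_threads(num_threads, child_args):
    """Run a child Python process with OMP_NUM_THREADS set (hypothetical helper)."""
    env = dict(os.environ)  # copy, so the parent environment is left untouched
    env["OMP_NUM_THREADS"] = str(num_threads)
    # Assumption: the OpenMP runtime reads this variable when it initializes,
    # so it is supplied through the child's environment at launch time.
    result = subprocess.run(
        [sys.executable, *child_args],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


# Stand-in for <pytorch_script>: the child only prints the value it was given.
echo = ["-c", "import os; print(os.environ['OMP_NUM_THREADS'])"]
seen = run_with_omp_threads(4, echo)
print(seen)
```

An affinity variable such as ``GOMP_CPU_AFFINITY`` could be added to the same `env` dictionary, and a `numactl --cpunodebind=N --membind=N` prefix could be placed at the front of the argument list; both are left out here to keep the sketch portable.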