Skip to content

Commit 540afee

Browse files
committed
Fix minor details
1 parent 15918f7 commit 540afee

File tree

1 file changed

+3
-4
lines changed

1 file changed

+3
-4
lines changed

recipes_source/recipes/tuning_guide.py

+3-4
Original file line numberDiff line numberDiff line change
@@ -213,7 +213,7 @@ def gelu(x):
213213

214214
###############################################################################
215215
# Typically, the following environment variables are used to set for CPU affinity with GNU OpenMP implementation. ``OMP_PROC_BIND`` specifies whether threads may be moved between processors. Setting it to CLOSE keeps OpenMP threads close to the primary thread in contiguous place partitions. ``OMP_SCHEDULE`` determines how OpenMP threads are scheduled. ``GOMP_CPU_AFFINITY`` binds threads to specific CPUs.
216-
# An important tuning parameter is core pinning which prevent the threads of migrating between multiple CPUs, enchancing data location and minimizing intra core communication.
216+
# An important tuning parameter is core pinning which prevent the threads of migrating between multiple CPUs, enchancing data location and minimizing intra-core communication.
217217
#
218218
# .. code-block:: sh
219219
#
@@ -327,14 +327,13 @@ def gelu(x):
327327
#
328328
# Hardware tensor core operations tend to use a different floating point format
329329
# which sacrifices precision at expense of speed gains.
330-
331-
torch.backends.cuda.matmul.allow_tf32
332-
333330
# Prior to pytorch 1.12 this was enabled by default but since this version
334331
# it must be explicitly set as it can conflict with some operations which do not
335332
# benefit from Tensor core computations.
336333

337334

335+
torch.backends.cuda.matmul.allow_tf32
336+
338337
###############################################################################
339338
# Use CUDA Graphs
340339
# ~~~~~~~~~~~~~~~~~~~~~~~

0 commit comments

Comments
 (0)