File tree 3 files changed +5
-5
lines changed
Docs/sphinx_documentation/source
3 files changed +5
-5
lines changed Original file line number Diff line number Diff line change @@ -229,9 +229,9 @@ Building with CMake
229
229
230
230
To build AMReX with GPU support in CMake, add
231
231
``-DAMReX_GPU_BACKEND=CUDA|HIP|SYCL `` to the ``cmake `` invocation, for CUDA,
232
- HIP and SYCL, respectively. By default, AMReX uses 256 threads per GPU
232
+ HIP and SYCL, respectively. By default, AMReX uses 128 threads per GPU
233
233
block/group in most situations. This can be changed with
234
- ``-DAMReX_GPU_MAX_THREADS=N ``, where ``N `` is 128 for example.
234
+ ``-DAMReX_GPU_MAX_THREADS=N ``, where ``N `` is 256 for example.
235
235
236
236
Enabling CUDA support
237
237
^^^^^^^^^^^^^^^^^^^^^
@@ -1166,7 +1166,7 @@ GPU block size
1166
1166
1167
1167
By default, :cpp: `ParallelFor ` launches ``AMREX_GPU_MAX_THREADS `` threads
1168
1168
per GPU block, where ``AMREX_GPU_MAX_THREADS `` is a compile-time constant
1169
- with a default value of 256 . The users can also explicitly specify the
1169
+ with a default value of 128 . The users can also explicitly specify the
1170
1170
number of threads per block by :cpp: `ParallelFor<MY_BLOCK_SIZE>(...) `, where
1171
1171
``MY_BLOCK_SIZE `` is a multiple of the warp size (e.g., 128). This allows
1172
1172
the users to do performance tuning for individual kernels.
Original file line number Diff line number Diff line change @@ -133,7 +133,7 @@ if (NOT AMReX_GPU_BACKEND STREQUAL NONE)
133
133
message ( STATUS " AMReX_GPU_BACKEND = ${AMReX_GPU_BACKEND} " )
134
134
135
135
# We might set different default for different GPUs in the future.
136
- set (AMReX_GPU_MAX_THREADS_DEFAULT "256 " )
136
+ set (AMReX_GPU_MAX_THREADS_DEFAULT "128 " )
137
137
set (AMReX_GPU_MAX_THREADS ${AMReX_GPU_MAX_THREADS_DEFAULT} CACHE STRING
138
138
"Maximum number of GPU threads per block" )
139
139
message ( STATUS " AMReX_GPU_MAX_THREADS = ${AMReX_GPU_MAX_THREADS} " )
Original file line number Diff line number Diff line change 269
269
endif
270
270
271
271
# Maximum number of GPU threads per block.
272
- CUDA_MAX_THREADS ?= 256
272
+ CUDA_MAX_THREADS ?= 128
273
273
GPU_MAX_THREADS ?= $(CUDA_MAX_THREADS)
274
274
275
275
ifeq ($(USE_CUDA),TRUE)
You can’t perform that action at this time.
0 commit comments