Commit 2d52709

Improve docs
1 parent 3592588 commit 2d52709

1 file changed: +5 -4 lines changed

src/nhs_precomputed.jl

Lines changed: 5 additions & 4 deletions
@@ -53,10 +53,11 @@ to strip the internal neighborhood search, which is not needed anymore.
 By default, the neighbors of each point are stored contiguously
 in memory. This layout optimizes cache hits when looping
 over all neighbors of a point on CPUs.
-On GPUs, however, storing the first neighbors of all points
-contiguously in memory (`transpose_backend = true`)
-allows for coalesced memory accesses when all threads process
-the n-th neighbor of their respective point in parallel.
+On GPUs, however, storing all first neighbors of all points
+contiguously in memory, then all second neighbors, etc.,
+(`transpose_backend = true`) allows for coalesced
+memory accesses when all threads process the n-th neighbor
+of their respective point in parallel.
 This can lead to a speedup of ~3x in many cases.
 """
 struct PrecomputedNeighborhoodSearch{NDIMS, NL, ELTYPE, PB, NHS} <:
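
The two memory layouts described in the docstring can be illustrated with a small sketch. This is not the package's actual backend: the names `neighbor`, `default_layout`, and `transposed_layout` are hypothetical, and plain dense matrices stand in for whatever storage the real implementation uses. It only relies on the fact that Julia arrays are stored column-major.

```julia
# Hypothetical illustration only; not the package's actual data structure.
n_points = 4
max_neighbors = 3

# Made-up neighbor IDs: the n-th neighbor of point i is 10i + n.
neighbor(i, n) = 10 * i + n

# Default layout: one column per point. Because Julia arrays are
# column-major, all neighbors of one point are adjacent in memory.
default_layout = [neighbor(i, n) for n in 1:max_neighbors, i in 1:n_points]

# Transposed layout: one column per neighbor index, so the n-th neighbors
# of all points are adjacent in memory.
transposed_layout = permutedims(default_layout)

# Flatten both matrices to show the order in which entries sit in memory.
default_memory = vec(default_layout)
transposed_memory = vec(transposed_layout)

println(default_memory)     # neighbors of point 1, then of point 2, ...
println(transposed_memory)  # first neighbors of all points, then second, ...
```

In the transposed layout, if thread `i` reads the n-th neighbor of point `i`, consecutive threads read consecutive memory locations, which is the access pattern the docstring refers to as coalesced.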
