Skip to content

Commit 584f48a

Browse files
Remove warm-up invocations outside of launcher in examples/throughout and auto_throughput
1 parent d8b0acc commit 584f48a

File tree

2 files changed

+0
-14
lines changed

2 files changed

+0
-14
lines changed

python/examples/auto_throughput.py

Lines changed: 0 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -58,13 +58,6 @@ def throughput_bench(state: nvbench.State) -> None:
5858

5959
krn = make_throughput_kernel(ipt)
6060

61-
# warm-up call ensures that kernel is loaded into context
62-
# before blocking kernel is launched. Kernel loading may cause
63-
# a synchronization to occur.
64-
krn[blocks_in_grid, threads_per_block, alloc_stream, 0](
65-
stride, elements, inp_arr, out_arr
66-
)
67-
6861
def launcher(launch: nvbench.Launch):
6962
exec_stream = as_cuda_stream(launch.get_stream())
7063
krn[blocks_in_grid, threads_per_block, exec_stream, 0](

python/examples/throughput.py

Lines changed: 0 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -59,13 +59,6 @@ def throughput_bench(state: nvbench.State) -> None:
5959

6060
krn = make_throughput_kernel(ipt)
6161

62-
# warm-up call ensures that kernel is loaded into context
63-
# before blocking kernel is launched. Kernel loading may
64-
# cause synchronization to occur.
65-
krn[blocks_in_grid, threads_per_block, alloc_stream, 0](
66-
stride, elements, inp_arr, out_arr
67-
)
68-
6962
def launcher(launch: nvbench.Launch):
7063
exec_stream = as_cuda_stream(launch.get_stream())
7164
krn[blocks_in_grid, threads_per_block, exec_stream, 0](

0 commit comments

Comments
 (0)