-
Notifications
You must be signed in to change notification settings - Fork 23
Open
Description
Hello,
I am looking to use this library in a project so I just compiled it and ran the test dsscfg program and it failed with what looks like an out-of-bounds write to a global memory array somewhere in the matupd::kernel20 kernel function.
This is on a GeForce RTX 2060 Super (CC 7.5) with CUDA 11.2.
$ ./bin/dsscfg
Begin testing DSSCFG on the CPU (double precision)
CPU iteration 0 F: -0.349732
CPU iteration 0 F: 0.0192875
CPU iteration 0 F: -0.496264
CPU iteration 100 F: -0.888434
CPU iteration 200 F: -1.00093
CPU iteration 300 F: -1.01115
CPU iteration 400 F: -1.01126
CPU iteration 500 F: -1.01129
Timing: 9.94557 ms / iteration
Begin testing DSSCFG with CUDA (double precision)
CUDA iteration 0 F: -0.349732
CUDA iteration 0 F: -0.199307
CUDA iteration 0 F: -0.496264
lbfgsb failure: 700, /home/alex/lbfgsb-gpu/culbfgsb/./cauchy.cu, 594
Here's the cuda-memcheck output:
========= Invalid __global__ write of size 8
========= at 0x000003d0 in void lbfgsbcuda::cuda::matupd::kernel20<int=256, double>(int, int, int, int, int, int, double const *, lbfgsbcuda::cuda::matupd::kernel20<int=256, double>*, double const )
========= by thread (0,0,0) in block (135,0,0)
========= Address 0x7f7999e22038 is out of bounds
========= Device Frame:void lbfgsbcuda::cuda::matupd::kernel20<int=256, double>(int, int, int, int, int, int, double const *, lbfgsbcuda::cuda::matupd::kernel20<int=256, double>*, double const ) (void lbfgsbcuda::cuda::matupd::kernel20<int=256, double>(int, int, int, int, int, int, double const *, lbfgsbcuda::cuda::matupd::kernel20<int=256, double>*, double const ) : 0x3d0)
========= Saved host backtrace up to driver entry point at kernel launch time
========= Host Frame:/usr/lib/x86_64-linux-gnu/libcuda.so.1 (cuLaunchKernel + 0x2b8) [0x222dd8]
========= Host Frame:./bin/dsscfg [0x2136b]
========= Host Frame:./bin/dsscfg [0x6de20]
========= Host Frame:./bin/dsscfg [0xd0c25]
========= Host Frame:./bin/dsscfg [0x9b098]
========= Host Frame:./bin/dsscfg [0x9e633]
========= Host Frame:./bin/dsscfg [0x102a4]
========= Host Frame:./bin/dsscfg [0x9db6]
========= Host Frame:/lib/x86_64-linux-gnu/libc.so.6 (__libc_start_main + 0xf3) [0x270b3]
========= Host Frame:./bin/dsscfg [0xdf4e]
=========
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels