Skip to content

Analyzing problems with HIP port #178

@markdewing

Description

@markdewing

Track investigations into the HIP port

Snapshot of the state is in #161

Tests with ROCm 4.1
Main 'hip' executable still fails with the same error

Now working:

  • prefixScan_t
  • eigenSoA_t

Still failing

  • radixSort_t
  • gpuVertexFinder_t

Broken differently - testEigenGPUNoFit returns an error rather than hangs

./testEigenGPUNoFit
TEST EIGENVALUES
TEST INVERSE 3x3
TEST INVERSE 4x4
TEST INVERSE 5x5
:0:rocdevice.cpp            :2325: 4804578096 us: Device::callbackQueue aborting with status: 0x1016
Aborted (core dumped)

New failure?

./testEigenGPU
...
testEigenGPU: /mnt/nvme/physics/hep/tracking/patatrack/pixeltrack-standalone/src/hip/test/test_common.h:24: bool isEqualFuzzy(C1, C2, double) [C1 = Eigen::Matrix<double, 2, 1, 0, 2, 1>, C2 = Eigen::Matrix<double, 2, 1, 0, 2, 1>]: Assertion `std::abs(a(i, j) - b(i, j)) < std::min(std::abs(a(i, j)), std::abs(b(i, j))) * epsilon' failed.
Aborted (core dumped)

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workinghip

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions