Skip to content

[BUG] Extra cuLibraryLoadData calls when LIBCUDF_HOST_DECOMPRESSION=AUTO #18482

Open
@GregoryKimball

Description

@GregoryKimball

Describe the bug
I was surprised to see extra calls to cuLibraryLoadData just before the first decompress_page_data range when LIBCUDF_HOST_DECOMPRESSION=AUTO is enabled. This happens even when CUDA_MODULE_LOADING=EAGER.

This load library region does not happen when LIBCUDF_HOST_DECOMPRESSION is unset or LIBCUDF_HOST_DECOMPRESSION=ON. In the PDS benchmark it adds perhaps 25 ms per query.

Image

The library loading seems to be in sorting the blocks I guess SortPairsDescending.

Image

Expected behavior
All the cuLibraryLoadData calls should be in the beginning when CUDA_MODULE_LOADING=EAGER

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions