Implementation of frontier primitive from SYGraph #3289

antonio-decaro · 2025-07-09T12:08:02Z

Description

This PR introduces support for integrating SYgraph, a heterogeneous graph analytics framework, into oneDAL.

In graph processing, a frontier represents the active subset of vertices currently being processed during an iteration of a graph algorithm (e.g. Breadth First Search, Single Source Shortest Path).
To operate over the frontier efficiently, SYgraph introduces a set of composable, GPU-parallel graph operators, each designed to process the active vertex set in a scalable, data-driven manner. These operators are invoked over frontiers and graphs using user-defined lambda functions and include:

Advance (edge operation): Traverses edges from the current frontier to generate a new set of active vertices (the next frontier). For each vertex in the input frontier, it inspects neighbors and uses a lambda to decide which should be added to the output.
Compute (vertex operation): Applies a computation to each vertex in the frontier — typically to update vertex properties.
Filter: Refines a frontier by selecting only the elements that satisfy a given condition.

This PR aims to implement the concept of a Two-Layer Bitmap Frontier (see the SYgraph paper for more details), the primitive operators (advance, filter, compute), and a load-balancing mechanism to evenly distribute computation across all GPU compute units.

Detailed Overview

Changes are contained in the cpp/oneapi/dal/backend/primitives/frontier folder.

bitset.hpp
Templated bitset implemented as an array of integer words where each bit encodes a vertex state (1 = active, 0 = inactive). Serves as the low-level building block for the frontier.
frontier.hpp/frontier_dpc.cpp
Two-level bitmap frontier representing vertex activity for graph algorithms. Exposes core operations:
- insert a vertex into the frontier
- test/contains a vertex
- check whether the frontier is empty
- precompute an offsets buffer to improve workload distribution
  Implementation details and device-specific DPC++ code live in frontier_dpc.cpp to keep headers lightweight.
advance.hpp
Templated advance primitive that accepts a callable (e.g., a lambda) executed for every newly discovered vertex during an advance step. Includes a workload-balancing strategy to improve resource utilization during the advance.
graph.hpp
A small graph interface used by SYgraph: a non-owning CSR view that enables generic graph operations without taking ownership of the underlying memory.

Testing

See test/ for unit tests covering basic operations and advance workflows.
Tests focus on correctness of bit manipulations, frontier semantics (insert/test/empty), offsets computation, and the advance callable invocation for newly discovered vertices.

Notes for reviewers

The design is template-first to remain flexible across integer/index types and host/device execution.
advance.hpp must remain a header-only implementation and cannot be split into a header + precompiled implementation file. It accepts user-provided callables (typically lambdas) that must be instantiated at each call site at compile time; these callables (and the device kernels that use them) cannot be precompiled into a separate translation unit. Keeping advance.hpp header-only ensures the lambda is correctly compiled/instantiated for both host and device code.

PR should start as a draft, then move to ready for review state after CI is passed and all applicable checkboxes are closed.
This approach ensures that reviewers don't spend extra time asking for regular requirements.

You can remove a checkbox as not applicable only if it doesn't relate to this PR in any way.
For example, PR with docs update doesn't require checkboxes for performance while PR with any change in actual code should have checkboxes and justify how this code change is expected to affect performance (or justification should be self-evident).

Checklist to comply with before moving PR from draft:

PR completeness and readability

I have reviewed my changes thoroughly before submitting this pull request.
I have commented my code, particularly in hard-to-understand areas.
I have updated the documentation to reflect the changes or created a separate PR with update and provided its number in the description, if necessary.
Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details).
I have added a respective label(s) to PR if I have a permission for that.
I have resolved any merge conflicts that might occur with the base branch.

Testing

I have run it locally and tested the changes extensively.
All CI jobs are green or I have provided justification why they aren't.
The failures in the internal CI are unrelated to these changes.
The status of the added tests:

I have extended testing suite if new functionality was introduced in this PR.

Performance

I have measured performance for affected algorithms using scikit-learn_bench and provided at least summary table with measured data, if performance change is expected.
I have provided justification why performance has changed or why changes are not expected.
I have provided justification why quality metrics have changed or why changes are not expected.
I have extended benchmarking suite and provided corresponding scikit-learn_bench PR if new measurable functionality was introduced in this PR.

- Removed the existing frontier_dpc.hpp file to streamline the codebase.
- Introduced new test files for advance operation, BFS, and basic frontier operations.
- Implemented comprehensive tests to validate the functionality of the frontier data structure.
- Enhanced the frontier class with additional methods for better performance and usability.
- Ensured compatibility with SYCL and improved device memory management.

cpp/oneapi/dal/backend/primitives/frontier/BUILD

cpp/oneapi/dal/backend/primitives/frontier/advance_dpc.hpp

cpp/oneapi/dal/backend/primitives/frontier/advance.hpp

cpp/oneapi/dal/backend/primitives/frontier/frontier_dpc.cpp

avolkov-intel · 2025-09-18T10:49:24Z

cpp/oneapi/dal/backend/primitives/frontier/test/advance_dpc.cpp

+void compare_frontiers(T& device_frontier, std::vector<uint32_t>& host_frontier, size_t num_nodes) {
+    for (size_t i = 0; i < num_nodes; ++i) {
+        bool tmpd = device_frontier.check(i);
+        bool tmph = std::find(host_frontier.begin(), host_frontier.end(), i) != host_frontier.end();


host_frontier should be sorted by construction, so instead of std::find we can keep a second index that points at the current position in host_frontier. This reduces the complexity from O(n^2) to O(n), which can save testing time if we want to check large frontiers.

I used a boolean map to record whether each node is inside the frontier, so the complexity of compare_frontiers is now O(n).

…ate and introduce hierarchical reductions

…ame test file

…ble declarations in BitmapKernel and frontier_dpc implementations

…sentation and update test case names for clarity

…t32_t> for improved performance and memory efficiency

…header to BUILD

…ne helper signatures

Vika-F · 2025-10-02T12:07:33Z

/intelci: run

Copilot

Pull Request Overview

This PR introduces support for SYgraph, a heterogeneous graph analytics framework, by implementing frontier-based graph processing primitives in oneDAL. The implementation provides GPU-parallel graph operators for processing active vertex sets using composable operations.

Key changes include:

Implementation of Two-Layer Bitmap Frontier for tracking active vertices in graph algorithms
Core primitives: advance (edge traversal), compute (vertex operations), and filter operations
Load-balancing mechanism for efficient GPU compute unit utilization

Reviewed Changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 4 comments.

| File | Description |
| --- | --- |
| `cpp/oneapi/dal/backend/primitives/frontier/bitset.hpp` | Templated bitset implementation using integer word arrays for vertex state encoding |
| `cpp/oneapi/dal/backend/primitives/frontier/frontier.hpp` | Two-level bitmap frontier interface with core operations (insert, test, empty, offsets) |
| `cpp/oneapi/dal/backend/primitives/frontier/frontier_dpc.cpp` | Device-specific DPC++ implementation of frontier operations |
| `cpp/oneapi/dal/backend/primitives/frontier/advance.hpp` | Templated advance primitive with workload balancing for edge traversal |
| `cpp/oneapi/dal/backend/primitives/frontier/graph.hpp` | Non-owning CSR graph view interface for generic graph operations |
| `cpp/oneapi/dal/backend/primitives/frontier/test/*.cpp` | Unit tests for frontier operations, advance workflows, and BFS implementation |
| `cpp/oneapi/dal/backend/primitives/frontier/test/utils.hpp` | Test utilities for random graph generation and device information |
| `cpp/oneapi/dal/backend/common.hpp` | Added device_max_sg_count function for subgroup query support |
| Build and module configuration files | Integration of frontier module into build system |

cpp/oneapi/dal/backend/primitives/frontier/advance.hpp

Copilot · 2025-10-10T11:37:06Z

cpp/oneapi/dal/backend/primitives/frontier/frontier_dpc.cpp

+    auto offsets_pointer = _offsets.get_mutable_data() + 1;
+
+    const uint32_t element_bitsize = bitmap.get_element_bitsize();
+    const size_t local_range = 256; // propose_wg_size(this->_queue);


The hard-coded value 256 should be replaced with the commented function call propose_wg_size(this->_queue) or made configurable to avoid magic numbers.

Suggested change

-    const size_t local_range = 256; // propose_wg_size(this->_queue);
+    const size_t local_range = propose_wg_size(this->_queue);

cpp/oneapi/dal/backend/primitives/frontier/test/advance_dpc.cpp

cpp/oneapi/dal/backend/primitives/frontier/test/utils.hpp

cpp/oneapi/dal/backend/primitives/frontier/advance.hpp

Vika-F · 2025-10-10T11:40:40Z

cpp/oneapi/dal/backend/primitives/frontier/frontier.hpp

+
+private:
+    sycl::queue& _queue;
+    size_t _num_items;


The common convention in oneDAL is to use the std::int64_t type for data sizes.

Is it still fine if I use std::uint64_t for size_t types instead?

cpp/oneapi/dal/backend/primitives/frontier/bitset.hpp

cpp/oneapi/dal/backend/primitives/frontier/advance.hpp

cpp/oneapi/dal/backend/primitives/frontier/frontier.hpp

…y frontier_context API

…r_dpc.cpp

…nd loops

…tting

…zation

antonio-decaro added 13 commits July 1, 2025 19:56

feat: add bitset and frontier classes with atomic operations and tests

f16d4d6

feat: enhance frontier class with additional operations and tests; re…

3d19912

…move obsolete test header

feat: enhance frontier class with additional methods and tests for ch…

52d1098

…eck and clear operations

insert compunte active kernel

df75bdf

add compute active frontier operation

6dbbc35

bug fix

98bdf4f

refactor: rename clear and gas methods to unset and atomic_unset for …

246466c

…consistency; update related functionality; fixed bug for SIGSEGV

feat: add advance operation tests for frontier class; include device …

bb4d6a4

…name printing and frontier checks

fixed advance operator

66008a3

implemented async advance

d7f39ff

add documentation for frontier

542022a

add performance tests for bfs

7a8d80b

avolkov-intel reviewed Jul 29, 2025

cpp/oneapi/dal/backend/primitives/frontier/BUILD (outdated, resolved)

avolkov-intel reviewed Jul 29, 2025

cpp/oneapi/dal/backend/primitives/frontier/advance_dpc.hpp (outdated, resolved)

antonio-decaro added 8 commits August 27, 2025 16:21

Refactor advance function to remove expected_size parameter and optim…

f12297f

…ize global size calculation

moved bitset to frontier folder

aa997e1

applied clang-format

c476c4b

added trailing new line to source files to complie with the CI format…

12393aa

…ter.

update headers to comply with the oneDAL naming style

17e191d

primitives/frontier: remove unused dependencies from BUILD

08944cc

primitives/frontier: small code cleanup in advance.hpp (comment remov…

4632056

…al and add explanatory comment)

primitives/frontier: remove unused includes and stray blank lines in …

e0a9060

…headers, sources and tests