Skip to content

Conversation

@umar456
Copy link
Contributor

@umar456 umar456 commented Jan 7, 2026

This pull request introduces support for a new "grouped micro GEMM" (General Matrix Multiply) implementation, along with corresponding kernel code, and enhancements to the testing utilities to support grouped memory.

@github-actions github-actions bot added platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel component:tests Codeowner: @oneapi-src/onednn-arch component:common labels Jan 7, 2026
@@ -0,0 +1,168 @@

#include "dnnl_test_common.hpp"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what is missing from benchdnn, so that this test is required..?

Copy link
Contributor Author

@umar456 umar456 Jan 7, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nothing is missing. I want more control over how the data is initialized so that its easier to test/debug new configurations. I will probably remove this in the future.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I want more control over how the data is initialized

Could you clarify, so that I could possibly make it better/easier?

I will probably remove this in the future

I agree, we would need this removed prior merging if it is covered by benchdnn..

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's difficult to initialize the input data with known values to identify incorrect results. For example I want to find out why I am getting an incorrect value at a certain location in the destination buffer. With my own tests I can make my inputs to be known values to debug. I don't know of a way to do that with benchdnn.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does --buffer-prefix work?

@mzhukova mzhukova force-pushed the mzhukova/main/poc-grouped-mem branch 2 times, most recently from be4fed9 to 0ea335d Compare January 10, 2026 01:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

component:common component:tests Codeowner: @oneapi-src/onednn-arch platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants