Skip to content

[WIP] common: verbose: asynchronous verbose mode for execution time tracking #3055

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

avmanerikar
Copy link
Contributor

@avmanerikar avmanerikar commented Apr 9, 2025

WORK IN PROGRESS (PROTOTYPE) - DO NOT REVIEW

Description
This PR proposes a prototype for introducing an asynchronous verbose mode to accurately track kernel execution times for GPU engines. For the verbose mode, retrieving the kernel timing causes significant overhead as it requires the GPU kernel execution to be synchronized and also because it is tracked on the host.
The asynchronous mode uses event-based tracking to measure the GPU kernel execution to avoid the synchronization issue.
The prototype is created for SYCL/OCL GPU APIs that provide the kernel execution statistics for profiling.

Addresses MFDNN-12088.

Checklist

  • Have you published an RFC for the new feature?
  • Was the RFC approved?
  • Have you added relevant tests?

@avmanerikar avmanerikar requested review from a team as code owners April 9, 2025 17:50
@github-actions github-actions bot added documentation A request to change/fix/improve the documentation. Codeowner: @oneapi-src/onednn-doc platform:gpu-generic Codeowner: @oneapi-src/onednn-gpu-generic component:api Codeowner: @oneapi-src/onednn-arch component:build labels Apr 9, 2025
@avmanerikar avmanerikar marked this pull request as draft April 9, 2025 17:52
@avmanerikar avmanerikar force-pushed the amanerik/main/async-verbose-mode branch from 25b0638 to bf1e8d1 Compare April 9, 2025 18:00
@avmanerikar avmanerikar force-pushed the amanerik/main/async-verbose-mode branch from bf1e8d1 to 625eec4 Compare April 28, 2025 17:45
@github-actions github-actions bot added platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel component:common labels Apr 28, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component:api Codeowner: @oneapi-src/onednn-arch component:build component:common documentation A request to change/fix/improve the documentation. Codeowner: @oneapi-src/onednn-doc platform:gpu-generic Codeowner: @oneapi-src/onednn-gpu-generic platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant