Hi team,
It looks like LibKineto can enable GPU hardware counters (events) and/or GPU metrics, but how this integrates with the PyTorch profiler is not clear to me. I can see potential uses of the CUDA metrics in kDerivedMetrics and in EventProfilerTest, and their initialization in EventProfiler::initMetrics...
So my questions are:
- Is there any documentation on how to use LibKineto CUPTI events and metrics within the PyTorch profiler?
- If integration with the PyTorch profiler isn’t the intended use case, what are the main applications for these metrics?
- Can LibKineto produce Chrome traces that include GPU events and/or metrics?
I could not find any examples of the PyTorch profiler using GPU counter events, which suggests there may be integration challenges. Clarifying these points would really help me understand how to leverage LibKineto’s capabilities, especially because I am trying to extend LibKineto to support metrics from a different AI hardware accelerator.