Skip to content

Integration of LibKineto CUpti Events and Metrics with PyTorch Profiler #973

Open
@marceloamaral

Description

Hi team,

It looks like LibKineto can enable GPU hardware counters (Events) and/or GPU Metrics, but the integration with PyTorch profiler is not clear for me. I can see the potential use of the CUDA metrics in kDerivedMetrics, in EventProfilerTest and its initialization in EventProfiler::initMetrics...

So my questions are:

  • Is there any documentation on how to use LibKineto CUpti events and metrics within the PyTorch profiler?
  • If integration with the PyTorch profiler isn’t the intended use case, what are the main applications for these metrics?
  • Can LibKineto produce Chrome traces that include GPU events and/or metrics?

I could not find examples of PyTorch profiler using GPU counter events, which suggests potential integration challenges. Clarifying these points would really help in understanding how to leverage LibKineto’s capabilities. Specially because I am trying to extend LibKineto to support metrics of a different AI hardware accelerator.

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions