Skip to content

Improve performance profiling experience #640

@pranavm-nvidia

Description

@pranavm-nvidia

When we build the MLIR, we currently encode trace tensor names as the location attributes. We should instead embed the stack information with an option to include code snippets as well so that each kernel in an nsys trace will show us which line of Python code it originates from.

Metadata

Metadata

Assignees

No one assigned

    Labels

    tripyPull request for the tripy project

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions