Add trace points in xrt classes #9734
Conversation
Force-pushed from c68c43a to ca9ca60
clang-tidy review says "All clean, LGTM! 👍"
stsoe left a comment
Why all these tracepoints? How about adding tracepoints strategically where we want to measure bottlenecks? Tracepoints are not meant to show function calls. The dtor tracepoints are completely useless; they measure nothing at all.
Okay Soren, I will make changes accordingly and keep tracepoints only at the needed places. Thanks
Hi @stsoe,
Thanks @chvamshi-xilinx. It sounds like you are adding trace points for debugging function calls; I don't think that is the intended purpose. My understanding is that tracepoints are there to locate bottlenecks. Yes, they are no-ops (at least on Linux), but my concern is the noise generated by adding uninteresting tracepoints to the trace log.
Got it. Thanks @stsoe |
Signed-off-by: rahul <rbramand@amd.com>
Force-pushed from ca9ca60 to 90dd121
Hi @stsoe, I have added trace points only at the places where we need to identify bottlenecks, as suggested. Please review. Thanks
clang-tidy review says "All clean, LGTM! 👍"
stsoe left a comment
When I look at all these tracepoints, I think they were added without much thought about what we want to measure. There are tracepoints in obsolete Alveo code, e.g. the update arg functions; why were they added?
I am dubious about the value of many of these tracepoints. Profiling of user-space code is not done through tracepoints, but with a profiler, e.g. valgrind.
In the past we have added tracepoints to expose bottlenecks through kernel module code paths. It makes a lot of sense to trace API overhead in UMD code paths that involve critical KMD code paths. Is this the thinking behind all the tracepoints added here? Otherwise, just use valgrind or some other profiling tool to optimize the code if we think there are bottlenecks.
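To make the distinction concrete, here is a minimal, self-contained sketch of "strategic" placement. trace_scope and submit_command are stand-ins invented for illustration; the real XRT_TRACE_POINT_SCOPE used in this PR presumably expands to platform tracepoints that cost nothing unless a tracer is attached, while this stand-in just prints. The point is that a scoped trace point brackets an interval worth measuring, such as a UMD call that enters a critical KMD code path, whereas a bare dtor tracepoint marks an instant and measures nothing.

#include <chrono>
#include <cstdio>

// Stand-in for XRT_TRACE_POINT_SCOPE (illustration only): emits an
// entry/exit pair so the interval in between can be measured.
struct trace_scope
{
  const char* name;
  std::chrono::steady_clock::time_point start = std::chrono::steady_clock::now();

  explicit trace_scope(const char* n) : name(n) {}

  ~trace_scope()
  {
    auto us = std::chrono::duration_cast<std::chrono::microseconds>
      (std::chrono::steady_clock::now() - start).count();
    std::printf("%s: %lld us\n", name, static_cast<long long>(us));
  }
};
#define XRT_TRACE_POINT_SCOPE(name) trace_scope trace_scope_##name{#name}

// Strategic placement: the scope brackets a UMD call that crosses into
// the kernel driver, which is where a bottleneck would actually show up.
// submit_command is a hypothetical stand-in.
void
submit_command()
{
  XRT_TRACE_POINT_SCOPE(xrt_hwqueue_submit);
  // ... the ioctl into the KMD would go here ...
}

int
main()
{
  submit_command();
  return 0;
}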
void
dump_uc_log_buffer()
{
  XRT_TRACE_POINT_SCOPE(xrt_hw_context_dump_uc_log_buffer);
Isn't it the dump() function you want to trace?
static std::shared_ptr<runlist_impl>
alloc_runlist(xrt::hw_context hwctx)
{
  XRT_TRACE_POINT_SCOPE(xrt_runlist_alloc);
What are we measuring? The constructor is trivial with 0 overhead. I understand why alloc_runlist was created: it was to ensure that you can measure the time it takes to create runlist_impl, which does all initialization in its initializer list. This makes sense if we truly want to measure the construction time, but do we?
The function is kind of like an observer of hwctx, where the sink is really the runlist_impl ctor. Shouldn't alloc_runlist take hwctx by const ref and leave the copying to the runlist_impl ctor?
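A minimal sketch of the suggested signature change; the types are from the diff above, but the body (a plain forward to make_shared) is an assumption:

static std::shared_ptr<runlist_impl>
alloc_runlist(const xrt::hw_context& hwctx)  // by const ref, not by value
{
  // The scope now brackets exactly the runlist_impl construction; the
  // single copy of hwctx happens in the runlist_impl initializer list.
  XRT_TRACE_POINT_SCOPE(xrt_runlist_alloc);
  return std::make_shared<runlist_impl>(hwctx);
}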
runlist::
add(const xrt::run& run)
{
  XRT_TRACE_POINT_SCOPE(xrt_runlist_add);
Why are we measuring here and not in the && overload? If we want to trace anything here, then it should be runlist_impl::add(); see the sketch after the execute() comment below.
runlist::
execute()
{
  XRT_TRACE_POINT_SCOPE(xrt_runlist_execute);
This wasn't added in this PR, but it probably shouldn't even be here; it should be in runlist_impl::execute().
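A sketch of the relocation this and the previous comment ask for: the trace points move into the impl methods that do the actual work, and the public runlist facade stays untraced. The method bodies are placeholders, not code from this PR:

void
runlist_impl::
add(const xrt::run& run)
{
  XRT_TRACE_POINT_SCOPE(xrt_runlist_add);
  // ... validate the run object and append it to the list ...
}

void
runlist_impl::
execute()
{
  XRT_TRACE_POINT_SCOPE(xrt_runlist_execute);
  // ... submit the command list for execution ...
}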
Problem solved by the commit
Added new trace points in xrt classes and their member functions
Bug / issue (if any) fixed, which PR introduced the bug, how it was discovered
It's an EOU/Enhancement change
How problem was solved, alternative solutions (if any) and why they were rejected
Added trace points in xrt object alloc functions (with _ctor in the name) and in impl destructors (with _dtor in the name); an illustrative example follows.
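A hypothetical instance of this naming convention; bo_impl and alloc_bo are stand-ins, not names from this PR:

static std::shared_ptr<bo_impl>
alloc_bo(size_t size)
{
  XRT_TRACE_POINT_SCOPE(xrt_bo_ctor);   // alloc function: _ctor suffix
  return std::make_shared<bo_impl>(size);
}

bo_impl::
~bo_impl()
{
  XRT_TRACE_POINT_SCOPE(xrt_bo_dtor);   // impl destructor: _dtor suffix
}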
Risks (if any) associated with the changes in the commit
Low
What has been tested and how, request additional testing if necessary
Tested with the perf tool on ve2 and used a Python script to pretty-print the statistics; it works as expected
Documentation impact (if any)
NA