xiaofeihan1 (Contributor) commented Jan 19, 2026

Background

Previously, developers could only enable profiling through genai_config.json, which profiles the entire lifetime of a session—from session creation to session destruction. When the number of runs is large (for example, generating ~3000 tokens during inference), the profiling output can become excessively large, causing profiling to fail due to oversized files.

2025-12-17 11:17:36.496 Python[9861:2179219] 2025-12-17 11:17:36.493043 [E:onnxruntime:onnxruntime-genai, profiler.cc:93 EndTimeAndRecordEvent] Maximum number of events reached, could not record profile event.

Description

This PR addresses this limitation by introducing run-level profiling. We expose enable_profiling in RuntimeOptions. Developers can now enable profiling for specific runs only.

With the following code, two profiling JSON files will be generated using the run_profiler_file prefix:
• one containing profiling data for the 100th run
• another containing profiling data for the 101st run

while not generator.is_done():
    # Enable run-level profiling only for the runs of interest;
    # pass "0" to keep profiling disabled for all other runs.
    if len(new_tokens) in (100, 101):
        generator.set_runtime_option("enable_profiling", "run_profiler_file")
    else:
        generator.set_runtime_option("enable_profiling", "0")
    generator.generate_next_token()
    new_token = generator.get_next_tokens()[0]
    new_tokens.append(new_token)
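The per-run files can then be examined offline. As a rough sketch (assuming the per-run output follows the Chrome trace event format that ONNX Runtime session profiling emits, i.e. a JSON array of events with "name" and "dur" fields; `summarize_profile` is a hypothetical helper, not part of this PR):

```python
import json
from collections import defaultdict

def summarize_profile(path):
    """Sum the recorded duration (microseconds) per event name in a
    profile file written in the Chrome trace event format: a JSON
    array of event objects carrying "name" and "dur" fields."""
    with open(path) as f:
        events = json.load(f)
    totals = defaultdict(int)
    for event in events:
        if "dur" in event:  # metadata events may lack a duration
            totals[event["name"]] += event["dur"]
    # Return names ordered from most to least time spent.
    return dict(sorted(totals.items(), key=lambda kv: -kv[1]))
```

Comparing the summaries of the two files would show, for example, whether a particular operator dominates one run but not the other.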

@xiaofeihan1 xiaofeihan1 marked this pull request as ready for review January 27, 2026 15:01
choco uninstall llvm --yes
python -m pip install "numpy<2" coloredlogs flatbuffers packaging protobuf sympy pytest
python -m pip install onnxruntime-qnn
python -m pip install onnxruntime-qnn==1.25.0.dev20260126001 -i https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ORT-Nightly/pypi/simple/

Do we need to specify a version here or would not listing a version and using the latest nightly package be sufficient?

  displayName: 'OnnxRuntime version'
  type: string
- default: '1.23.0'
+ default: '1.25.0-dev-20260125-1205-727db0d3dc'

These are default values for publishing the official packages for ORT GenAI. I think we should keep the defaults as a stable version of ORT. We can always override what version of ORT to use when the packages are built.

} else if (strcmp(value, "1") == 0) {
  run_options_->EnableProfiling(ORT_TSTR("onnxruntime_run_profile"));
} else {
  auto ToProfileString = [](const char* s) -> std::basic_string<ORTCHAR_T> {

  1. Why is converting a char* to basic_string<ORTCHAR_T> before going back to a char* needed?
  2. Can you add a comment here to explain that this else condition handles a custom prefix for the profile file?
  3. Can the else if and else conditions be merged, since they both enable profiling?
