Skip to content

Releases: tenstorrent/tt-metal

v0.66.0-dev20251230

31 Dec 00:32
b628853

Choose a tag to compare

v0.66.0-dev20251230 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/20585723919

📦 Uncategorized

  • Add missing tensix_neo_reg.h for quasar
  • Bilinear upsample sharding restriction and optimizations
  • [skip ci] readability-avoid-unconditional-preprocessor-if
  • #0: Update clip encoder margin
  • Add option to disable progress bar in tt-triage
  • #33539: Add uint32 and uint16 support for rsub
  • Fix RMSNorm test to generate reference IO on-the-fly
  • #0: disable llama3 from single card demo tests
  • Add compute kernel API for stochastic rounding
  • PDL: Move matmul configs to model_configs.py
  • Sampling - Update Docs, Create Example
  • [UNET] Bumping the average kernel samples per second threshold
  • Modernize use starts ends with
  • Add vector to nanobind
  • Enable hang detection on llk tests
  • 34951: use output mem config for compute_output_specs in generic reduce
  • Moving more tests to CPU only

v0.65.1-rc12

31 Dec 00:42

Choose a tag to compare

v0.65.1-rc12 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/20585736852

📦 Uncategorized

  • Fix batched prefill pcc issue
  • Llama-3.1-8B decode TSU optimizations

v0.66.0-dev20251229

29 Dec 07:48
2b69650

Choose a tag to compare

v0.66.0-dev20251229 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/20561689015

📦 Uncategorized

  • Fix model trace sweep tests
  • [skip ci] cleanup cmake presets
  • Add LLK and Compute API for pack_rows operation
  • chore: update LLK submodule to 55896df

v0.65.1-rc11

29 Dec 16:18
0c5e982

Choose a tag to compare

v0.65.1-rc11 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/20573786687

📦 Uncategorized

  • Remove prefetcher dangling reference from previous test

v0.66.0-dev20251228

28 Dec 07:47
c4dc6ac

Choose a tag to compare

v0.66.0-dev20251228 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/20546330059

📦 Uncategorized

v0.65.1-rc10

28 Dec 02:29
dcd7f85

Choose a tag to compare

v0.65.1-rc10 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/20546342097

📦 Uncategorized

  • Add prefill sampling support to TTT models

v0.66.0-dev20251227

27 Dec 07:42
b16a0d9

Choose a tag to compare

v0.66.0-dev20251227 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/20531742521

📦 Uncategorized

  • chore: update LLK submodule to eab2948
  • Remove pybind11. Nanobind is stable.
  • Move PDL to 110 cores BH P150
  • [UMD Bump] Automated UMD Bump 25.12.2025
  • [tt-triage] Additional script info
  • Skipping SGD cpp test with segmentation fault until it gets resolved
  • Skipping bad pcc test_split.py breaking L2 Nightly until it gets resolved
  • SDXL refiner and img_to_img seed issue fix
  • Fixing program selection error in untilize migration to fix test_slice failure
  • [skip ci] Fix Galaxy Quick hard coded test params

v0.66.0-dev20251226

26 Dec 07:46
f649156

Choose a tag to compare

v0.66.0-dev20251226 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/20512964236

📦 Uncategorized

  • Finish moving hard coded params to nanobind
  • chore: update LLK submodule to 282cea4
  • Fix conv1d to use width dimension instead of height for 1D convolution
  • [CONV] Fix null check and memory validation in ConvTranspose2d DRAM path
  • SDXL CI encoder perf
  • chore: update LLK submodule to e5d5906
  • Remove N300 mistral7b test from demo tests
  • Enable hang detection and calling tt-triage on all CI workflows
  • Auto slicing in VAE module
  • Reserving 16 bytes for debug bus atomic reading
  • chore: update LLK submodule to a34968b
  • docs: update dprint.h include path in documentation
  • Revert "Reserving 16 bytes for debug bus atomic reading (#35029)"
  • Update to overlay register map from
  • chore: update LLK submodule to c9aeed6

v0.65.1-rc9

26 Dec 06:43

Choose a tag to compare

v0.65.1-rc9 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/20512973248

  • no changes

v0.66.0-dev20251225

25 Dec 07:40
e0c41fa

Choose a tag to compare

v0.66.0-dev20251225 Pre-release
Pre-release

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/20496087066

📦 Uncategorized

  • add explicit torch dtype
  • #34990: [skip ci] Disable hanging llama3 galaxy quick prefill and decode tests while team deals with other fires
  • Add 3DETR Model to TTNN
  • Add OpenVLA model to ttnn
  • #34993: Add initial cpu-only fabric tests job in merge gate
  • Support non-tile-aligned widths in LayerNorm (interleaved), and use dest registers more effectively
  • [skip ci] Enhance codeowners-group-analysis workflow
  • Python Multi-Mesh APIs and Programming Example
  • Use optimised fp32→bf16 typecast with RNE; fp32_to_[u]int32; uint16_to_uint32.
  • Fix start_tile_id when work is split over batches in MatmulMultiCoreConfig
  • chore: update LLK submodule to 38295fe
  • Reduce code duplication in Panoptic DeepLab demo and E2E test.
  • TT-Triage Fix dump_running_operations.py to use ElfVariable for watcher_kernel_id
  • [CONV] Defaulting to fp32 dest accumulation in conv ops if inputs are FP32
  • [skip ci] ci: allow specifying architecture for L2 tests
  • [skip ci] ci: update handling of different architectures when starting L2 nightly jobs via LLK uplift workflow
  • Improve error when attempting to initialize in-use device.
  • [skip ci] ci: parametrize device perf tests with architecture parameter
  • Fix I2S corruption issue
  • readability-make-member-function-const
  • Add module level device fixture
  • Fix pytensor ownership
  • TT-Switch MeshDevice API fix and and Distributed Context barrier
  • Unify LLK HW Configs
  • Split DeepSeek V3 tests into separate unit and module test jobs
  • Fix silu initialization mismatch causing PCC errors in Stable Diffusion
  • SDXL CI fix: pcc
  • SDXL CI fix: perf
  • Pool2D Fix for openpdn_mnist