Skip to content

v0.67.0-dev20260206

Pre-release
Pre-release

Choose a tag to compare

@github-actions github-actions released this 07 Feb 03:50
· 264 commits to main since this release
Immutable release. Only release title and notes can be modified.
0474b44

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/21734080303

📦 Uncategorized

  • Topology Mapper Pinning Regression Tests
  • Remove deprecated Grayskull (tt::ARCH::GRAYSKULL) architecture support
  • latency result superset export
  • [BEVFormer] Update PCC
  • SDXL Refiner Matmul memory configs optimization
  • Fix conv2d reader kernel runtime arg mismatch for height-sharded conv
  • [TT-Transformers] Reduce batch-32 prompt length to avoid some tokenizers going over 1024 tokens
  • SDXL override global timeot
  • Remove unused type declarations in tt_metal identified by clangd
  • [skip ci] Reduce CMake install message verbosity for incremental builds
  • #36225: Handle Binary_op_type for mixed dtypes - FPU for EQ
  • SDXL disable timeout
  • add per core compile args
  • Add fused post sdpa op
  • #28532: Add Installer validation as a CI workflow
  • Add time budget controls for Galaxy frequent pipeline -> now Galaxy integration pipeline
  • [skip ci] fix(copilot-autofix-clangsa): fix broken pipe error in jq query
  • Improve custom_mm to performantly cover more shapes and enable transpose
  • [skip ci] Add workflow comparison script for CI analysis
  • [tt-train] TP+DP Llama training
  • #23354 more data type support for llk bcast
  • Fix noc debugging tool test when run back to back
  • [skip ci] Increase timeouts for longer running BH multicard model tests
  • Fix hard-coded action hash causing CI errors
  • Add versioning system to fabric telemetry
  • Increase hang detection timeout for data movement tests
  • #0 - Tests scripts update
  • [deepseek] Fix test_model decode reference for non‑zero position ids
  • Lower Tensor Utilities to Runtime Staging Area
  • Add commands to do packed large linear reads/unicast writes
  • [skip ci] Add pytest timeout flags to long-running model tests in CI
  • Add new all_to_all_dispatch variant for DeepSeek that supports multiple algorithms, fabric mux variants, and persistent buffer/semaphore optimizations
  • Add scattered core support for gather operation
  • Support for local tile reduce using DST accum
  • Add support for 'export TT_METAL_DISABLE_SFPLOADMACRO=1'
  • Remove unused types from ttnn
  • Optimized number of workers for ReduceScatterMinimalMatmul for Llama 70B on Galaxy
  • Extend CB tests