Skip to content

v0.67.0-dev20260207

Pre-release
Pre-release

Choose a tag to compare

@github-actions github-actions released this 08 Feb 00:52
· 224 commits to main since this release
Immutable release. Only release title and notes can be modified.
430d1f4

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/21770820481

📦 Uncategorized

  • [skip ci] Update vLLM nightly to test sampling
  • Fuse TP/SP Broadcast into pre_sdpa
  • [skip ci] set-cpu-governor VM handling
  • Move tt_dit out of experimental directory
  • chore: update LLK submodule to e050aab
  • #36149: Add native llk kernel for addcdiv
  • Fix Clang Static Analyzer warning: virtual call in GraphProcessor constructor
  • [gpt-oss] attention decode optimizations
  • Fix Blackhole op performance model FPU and DRAM utilization calculations
  • [Gemma3] Test fix: Ref MLP uses float32 for long sequences
  • Fix undefined behavior in fabric worker memory allocation
  • Encapsulate noc non blocking reads in cq into separate files
  • Expose Parameters in all gather
  • [Watcher] In order to get tt-train-cpp-unit group of tests green there was a need to skip some tests with watcher
  • SFPI 7.23.0 243
  • Using known interval to calculate uptime in check arc
  • Fix ttnn.{gcd,lcm} docs.
  • [Watcher] ttnn-unit-test group skips with watcher
  • Add 32x4 quad BH rankbindings file
  • move fabric benchmark test and update golden
  • #37259: add ifdef guard for layernorm kernels
  • [Quasar DFB]: Add support for multi-threaded producer/consumer + make blocked consumer use remapper
  • Add time budget controls for Galaxy model perf -> Galaxy perf pipeline
  • [skip ci] bring back BH GLX tests in CI
  • Add fabric telemetry neighbor node id exchange
  • Remove harvesting info from build_key when coordinate virtualization is enabled
  • [skip ci] Update CODEOWNERS for programming_examples
  • #0: add models timeout for bh
  • Make perf test timeout explicit for stable_diffusion_1_4 model
  • Fix DeepSeek V3 config loading when model path is a symlink
  • Refactor TTNN tests to use shared config for CI and TTSim
  • Add time budget controls for Galaxy demo pipeline
  • Remove tests/scripts/run_tests.sh and stress-fast-dispatch-build-and-unit-tests.yaml pipeline
  • #36852 BinaryNg kernel deadlocks with reshard
  • Fix segfaults on ttnn.ones, ttnn.zeros, ttnn.empty
  • [Merge stable to main] Llama3.3-70b and 3.1-8b - Fix sampling parameters
  • D2H Sockets
  • Fuse Post SDPA with TP All Reduce.
  • [skip ci] Increase timeout for blackhole deepseek blitz tests
  • #36881: add validation check for sharded softmax