Skip to content

v0.66.0-dev20260109

Pre-release
Pre-release

Choose a tag to compare

@github-actions github-actions released this 09 Jan 14:22
· 1161 commits to main since this release
f18f1d8

Note

If you are installing from a release, please refer to the README, INSTALLATION instructions, and any other documentation packaged with the release, not on the main branch. There may be differences between the latest main and the previous release.

The changelog will now follow, showing the changes from last release.

This release was generated by the CI workflow https://github.com/tenstorrent/tt-metal/actions/runs/20836658890

📦 Uncategorized

  • [skip ci] Add timeout to package installation step
  • Expanding module tests to ensure added seq len functionality for Deepseek 671B model
  • Fix BH performance: Remove unnecessary NOC_BRCST_EXCLUDE resets
  • Graph tracing improvement
  • #35326: Add Deepseek Blitz unit tests to CI
  • Make allocate_tensor_on_device private, use create_device_tensor instead
  • [Fabric] Add infra for dynamic packet header sizing
  • Add sweeps for new model traced ops
  • Improve Out of Memory Error Message
  • [skip ci] update gpt-oss README
  • Add teacher forcing demo test for Deepseek 671B model
  • [DM] Update data movement tests
  • #34947: ttnn_tracer_model ttnn tutorial fix
  • Add memory usage tracking for DRAM & L1 in training loop
  • #0: [skip ci] Add P100 support in git bisect
  • Update ttexalens reference version to 0.2.0
  • [skip ci] Enable t3k demo tests cron job
  • adds TT_METAL_JIT_ANALYTICS environment variable
  • Add support for Automatic Prefix Caching in TT-Transformers
  • Reenable fabric manager tests in Galaxy Quick
  • #32983: Remove some initial calls to test_system_health as it's being deprecated
  • Expose Hyperparams to Standard Namespace AG & RS
  • Strip unused symbols in sub_device.hpp
  • Launch dispatch kernels in parallel on multiple devices
  • [skip ci] Update Wheel Artifact Naming Convention in CI
  • Reduce channel count when not all channels are needed.
  • allow subordinate_sync_t per architecture
  • [skip ci] Add bh demo tests and bh multi card test to release testing
  • [skip ci] Optimize clang-tidy presets: disable tt-train and switch to Debug config
  • Apascual/30094 test mixtral decoder against hf
  • [skip ci] update merge gate alerts
  • [TT-Train] GSM8K Finetuning example with dashboard and Galaxy support
  • Fix swapped BASE_DIRS in kernel_helper_functions CMakeLists.txt
  • Moved get_batch_size to shape file
  • Move compute_flat_indices to shape
  • Add owners of vLLM integration tech report
  • feat: refactor import_tracy_op_logs
  • Migrate op to new infra: all_gather_async
  • [skip ci] zstd for .debs
  • #35441: Fix ttnn.visualize_tensor() crash on multi-host systems
  • Haibo sun/issue#29156