Skip to content

v0.39.0

Choose a tag to compare

@github-actions github-actions released this 12 Dec 15:57
· 18645 commits to main since this release

📦 Uncategorized

  • #0: Add extra sentence about use cases in somewhat vague terms
  • #3824: cache weight tensors for mistral
  • Npetrovic/power fp sweep
  • #3918: Fix falcon7b perf profiling & add support to load weights from HF when weka is not mounted
  • Rename KernelID -> KernelHandle and CircularBufferID -> CBHandle
  • Aliu/erisc cleanup
  • #3003: ttnn program logging
  • Watcher output/doc tweaks
  • #4014: added support for uint16 datatype
  • #4000: Add links to demo folders in note in first 5 things
  • #3751: Fix sfpu load/store of ints
  • enable watcher for stress test actions
  • #3058: Give first pass at flattening build by getting rid of tt-metal intermediate libs
  • Revert "#3058: Give first pass at flattening build by getting rid of …
  • #3219: Added host functions which tilize and untilize bfloat16 vectors
  • stress test machine config update
  • #0: update to use concat on device
  • #3895: ttnn functional optimized Bert
  • #4014: Fix bug with packing uint16 datatype
  • #3824: move mistral embedding weights to weka
  • #3978: Fix readme to instruct running pytest without warnings
  • Dma/3467 dprint cleanup
  • #0: identity operator for comparison of SFPU ops
  • #3058: Add tracy back into build and test with ENABLE_TRACY=1
  • #3979: Add support for ResNet for weka unmounted machines to download ImageNet
  • #3990: Remove DPRINT SETW sticky bit
  • #4041: Add moreh_layernorm op
  • #4044: Add moreh_softmax, moreh_softmin ops
  • #3103: profile the SFPU operators
  • #0: function typo fix
  • #3211: bug in WH B0 - sum along dim3
  • Implementation for Bert Sharded Batch 12
  • #4069: Avoid reading out of bounds in the hugepage
  • #4014: Add testing for uint16 and uint32 on device
  • #0: Disable TestPrintRaiseWait gtest until a fix for nondet issue is in
  • Move hugepages section and refer to public syseng instructions for accelerator-level dependencies
  • #4055: non-deterministic test_pow_fractional PCC error with watcher enabled
  • #0: update test_sfpu and profiling conflict
  • #4043: Add discord link to docs support page + README
  • Noc on erisc
  • #3894: backward ops for tt-metal
  • #3972: Update tracy and device-side profiler docs
  • #4085: update seed value and re-verify the reported bug
  • #2860: Init one UMD per MMIO device ID and the remote devices it controls
  • #4074: Add opened, reopened, synchronize pull_request triggers (default) for static checks pipeline
  • #0: Ignore /device, not device/ in .gitignore
  • #4074: Add wording to CONTRIBUTING.md to be open to future forks + to discourage clogging up pipelines with too many PRs
  • #4053: Upgrade driver from 1.23 to 1.26 in release assets from syseng
  • #4065: Update pinned python3.8-venv to 20.04.9 because 20.04.8 is gone
  • #4096: Fix issue with DPRINT server closing too early for some WAITs
  • #4053: Add chmod ugo+x step in ansible scripts for copying over script assets
  • #4109: ttnn examples.rst needs update
  • #4158: support full repeat interleave developed for Mistral
  • #4076: Add instructions for execution for programming_examples and fix one typo
  • #0: (MINOR) Bump minor to v0.39.0
  • #4053: Get rid of FW labels for silicon runner targets
  • #3752: update ttnn tutorials and make them more descriptive
  • #3994: Add bfloat16 dtype to sweep tests
  • #0: update ownership for SFPU ops profiler, and Backward ops code
  • #3420: move init erisc info to clear l1 call
  • #3918: Add falcon caching support
  • #4125: Refactor tests for backward ops
  • Perf bloom
  • #4121: Unset TT_METAL_SLOW_DISPATCH_MODE when empty string in yaml. R…
  • #4079: Remove dprints from op kernels
  • #4176: uplift umd to include create-eth-map fixes
  • #4017: Replace static device APIs to query num available devices and num availale pcie devices with standalone host APIs
  • Fixup some error messages
  • Rework build system
  • #4228: Revert umd change to see if seg faults go away
  • #4003: use if-else instead of try-except in ttnn.reshape and ttnn.permute
  • #4003: updated ttnn.model_preprocessing to keep the structure of the model weights
  • #0: Changing name for major places from Metal to Metalium
  • #4186: Move all assets except for setup_hugepages.py to internal workflows
  • #4003: run test_performance_of_bloom_for_question_answering using L1 Config and assuming fused softmax
  • #3003: updated ttnn tests