Skip to content

Releases: ROCm/Tensile

rocm-7.0.2

14 Oct 17:14

Choose a tag to compare

ROCm release v7.0.2

rocm-7.0.0

17 Sep 15:48

Choose a tag to compare

ROCm release v7.0.0

Tensile 4.43.0 for ROCm 6.4.4

24 Sep 14:01
7449c7f

Choose a tag to compare

Tensile code for ROCm 6.4.4 did not change. The library was rebuilt for the updated ROCm 6.4.4 stack.

Tensile 4.43.0 for ROCm 6.4.3

07 Aug 14:19
be49885

Choose a tag to compare

Tensile code for ROCm 6.4.3 did not change. The library was rebuilt for the updated ROCm 6.4.3 stack.

Tensile 4.43.0 for ROCm 6.4.2

21 Jul 16:54
be49885

Choose a tag to compare

Tensile code for ROCm 6.4.2 did not change. The library was rebuilt for the updated ROCm 6.4.2 stack.

Tensile 4.43.0 for ROCm 6.4.1

20 May 13:15
be49885

Choose a tag to compare

Tensile code for ROCm 6.4.1 did not change. The library was rebuilt for the updated ROCm 6.4.1 stack.

Tensile 4.43.0 for ROCm 6.4.0

11 Apr 13:34
be49885

Choose a tag to compare

Added

  • Nightly builds with performance statistics
  • Cache asm capabilities for reuse
  • venv for Tensile create on Linux
  • Flag to keep build_tmp when running Tensile
  • Generalized profiling scripts
  • GFX1151 support
  • Single-threaded support in TensileCreateLibrary
  • Logic to remove temporary build artifacts

Changed

  • Updated Tensile documents (API reference, README.md, and comments)
  • Disabled asm-cache for tests
  • Used hipcc.bat as a compiler on Windows instead of the Perl script
  • Improved clarity of CHANGELOG.md
  • Enabled external CI
  • Improved Tensile documentation
  • Refactored kernel source and header creation
  • Refactored writeKernels in TensileCreateLibrary
  • Suppressed developer warnings (simplifying the Tensile output)
  • Used an explicit cast when invoking min is called
  • Used cache abbreviations to compute kernel names

Removed

  • OCL backend
  • Unsupported tests
  • Deep copy in TensileCreateLibrary

Optimized

  • Linearized asm register search to reduce build time

Resolved issues

  • Fixed Stream-K dynamic grid model
  • Fixed logic related to caching asm capabilities
  • Fixed accvgpr overflow
  • Fixed test failures in SLES containers when running TensileTests
  • Fixed a regression that prevents TensileCreateLibrary from completing when fallback logic is not available

Tensile 4.41.0 for ROCm 6.2.4

06 Nov 19:55
81ae953

Choose a tag to compare

Tensile code for ROCm 6.2.4 did not change. The library was rebuilt for the updated ROCm 6.2.4 stack.

Tensile 4.42.0 for ROCm 6.3.3

19 Feb 17:47
aca95d1

Choose a tag to compare

Tensile code for ROCm 6.3.3 did not change. The library was rebuilt for the updated ROCm 6.3.3 stack.

Tensile 4.42.0 for ROCm 6.3.2

28 Jan 15:43
aca95d1

Choose a tag to compare

Tensile code for ROCm 6.3.2 did not change. The library was rebuilt for the updated ROCm 6.3.2 stack.