Skip to content

Commit 44e8cdc

Browse files
authored
Migrate device headers from tt_metal/include to tt_metal/hw/inc (#36583)
Note: This PR is awaiting code review bypass. Do not review unless you're explicitly requested via DM. ### Ticket #31083 ### Problem description Device headers are scattered around tt_metal repo - reorganization is needed. We plan to put all device headers under `tt_metal/hw/inc`. ### What's changed This PR moves device headers from `tt_metal/include` to `tt_metal/hw/inc`. This change was created programatically. ### Checklist - [ ] [![All post-commit tests](https://github.com/tenstorrent/tt-metal/actions/workflows/all-post-commit-workflows.yaml/badge.svg?branch=prybicki/migrate-tt-metal-include2)](https://github.com/tenstorrent/tt-metal/actions/workflows/all-post-commit-workflows.yaml?query=branch:prybicki/migrate-tt-metal-include2) - [ ] [![Blackhole Post commit](https://github.com/tenstorrent/tt-metal/actions/workflows/blackhole-post-commit.yaml/badge.svg?branch=prybicki/migrate-tt-metal-include2)](https://github.com/tenstorrent/tt-metal/actions/workflows/blackhole-post-commit.yaml?query=branch:prybicki/migrate-tt-metal-include2) - [ ] [![cpp-unit-tests](https://github.com/tenstorrent/tt-metal/actions/workflows/tt-metal-l2-nightly.yaml/badge.svg?branch=prybicki/migrate-tt-metal-include2)](https://github.com/tenstorrent/tt-metal/actions/workflows/tt-metal-l2-nightly.yaml?query=branch:prybicki/migrate-tt-metal-include2) - [ ] New/Existing tests provide coverage for changes #### Model tests If your changes cover model-related code, you should run tests corresponding to affected models and platforms (Single card, T3K, Galaxy). "Choose your pipeline" workflows facilitate running multiple kinds of tests in a single run. Each offers `models-mandatory` and `models-extended` presets. The former includes a minimal set of tests, to be run always. The latter extends that with additional ones - use your best judgement in deciding which is the most appropriate for your PR. - [ ] [![(Single) Choose your pipeline](https://github.com/tenstorrent/tt-metal/actions/workflows/pipeline-select.yaml/badge.svg?branch=prybicki/migrate-tt-metal-include2)](https://github.com/tenstorrent/tt-metal/actions/workflows/pipeline-select.yaml?query=branch:prybicki/migrate-tt-metal-include2) - [ ] `models-mandatory` preset (runs: [Device perf regressions](https://github.com/tenstorrent/tt-metal/actions/workflows/perf-device-models.yaml) and [Frequent model and ttnn tests](https://github.com/tenstorrent/tt-metal/actions/workflows/fast-dispatch-full-regressions-and-models.yaml)) - [ ] `models-extended` preset (runs: the mandatory tests, plus [Demo](https://github.com/tenstorrent/tt-metal/actions/workflows/single-card-demo-tests.yaml) and [Model perf](https://github.com/tenstorrent/tt-metal/actions/workflows/perf-models.yaml) tests) - [ ] other selection - specify runs - [ ] [![(T3K) Choose your pipeline](https://github.com/tenstorrent/tt-metal/actions/workflows/pipeline-select-t3k.yaml/badge.svg?branch=prybicki/migrate-tt-metal-include2)](https://github.com/tenstorrent/tt-metal/actions/workflows/pipeline-select-t3k.yaml?query=branch:prybicki/migrate-tt-metal-include2) - [ ] `models-mandatory` preset (runs: [Unit tests](https://github.com/tenstorrent/tt-metal/actions/workflows/t3000-unit-tests.yaml)) - [ ] `models-extended` preset (runs: the mandatory tests, plus [Demo](https://github.com/tenstorrent/tt-metal/actions/workflows/t3000-demo-tests.yaml) and [Model perf](https://github.com/tenstorrent/tt-metal/actions/workflows/t3000-model-perf-tests.yaml) tests) - [ ] other selection - specify runs - [ ] [![(Galaxy) Choose your pipeline](https://github.com/tenstorrent/tt-metal/actions/workflows/pipeline-select-galaxy.yaml/badge.svg?branch=prybicki/migrate-tt-metal-include2)](https://github.com/tenstorrent/tt-metal/actions/workflows/pipeline-select-galaxy.yaml?query=branch:prybicki/migrate-tt-metal-include2) - [ ] `models-mandatory` preset (runs: [Quick tests](https://github.com/tenstorrent/tt-metal/actions/workflows/galaxy-quick.yaml)) - [ ] `models-extended` preset (runs: the mandatory tests, plus [Demo](https://github.com/tenstorrent/tt-metal/actions/workflows/galaxy-demo-tests.yaml) and [Model perf](https://github.com/tenstorrent/tt-metal/actions/workflows/galaxy-model-perf-tests.yaml) tests) - [ ] other selection - specify runs
1 parent bf5dfee commit 44e8cdc

File tree

473 files changed

+2011
-1911
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

473 files changed

+2011
-1911
lines changed

.github/CODEOWNERS

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -87,8 +87,8 @@ tt_metal/impl/program/ @abhullar-tt @jbaumanTT @tt-asaigal @tt-aho @tenstorrent/
8787
tt_metal/impl/sub_device/ @abhullar-tt @jbaumanTT @tt-asaigal @tt-aho @tenstorrent/codeowner-bypass
8888
tt_metal/impl/trace/ @abhullar-tt @jbaumanTT @tt-asaigal @tt-aho @tenstorrent/codeowner-bypass
8989
tt_metal/impl/tensor/ @akerteszTT @riverwuTT @tenstorrent/codeowner-bypass
90-
tt_metal/include/compute_kernel_api.h @davorchap @rtawfik01 @rdjogoTT @nvelickovicTT @amahmudTT @lpremovicTT @ncvetkovicTT @fvranicTT @tenstorrent/codeowner-bypass
91-
tt_metal/include/compute_kernel_api/ @rtawfik01 @rdjogoTT @nvelickovicTT @amahmudTT @lpremovicTT @ncvetkovicTT @fvranicTT @tenstorrent/codeowner-bypass
90+
tt_metal/hw/inc/api/compute/compute_kernel_api.h @davorchap @rtawfik01 @rdjogoTT @nvelickovicTT @amahmudTT @lpremovicTT @ncvetkovicTT @fvranicTT @tenstorrent/codeowner-bypass
91+
tt_metal/hw/inc/api/compute/ @rtawfik01 @rdjogoTT @nvelickovicTT @amahmudTT @lpremovicTT @ncvetkovicTT @fvranicTT @tenstorrent/codeowner-bypass
9292
tt_metal/jit_build/ @abhullar-tt @nathan-TT @jbaumanTT @kstevensTT @ruizhangTT @tenstorrent/codeowner-bypass
9393
tt_metal/jit_build/**/CMakeLists.txt @abhullar-tt @nathan-TT @jbaumanTT @ruizhangTT @tenstorrent/metalium-developers-infra @tenstorrent/codeowner-bypass
9494
tt_metal/kernels/ @abhullar-tt @tenstorrent/codeowner-bypass # these should go away or get moved to tests

docs/Doxyfile

Lines changed: 77 additions & 77 deletions
Original file line numberDiff line numberDiff line change
@@ -928,83 +928,83 @@ INPUT = tt_metal/hw/inc/api/compile_time_args.h \
928928
tt_metal/hw/inc/internal/ethernet/dataflow_api.h \
929929
tt_metal/api/tt-metalium/host_api.hpp \
930930
tt_metal/api/tt-metalium/experimental/profiler.hpp \
931-
tt_metal/include/compute_kernel_api/eltwise_unary/comp.h \
932-
tt_metal/include/compute_kernel_api/eltwise_unary/erf_erfc.h \
933-
tt_metal/include/compute_kernel_api/eltwise_unary/erfinv.h \
934-
tt_metal/include/compute_kernel_api/eltwise_unary/exp.h \
935-
tt_metal/include/compute_kernel_api/eltwise_unary/gelu.h \
936-
tt_metal/include/compute_kernel_api/eltwise_unary/recip.h \
937-
tt_metal/include/compute_kernel_api/eltwise_unary/sqrt.h \
938-
tt_metal/include/compute_kernel_api/eltwise_unary/rsqrt.h \
939-
tt_metal/include/compute_kernel_api/eltwise_unary/elu.h \
940-
tt_metal/include/compute_kernel_api/eltwise_unary/relu.h \
941-
tt_metal/include/compute_kernel_api/eltwise_unary/i0.h \
942-
tt_metal/include/compute_kernel_api/eltwise_unary/i1.h \
943-
tt_metal/include/compute_kernel_api/eltwise_unary/reverseops.h \
944-
tt_metal/include/compute_kernel_api/eltwise_unary/isinf_isnan.h \
945-
tt_metal/include/compute_kernel_api/eltwise_unary/logical_not_noti.h \
946-
tt_metal/include/compute_kernel_api/eltwise_unary/trigonometry.h \
947-
tt_metal/include/compute_kernel_api/eltwise_unary/rounding.h \
948-
tt_metal/include/compute_kernel_api/eltwise_unary/activations.h \
949-
tt_metal/include/compute_kernel_api/eltwise_unary/clamp.h \
950-
tt_metal/include/compute_kernel_api/eltwise_unary/trigonometric.h \
951-
tt_metal/include/compute_kernel_api/eltwise_unary/binop_with_scalar.h \
952-
tt_metal/include/compute_kernel_api/eltwise_unary/fill.h \
953-
tt_metal/include/compute_kernel_api/eltwise_unary/fmod.h \
954-
tt_metal/include/compute_kernel_api/eltwise_unary/rounding.h \
955-
tt_metal/include/compute_kernel_api/eltwise_unary/identity.h \
956-
tt_metal/include/compute_kernel_api/eltwise_unary/negative.h \
957-
tt_metal/include/compute_kernel_api/eltwise_unary/dropout.h \
958-
tt_metal/include/compute_kernel_api/eltwise_unary/typecast.h \
959-
tt_metal/include/compute_kernel_api/eltwise_unary/prelu.h \
960-
tt_metal/include/compute_kernel_api/eltwise_unary/softplus.h \
961-
tt_metal/include/compute_kernel_api/eltwise_unary/softsign.h \
962-
tt_metal/include/compute_kernel_api/eltwise_unary/hardtanh.h \
963-
tt_metal/include/compute_kernel_api/eltwise_unary/rand.h \
964-
tt_metal/include/compute_kernel_api/eltwise_unary/isinf_isnan.h \
965-
tt_metal/include/compute_kernel_api/eltwise_unary/left_shift.h \
966-
tt_metal/include/compute_kernel_api/eltwise_unary/right_shift.h \
967-
tt_metal/include/compute_kernel_api/eltwise_unary/remainder.h \
968-
tt_metal/include/compute_kernel_api/eltwise_unary/threshold.h \
969-
tt_metal/include/compute_kernel_api/eltwise_unary/selu.h \
970-
tt_metal/include/compute_kernel_api/eltwise_unary/where.h \
971-
tt_metal/include/compute_kernel_api/eltwise_unary/addcmul.h \
972-
tt_metal/include/compute_kernel_api/eltwise_unary/log1p.h \
973-
tt_metal/include/compute_kernel_api/eltwise_unary/comp.h \
974-
tt_metal/include/compute_kernel_api/eltwise_unary/bitwise_or.h \
975-
tt_metal/include/compute_kernel_api/eltwise_unary/bitwise_and.h \
976-
tt_metal/include/compute_kernel_api/eltwise_unary/bitwise_xor.h \
977-
tt_metal/include/compute_kernel_api/eltwise_unary/bitwise_not.h \
978-
tt_metal/include/compute_kernel_api/bcast.h \
979-
tt_metal/include/compute_kernel_api/cb_api.h \
980-
tt_metal/include/compute_kernel_api/compute_kernel_hw_startup.h \
981-
tt_metal/include/compute_kernel_api/eltwise_binary.h \
982-
tt_metal/include/compute_kernel_api/eltwise_binary_sfpu.h \
983-
tt_metal/include/compute_kernel_api/matmul.h \
984-
tt_metal/include/compute_kernel_api/pack.h \
985-
tt_metal/include/compute_kernel_api/pack_untilize.h \
986-
tt_metal/include/compute_kernel_api/quantization.h \
987-
tt_metal/include/compute_kernel_api/reconfig_data_format.h \
988-
tt_metal/include/compute_kernel_api/reduce.h \
989-
tt_metal/include/compute_kernel_api/reg_api.h \
990-
tt_metal/include/compute_kernel_api/tile_move_copy.h \
991-
tt_metal/include/compute_kernel_api/tilize.h \
992-
tt_metal/include/compute_kernel_api/transpose_wh.h \
993-
tt_metal/include/compute_kernel_api/untilize.h \
994-
tt_metal/include/compute_kernel_api/cumsum.h \
995-
tt_metal/include/compute_kernel_api/add_int_sfpu.h \
996-
tt_metal/include/compute_kernel_api/binop_with_scalar.h \
997-
tt_metal/include/compute_kernel_api/gcd.h \
998-
tt_metal/include/compute_kernel_api/xlogy.h \
999-
tt_metal/include/compute_kernel_api/lcm.h \
1000-
tt_metal/include/compute_kernel_api/mask.h \
1001-
tt_metal/include/compute_kernel_api/reshuffle.h \
1002-
tt_metal/include/compute_kernel_api/binary_shift.h \
1003-
tt_metal/include/compute_kernel_api/binary_max_min.h \
1004-
tt_metal/include/compute_kernel_api/binary_bitwise_sfpu.h \
1005-
tt_metal/include/compute_kernel_api/sub_int_sfpu.h \
1006-
tt_metal/include/compute_kernel_api/mul_int_sfpu.h \
1007-
tt_metal/include/compute_kernel_api.h \
931+
tt_metal/hw/inc/api/compute/eltwise_unary/comp.h \
932+
tt_metal/hw/inc/api/compute/eltwise_unary/erf_erfc.h \
933+
tt_metal/hw/inc/api/compute/eltwise_unary/erfinv.h \
934+
tt_metal/hw/inc/api/compute/eltwise_unary/exp.h \
935+
tt_metal/hw/inc/api/compute/eltwise_unary/gelu.h \
936+
tt_metal/hw/inc/api/compute/eltwise_unary/recip.h \
937+
tt_metal/hw/inc/api/compute/eltwise_unary/sqrt.h \
938+
tt_metal/hw/inc/api/compute/eltwise_unary/rsqrt.h \
939+
tt_metal/hw/inc/api/compute/eltwise_unary/elu.h \
940+
tt_metal/hw/inc/api/compute/eltwise_unary/relu.h \
941+
tt_metal/hw/inc/api/compute/eltwise_unary/i0.h \
942+
tt_metal/hw/inc/api/compute/eltwise_unary/i1.h \
943+
tt_metal/hw/inc/api/compute/eltwise_unary/reverseops.h \
944+
tt_metal/hw/inc/api/compute/eltwise_unary/isinf_isnan.h \
945+
tt_metal/hw/inc/api/compute/eltwise_unary/logical_not_noti.h \
946+
tt_metal/hw/inc/api/compute/eltwise_unary/trigonometry.h \
947+
tt_metal/hw/inc/api/compute/eltwise_unary/rounding.h \
948+
tt_metal/hw/inc/api/compute/eltwise_unary/activations.h \
949+
tt_metal/hw/inc/api/compute/eltwise_unary/clamp.h \
950+
tt_metal/hw/inc/api/compute/eltwise_unary/trigonometric.h \
951+
tt_metal/hw/inc/api/compute/eltwise_unary/binop_with_scalar.h \
952+
tt_metal/hw/inc/api/compute/eltwise_unary/fill.h \
953+
tt_metal/hw/inc/api/compute/eltwise_unary/fmod.h \
954+
tt_metal/hw/inc/api/compute/eltwise_unary/rounding.h \
955+
tt_metal/hw/inc/api/compute/eltwise_unary/identity.h \
956+
tt_metal/hw/inc/api/compute/eltwise_unary/negative.h \
957+
tt_metal/hw/inc/api/compute/eltwise_unary/dropout.h \
958+
tt_metal/hw/inc/api/compute/eltwise_unary/typecast.h \
959+
tt_metal/hw/inc/api/compute/eltwise_unary/prelu.h \
960+
tt_metal/hw/inc/api/compute/eltwise_unary/softplus.h \
961+
tt_metal/hw/inc/api/compute/eltwise_unary/softsign.h \
962+
tt_metal/hw/inc/api/compute/eltwise_unary/hardtanh.h \
963+
tt_metal/hw/inc/api/compute/eltwise_unary/rand.h \
964+
tt_metal/hw/inc/api/compute/eltwise_unary/isinf_isnan.h \
965+
tt_metal/hw/inc/api/compute/eltwise_unary/left_shift.h \
966+
tt_metal/hw/inc/api/compute/eltwise_unary/right_shift.h \
967+
tt_metal/hw/inc/api/compute/eltwise_unary/remainder.h \
968+
tt_metal/hw/inc/api/compute/eltwise_unary/threshold.h \
969+
tt_metal/hw/inc/api/compute/eltwise_unary/selu.h \
970+
tt_metal/hw/inc/api/compute/eltwise_unary/where.h \
971+
tt_metal/hw/inc/api/compute/eltwise_unary/addcmul.h \
972+
tt_metal/hw/inc/api/compute/eltwise_unary/log1p.h \
973+
tt_metal/hw/inc/api/compute/eltwise_unary/comp.h \
974+
tt_metal/hw/inc/api/compute/eltwise_unary/bitwise_or.h \
975+
tt_metal/hw/inc/api/compute/eltwise_unary/bitwise_and.h \
976+
tt_metal/hw/inc/api/compute/eltwise_unary/bitwise_xor.h \
977+
tt_metal/hw/inc/api/compute/eltwise_unary/bitwise_not.h \
978+
tt_metal/hw/inc/api/compute/bcast.h \
979+
tt_metal/hw/inc/api/compute/cb_api.h \
980+
tt_metal/hw/inc/api/compute/compute_kernel_hw_startup.h \
981+
tt_metal/hw/inc/api/compute/eltwise_binary.h \
982+
tt_metal/hw/inc/api/compute/eltwise_binary_sfpu.h \
983+
tt_metal/hw/inc/api/compute/matmul.h \
984+
tt_metal/hw/inc/api/compute/pack.h \
985+
tt_metal/hw/inc/api/compute/pack_untilize.h \
986+
tt_metal/hw/inc/api/compute/quantization.h \
987+
tt_metal/hw/inc/api/compute/reconfig_data_format.h \
988+
tt_metal/hw/inc/api/compute/reduce.h \
989+
tt_metal/hw/inc/api/compute/reg_api.h \
990+
tt_metal/hw/inc/api/compute/tile_move_copy.h \
991+
tt_metal/hw/inc/api/compute/tilize.h \
992+
tt_metal/hw/inc/api/compute/transpose_wh.h \
993+
tt_metal/hw/inc/api/compute/untilize.h \
994+
tt_metal/hw/inc/api/compute/cumsum.h \
995+
tt_metal/hw/inc/api/compute/add_int_sfpu.h \
996+
tt_metal/hw/inc/api/compute/binop_with_scalar.h \
997+
tt_metal/hw/inc/api/compute/gcd.h \
998+
tt_metal/hw/inc/api/compute/xlogy.h \
999+
tt_metal/hw/inc/api/compute/lcm.h \
1000+
tt_metal/hw/inc/api/compute/mask.h \
1001+
tt_metal/hw/inc/api/compute/reshuffle.h \
1002+
tt_metal/hw/inc/api/compute/binary_shift.h \
1003+
tt_metal/hw/inc/api/compute/binary_max_min.h \
1004+
tt_metal/hw/inc/api/compute/binary_bitwise_sfpu.h \
1005+
tt_metal/hw/inc/api/compute/sub_int_sfpu.h \
1006+
tt_metal/hw/inc/api/compute/mul_int_sfpu.h \
1007+
tt_metal/hw/inc/api/compute/compute_kernel_api.h \
10081008
tt_metal/impl/kernels/kernel_args.hpp \
10091009
tt_metal/include/tt_metal/metal.hpp \
10101010
tt_metal/api/tt-metalium/types.hpp \

docs/source/tt-metalium/tt_metal/labs/matmul/lab1/lab1.rst

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1163,10 +1163,10 @@ Then, adjust the code to perform matrix multiplication, by making the following
11631163
so your code only needs to generate indices in the right order.
11641164

11651165
#. Update the compute kernel to perform matrix multiplication rather than elementwise addition.
1166-
To initialize the Tensix Engine for matrix multiplication, you will need to use the ``mm_init`` function provided in ``tt_metal/include/compute_kernel_api/matmul.h``.
1166+
To initialize the Tensix Engine for matrix multiplication, you will need to use the ``mm_init`` function provided in ``tt_metal/hw/inc/api/compute/matmul.h``.
11671167
Do not use any other initialization functions for matrix multiplication (specifically do **not** use ``binary_op_init_common``, because that function is only
11681168
applicable to elementwise operations, not to matrix multiplication).
1169-
To multiply two tiles, you will need to use the ``matmul_tiles`` function provided in ``tt_metal/include/compute_kernel_api/matmul.h``.
1169+
To multiply two tiles, you will need to use the ``matmul_tiles`` function provided in ``tt_metal/hw/inc/api/compute/matmul.h``.
11701170
This function accumulates the result into the destination register; i.e. it adds to the existing values in the register rather than overwriting existing content.
11711171
By judiciously choosing when to call ``tile_regs_acquire``, which initializes all tiles in the destination register array to zero, and when to call
11721172
``tile_regs_commit``, which signals that the compute core is done writing to the destination register,

docs/source/tt-metalium/tt_metal/labs/matmul/lab2/lab2.rst

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -752,7 +752,7 @@ Follow these steps to complete the exercise:
752752
into the destination register requires a new operation:
753753

754754
- ``copy_tile(in_cb_id, in_tile_index, dst_tile_index)`` defined in
755-
``tt_metal/include/compute_kernel_api/tile_move_copy.h`` copies a tile from the intermediate
755+
``tt_metal/hw/inc/api/compute/tile_move_copy.h`` copies a tile from the intermediate
756756
CB to the destination register array at the specified index.
757757
- Before calling this function, you need to call ``copy_tile_to_dst_init_short(in_cb_id)``
758758
to set up the Tensix Engine for the copy operation.

models/demos/deepseek_v3_b1/kernel_includes/tt_metal/include/compute_kernel_api/add_rsqrt.h

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,8 +4,8 @@
44

55
#pragma once
66

7-
#include "compute_kernel_api.h"
8-
#include "compute_kernel_api/common.h"
7+
#include "api/compute/compute_kernel_api.h"
8+
#include "api/compute/common.h"
99
#ifdef TRISC_MATH
1010
#include "../../hw/ckernels/blackhole/metal/llk_api/llk_sfpu/llk_math_eltwise_unary_sfpu_add_rsqrt.h"
1111
#endif

models/demos/deepseek_v3_b1/kernel_includes/tt_metal/include/compute_kernel_api/custom_mm.h

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@
44

55
#pragma once
66

7-
#include "compute_kernel_api/common.h"
7+
#include "api/compute/common.h"
88
#ifdef TRISC_MATH
99
#include "../../hw/ckernels/blackhole/metal/llk_api/llk_math_custom_mm_api.h"
1010
#endif

models/demos/deepseek_v3_b1/kernel_includes/tt_metal/include/compute_kernel_api/deepseek_moe_gate.h

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -4,9 +4,9 @@
44

55
#pragma once
66

7-
#include "compute_kernel_api.h"
8-
#include "compute_kernel_api/common.h"
9-
#include "compute_kernel_api/transpose_wh.h"
7+
#include "api/compute/compute_kernel_api.h"
8+
#include "api/compute/common.h"
9+
#include "api/compute/transpose_wh.h"
1010
#ifdef TRISC_MATH
1111
#ifdef ARCH_BLACKHOLE
1212
#include "../../hw/ckernels/blackhole/metal/llk_api/llk_sfpu/llk_math_deepseek_moe_gate_topk_single_face.h"

models/demos/deepseek_v3_b1/kernel_includes/tt_metal/include/compute_kernel_api/rmsnorm.h

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,8 +4,8 @@
44

55
#pragma once
66

7-
#include "compute_kernel_api.h"
8-
#include "compute_kernel_api/common.h"
7+
#include "api/compute/compute_kernel_api.h"
8+
#include "api/compute/common.h"
99
#ifdef TRISC_MATH
1010
#include "../../hw/ckernels/blackhole/metal/llk_api/llk_math_rmsnorm_bcast_scalar_dest_reuse_api.h"
1111
#endif

models/demos/deepseek_v3_b1/micro_ops/deepseek_moe_gate/kernels/deepseek_moe_gate_compute.cpp

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -7,12 +7,12 @@
77
#define REDUCE_OP PoolType::SUM
88
#define REDUCE_DIM ReduceDim::REDUCE_ROW
99

10-
#include "compute_kernel_api.h"
11-
#include "compute_kernel_api/common.h"
12-
#include "compute_kernel_api/tile_move_copy.h"
13-
#include "compute_kernel_api/pack.h"
14-
#include "compute_kernel_api/reconfig_data_format.h"
15-
#include "compute_kernel_api/eltwise_binary.h"
10+
#include "api/compute/compute_kernel_api.h"
11+
#include "api/compute/common.h"
12+
#include "api/compute/tile_move_copy.h"
13+
#include "api/compute/pack.h"
14+
#include "api/compute/reconfig_data_format.h"
15+
#include "api/compute/eltwise_binary.h"
1616
#include "../../../kernel_includes/tt_metal/include/compute_kernel_api/deepseek_moe_gate.h"
1717

1818
void kernel_main() {

models/demos/deepseek_v3_b1/micro_ops/dram_streaming_matmul/kernels/compute.cpp

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@
44

55
#include <cstdint>
66

7-
#include "compute_kernel_api/tile_move_copy.h"
7+
#include "api/compute/tile_move_copy.h"
88
#include "../../../kernel_includes/tt_metal/include/compute_kernel_api/custom_mm.h"
99

1010
// Fused SiLU activation support (only when FUSE_SILU is defined)
@@ -13,7 +13,7 @@
1313
// - ITERATIONS: minimum 2 required for SFPU, then scales (m<=4->2, m=8->4, m>=16->8)
1414
// Total: significant speedup vs default silu_tile()
1515
#ifdef FUSE_SILU
16-
#include "compute_kernel_api.h" // for silu_tile_init() and llk_math_eltwise_unary_sfpu_silu
16+
#include "api/compute/compute_kernel_api.h" // for silu_tile_init() and llk_math_eltwise_unary_sfpu_silu
1717
#endif
1818

1919
/**

0 commit comments

Comments
 (0)