Skip to content

Anatoliylitv/bloat 16 use hip bfloat 16 as parent#3112

Draft
anatoliylitv wants to merge 4 commits intorelease/2.9from
anatoliylitv/bloat_16_use_hip__bfloat_16_as_parent
Draft

Anatoliylitv/bloat 16 use hip bfloat 16 as parent#3112
anatoliylitv wants to merge 4 commits intorelease/2.9from
anatoliylitv/bloat_16_use_hip__bfloat_16_as_parent

Conversation

@anatoliylitv
Copy link
Copy Markdown

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

…tage of possible optimizations from compiler.
…ompatibility and performance. Updated member functions to utilize the base class directly, ensuring consistent behavior across HIP and CUDA environments.
…dd conversion operator to __hip_bfloat16. This change enhances compatibility with HIP and maintains consistent behavior across different platforms.
…, enhancing compatibility with HIP. Updated constructor initialization and conversion operators for improved consistency across platforms.
@anatoliylitv anatoliylitv marked this pull request as draft March 31, 2026 13:21
@rocm-repo-management-api
Copy link
Copy Markdown

rocm-repo-management-api bot commented Mar 31, 2026

Jenkins build for 223212a09a29a4d0caafd53d5e6ad80ef58c6759 commit finished as FAILURE
Links: Pipeline Overview / Build artifacts / Test Results

Detected error during Pytorch building:

Warning: Unused direct dependencies:
	/lib/x86_64-linux-gnu/libm.so.6
	/lib/x86_64-linux-gnu/libgcc_s.so.1
[5729/8156] Building CXX object c10/test/CMakeFiles/c10_Synchronized_test.dir/util/Synchronized_test.cpp.o
[5730/8156] Building HIPCC object caffe2/aten/src/ATen/CMakeFiles/fbgemm_genai.dir/__/__/__/third_party/fbgemm/fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/fbgemm_genai_generated_fp8_rowwise_grouped_128x16x32x256_16x16_1x1_16x8x1_16x8x1_1x16x1x8_4x4x1_1x1_interwave_v2.hip.o
FAILED: caffe2/aten/src/ATen/CMakeFiles/fbgemm_genai.dir/__/__/__/third_party/fbgemm/fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/fbgemm_genai_generated_fp8_rowwise_grouped_128x16x32x256_16x16_1x1_16x8x1_16x8x1_1x16x1x8_4x4x1_1x1_interwave_v2.hip.o /var/lib/jenkins/pytorch/build/caffe2/aten/src/ATen/CMakeFiles/fbgemm_genai.dir/__/__/__/third_party/fbgemm/fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/fbgemm_genai_generated_fp8_rowwise_grouped_128x16x32x256_16x16_1x1_16x8x1_16x8x1_1x16x1x8_4x4x1_1x1_interwave_v2.hip.o 
cd /var/lib/jenkins/pytorch/build/caffe2/aten/src/ATen/CMakeFiles/fbgemm_genai.dir/__/__/__/third_party/fbgemm/fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels && /opt/conda/envs/py_3.12/lib/python3.12/site-packages/cmake/data/bin/cmake -E make_directory /var/lib/jenkins/pytorch/build/caffe2/aten/src/ATen/CMakeFiles/fbgemm_genai.dir/__/__/__/third_party/fbgemm/fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/. && /opt/conda/envs/py_3.12/lib/python3.12/site-packages/cmake/data/bin/cmake -D verbose:BOOL=OFF -D build_configuration:STRING=RELEASE -D generated_file:STRING=/var/lib/jenkins/pytorch/build/caffe2/aten/src/ATen/CMakeFiles/fbgemm_genai.dir/__/__/__/third_party/fbgemm/fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/./fbgemm_genai_generated_fp8_rowwise_grouped_128x16x32x256_16x16_1x1_16x8x1_16x8x1_1x16x1x8_4x4x1_1x1_interwave_v2.hip.o -P /var/lib/jenkins/pytorch/build/caffe2/aten/src/ATen/CMakeFiles/fbgemm_genai.dir/__/__/__/third_party/fbgemm/fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/fbgemm_genai_generated_fp8_rowwise_grouped_128x16x32x256_16x16_1x1_16x8x1_16x8x1_1x16x1x8_4x4x1_1x1_interwave_v2.hip.o.cmake
In file included from /var/lib/jenkins/pytorch/third_party/fbgemm/fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/fp8_rowwise_grouped_128x16x32x256_16x16_1x1_16x8x1_16x8x1_1x16x1x8_4x4x1_1x1_interwave_v2.hip:9:
In file included from /var/lib/jenkins/pytorch/third_party/fbgemm/fbgemm_gpu/experimental/gen_ai/src/quantize/ck_extensions/fp8_rowwise_grouped/kernels/fp8_rowwise_grouped_common.h:9:
In file included from /var/lib/jenkins/pytorch/aten/src/ATen/ATen.h:7:
In file included from /var/lib/jenkins/pytorch/aten/src/ATen/Context.h:4:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant