[CUDA] Support FP8 (E4M3) KV Cache for Group Query Attention · microsoft/onnxruntime@2a25780

Run reviewdog/action-cpplint@master

reviewdog: Too many results (annotations) in diff. You may miss some annotations due to GitHub limitation for annotation created by logging command. Please check GitHub Actions log console to see all results.
Limitation:
- 10 warning annotations and 10 error annotations per step
- 50 annotations per job (sum of annotations from all the steps)
- 50 annotations per run (separate from the job annotations, these annotations aren't created by users)
Source: https://github.com/orgs/community/discussions/26680#discussioncomment-3252835

Run reviewdog/action-cpplint@master: onnxruntime/contrib_ops/cuda/bert/xqa/xqa_loader_bf16_fp8_128.cu#L8

[cpplint] reported by reviewdog 🐶 Include the directory when naming header files [build/include_subdir] [4] Raw Output: onnxruntime/contrib_ops/cuda/bert/xqa/xqa_loader_bf16_fp8_128.cu:8: Include the directory when naming header files [build/include_subdir] [4]

Run reviewdog/action-cpplint@master: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qkv.cuh#L355

[cpplint] reported by reviewdog 🐶 If an else has a brace on one side, it should have it on both [readability/braces] [5] Raw Output: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qkv.cuh:355: If an else has a brace on one side, it should have it on both [readability/braces] [5]

Run reviewdog/action-cpplint@master: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qkv.cuh#L222

[cpplint] reported by reviewdog 🐶 If/else bodies with multiple statements require braces [readability/braces] [4] Raw Output: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qkv.cuh:222: If/else bodies with multiple statements require braces [readability/braces] [4]

Run reviewdog/action-cpplint@master: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qkv.cuh#L222

[cpplint] reported by reviewdog 🐶 If an else has a brace on one side, it should have it on both [readability/braces] [5] Raw Output: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qkv.cuh:222: If an else has a brace on one side, it should have it on both [readability/braces] [5]

Run reviewdog/action-cpplint@master: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qdq.cuh#L290

[cpplint] reported by reviewdog 🐶 If/else bodies with multiple statements require braces [readability/braces] [4] Raw Output: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qdq.cuh:290: If/else bodies with multiple statements require braces [readability/braces] [4]

Run reviewdog/action-cpplint@master: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qdq.cuh#L290

[cpplint] reported by reviewdog 🐶 If an else has a brace on one side, it should have it on both [readability/braces] [5] Raw Output: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qdq.cuh:290: If an else has a brace on one side, it should have it on both [readability/braces] [5]
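Note on the two [readability/braces] findings above: cpplint wants braces around any if/else body with more than one statement, and wants both arms braced whenever either one is. The flagged code at group_query_attention_qdq.cuh:290 is not shown in this log, so the sketch below uses a hypothetical `SplitRange` helper and `Range` struct purely to illustrate a form that should pass both checks.

```cpp
#include <cassert>

// Hypothetical types and names, not taken from the PR.
struct Range {
  int begin;
  int end;
};

// A form cpplint would flag (brace on one side of the else only):
//
//   if (first_half) {
//     r.begin = 0;
//     r.end = n / 2;
//   } else
//     r.begin = n / 2, r.end = n;
//
// Bracing both arms satisfies both [readability/braces] checks.
inline Range SplitRange(int n, bool first_half) {
  Range r{0, 0};
  if (first_half) {
    r.begin = 0;
    r.end = n / 2;
  } else {
    r.begin = n / 2;
    r.end = n;
  }
  return r;
}
```

The same pattern would apply to the other [readability/braces] annotations in this run (group_query_attention_qkv.cuh:222 and :355, group_query_attention_qdq.cuh:158), assuming they have a similar shape.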

Run reviewdog/action-cpplint@master: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qdq.cuh#L283

[cpplint] reported by reviewdog 🐶 Using C-style cast. Use static_cast<int64_t>(...) instead [readability/casting] [4] Raw Output: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qdq.cuh:283: Using C-style cast. Use static_cast<int64_t>(...) instead [readability/casting] [4]

Run reviewdog/action-cpplint@master: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qdq.cuh#L282

[cpplint] reported by reviewdog 🐶 Using C-style cast. Use static_cast<int64_t>(...) instead [readability/casting] [4] Raw Output: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qdq.cuh:282: Using C-style cast. Use static_cast<int64_t>(...) instead [readability/casting] [4]

Run reviewdog/action-cpplint@master: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qdq.cuh#L281

[cpplint] reported by reviewdog 🐶 Using C-style cast. Use static_cast<int64_t>(...) instead [readability/casting] [4] Raw Output: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qdq.cuh:281: Using C-style cast. Use static_cast<int64_t>(...) instead [readability/casting] [4]
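Note on the three [readability/casting] findings above (group_query_attention_qdq.cuh:281-283): the fix cpplint suggests is to replace `(int64_t)x` with `static_cast<int64_t>(x)`. The actual expressions are not shown in this log. The sketch below assumes they widen an index before multiplying to avoid 32-bit overflow, which is a common reason for such casts in indexing code; `FlatOffset` and its parameters are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper, not taken from the PR. Computes a flat element
// offset into a [batch, num_heads, seq_len, head_size] buffer.
inline int64_t FlatOffset(int batch, int head, int seq,
                          int num_heads, int seq_len, int head_size) {
  // Flagged form:   int64_t b = (int64_t)batch;
  // Preferred form: widen once with static_cast, so the rest of the
  // arithmetic is carried out in 64 bits.
  const int64_t b = static_cast<int64_t>(batch);
  return ((b * num_heads + head) * seq_len + seq) * head_size;
}
```

Widening the leftmost operand is enough here, because each subsequent `int` operand is promoted to `int64_t` before it is combined with the running 64-bit value.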

Run reviewdog/action-cpplint@master: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qdq.cuh#L158

[cpplint] reported by reviewdog 🐶 If an else has a brace on one side, it should have it on both [readability/braces] [5] Raw Output: onnxruntime/contrib_ops/cuda/bert/group_query_attention_qdq.cuh:158: If an else has a brace on one side, it should have it on both [readability/braces] [5]


[CUDA] Support FP8 (E4M3) KV Cache for Group Query Attention #50649

Optional Lint C++
