Add CUDA implementation for attn_probs

239df8b
Closed

Make MultiHeadAttention op return attention probabilities #23125

Azure Pipelines / ONNX Runtime Web CI Pipeline (Precheck_and_extract_commit Precheck_and_extract_commit) succeeded Dec 17, 2024 in 1m 52s
