I successfully compiled the MILC code (master branch) with QUDA version 1.1.0 using CUDA v11.8.
The QUDA compilation passed all the tests.
I compiled the su3_rhmc_hisq target for ks_imp_rhmc.
I then launched a test job on 4 nodes, each with 4 Nvidia A100 GPUs.
The job aborted with the following error:
“
ERROR: Error in unitarization component of the hisq fattening: 1048576 failures (/leonardo/pub/userexternal/lcosmai0/AREA_COMPILAZIONE_QUDA/quda-1.1.0/lib/interface_quda.cpp:4154 in computeKSLinkQuda())
“
Could you please provide any suggestions on how to resolve this issue?
Best regards,
Leonardo
I successfully compiled the MILC code (master branch) with QUDA version 1.1.0 using CUDA v11.8.
The QUDA compilation passed all the tests.
I compiled the su3_rhmc_hisq target for ks_imp_rhmc.
I then launched a test job on 4 nodes, each with 4 Nvidia A100 GPUs.
The job aborted with the following error:
“
ERROR: Error in unitarization component of the hisq fattening: 1048576 failures (/leonardo/pub/userexternal/lcosmai0/AREA_COMPILAZIONE_QUDA/quda-1.1.0/lib/interface_quda.cpp:4154 in computeKSLinkQuda())
“
Could you please provide any suggestions on how to resolve this issue?
Best regards,
Leonardo