PFA snip for one instance which shows the JSON parsing errors for the use case of ISL 7K and OSL 1K
model being used is Llam3.1-70B with 2 replicas deployed( fp8-tp2-pp1-latency NIM profile)
- Is there way to specify that the benchmark be terminated instead of continuing when JSON parsing errors are encountered?
- Why are these errors not accounted and being shown under errors in the dashboard?
- please let me know if there are any inputs to avoid getting into these JSON parsing errors
aiperf --version
0.3.0
