training metric train/code_usage_step

Thanks for your great work.
In stage 1 and stage 2, what is the typical value of the training metric train/code_usage_step?
When I try to reproduce the results on some datasets, this  train/code_usage_step metric is around 0.25 and  train/code_usage_epoch is around 0.3  — is that considered too low?