For example figure 1:
In general, I am trying to figure out whether people train transformers with respect to epochs or iterations (one iteration is one batch).
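To make the two units concrete, here is a minimal sketch of how I understand the conversion between them. The function names and the `drop_last` flag are my own, chosen for illustration, and the sketch assumes one optimizer step per batch (no gradient accumulation).

```python
import math


def iterations_per_epoch(num_examples: int, batch_size: int, drop_last: bool = False) -> int:
    """Number of batches (iterations) needed to pass over the dataset once."""
    if drop_last:
        # The final, smaller batch is discarded.
        return num_examples // batch_size
    # The final, smaller batch still counts as one iteration.
    return math.ceil(num_examples / batch_size)


def epochs_to_iterations(epochs: int, num_examples: int, batch_size: int,
                         drop_last: bool = False) -> int:
    """Total iterations for a schedule stated in epochs."""
    return epochs * iterations_per_epoch(num_examples, batch_size, drop_last)


if __name__ == "__main__":
    # Hypothetical numbers, only to show the arithmetic.
    print(epochs_to_iterations(epochs=10, num_examples=1000, batch_size=32))
```

So a schedule given in epochs only pins down the number of iterations once the dataset size and batch size are known, which is why I am asking which unit is meant.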