Skip to content

psgd-jax 0.2.9

Latest

Choose a tag to compare

@evanatyourservice evanatyourservice released this 31 Dec 22:16

What's Changed

  • swapped normalize_grads out for clipping outputs by RMS. This is more stable, more accurate, and will work in a wider variety of situations. normalizing input grads is worse due to getting rid of valuable info for preconditioners.