Why is adaptive masked training applied for only 30% of the iterations rather than 100%?