First, thank you for providing your implementation, it is quite instructive.
I had a question regarding what normalization was done for MSE during evaluation - was the normalization denominator the variance(y_true) or mean((y_true)^2)?
Because the code for training uses just mse without normalization -
I could not find a reference to normalized mse for evaluation in this repo (though I may have missed it)
First, thank you for providing your implementation, it is quite instructive.
I had a question regarding what normalization was done for MSE during evaluation - was the normalization denominator the variance(y_true) or mean((y_true)^2)?
Because the code for training uses just mse without normalization -
I could not find a reference to normalized mse for evaluation in this repo (though I may have missed it)