Maybe I was wrong, but i have run your code and results do not match the description in the paper. The mask generated by TimerCovariateMask is a simple upper triangular matrix, not the same as that shown in paper Figure 2.