Inter frames already depend on a large part of the previous frame's decoding process, so some very simple inter prediction would allow good compression-efficiency gains without a large performance hit.
I didn't see this discussed anywhere else, so apologies if it's a duplicate.