So when I tried to tokenized the wenetspeech, I got RuntimeError: CUDA out of memory. Is there any possible for on-the-fly?