Hi, I'm trying to use this model for Arabic, and I see that we need to create transforms for the language we are using to get good results. How can we do this? Can we just run the trainer.py file without specifying the vocabulary path?
Would doing this automatically generate a transform .txt file containing transforms specific to my dataset?
Many thanks!