Has anyone had got a good trained model and can share what dataset, model and training parameters/flags they used for training and eval?
I cant get good scores with kblam using Llama-3-8B-Instruct, text-embedding-3-large and the synthetic dataset.
Maybe also share what the final train loss and step count was.