Hello,
I am currently studying the code of your project and I am wondering what the pre-training dataset used for the model is. Could you please provide more details about the pre-training dataset?
It would be very helpful for my understanding.
Thank you!