Thank you again for your excellent work. I trained an mT0 model on my own dataset, and it performs well. I am now trying to train a BLOOMZ model, but the training loss does not decrease at all, with either the quantized or the non-quantized version. Could you please advise on how to address this issue?