what are the steps that i have to follow to train on my custom dataset. i have 2500 files for each keyword.