In the Week 4-part 01 assignment there is a question - how to make the training and test sets more similar.
However, in the first chapter of textbook, it used hand writting of digital numbers from two population of entirely different writers as training and test data. That is, it required training and test at a certain degree of disimilarity.
I was wondering at which level traning and test data are desired to be similar? Thank you!