Is it possible that the word tokenizer does not split off a trailing apostrophe or apostrophe-s?
For example, "Toyota's" is treated as a single token rather than being split into "Toyota" and "'s".
This has caused me quite a headache. Wouldn't it be more common to split these?
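
To make the expected behavior concrete, here is a minimal post-processing sketch of the split I have in mind, similar in spirit to Penn Treebank-style tokenization. It is not part of the tokenizer itself: `split_possessive` is a hypothetical helper name, and it assumes the tokens arrive as a plain list of strings.

```python
import re

# Hypothetical helper, not part of any tokenizer library.
# Matches a token that ends in 's (straight or curly apostrophe),
# or in a bare trailing apostrophe as in "dogs'".
_CLITIC = re.compile(r"^(.+?)(['\u2019]s|['\u2019])$", re.IGNORECASE)

def split_possessive(tokens):
    """Split a trailing 's or ' off each token, leaving other tokens as-is."""
    out = []
    for tok in tokens:
        m = _CLITIC.match(tok)
        if m:
            # group(1) is the stem, group(2) is the apostrophe suffix
            out.extend([m.group(1), m.group(2)])
        else:
            out.append(tok)
    return out

print(split_possessive(["Toyota's", "cars", "don't"]))
# → ['Toyota', "'s", 'cars', "don't"]
```

Note that this sketch also splits contractions such as "it's" into "it" and "'s", which may or may not be what you want.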