Fix input text chunking algorithm

Currently, 
- Text chunks are too long (particularly in first token). This leads to long time-to-first-byte, dropped words, and pathological repetition
- Final chunk is too short, leading to weird artifacts at the end of output