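'gbk' codec can't decode byte 0xb4 in position 35: illegal multibyte sequence. Forcing the encoding to utf-8 gets it running, but after a few thousand training iterations it throws a similar error, again some kind of "can't decode" message.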
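
One possible explanation, which is only a guess since the failing file isn't shown: the data files are not all in the same encoding, so utf-8 works until training reaches a sample that was saved as GBK (or contains stray bytes). A minimal sketch for checking this, assuming the data is plain text files read from disk; `read_text_lenient` and `find_undecodable` are hypothetical helper names, not part of any library:

```python
from pathlib import Path


def read_text_lenient(path, encodings=("utf-8", "gb18030")):
    """Try each encoding in order and return (text, encoding_used).

    gb18030 is a superset of GBK. If nothing decodes cleanly, fall back
    to UTF-8 with replacement characters so training does not crash.
    """
    raw = Path(path).read_bytes()
    for enc in encodings:
        try:
            return raw.decode(enc), enc
        except UnicodeDecodeError:
            continue
    return raw.decode("utf-8", errors="replace"), "utf-8 (lossy)"


def find_undecodable(paths, encoding="utf-8"):
    """Return (path, byte_offset) for every file that fails to decode."""
    bad = []
    for p in paths:
        try:
            Path(p).read_bytes().decode(encoding)
        except UnicodeDecodeError as exc:
            # exc.start is the offset of the first offending byte
            bad.append((str(p), exc.start))
    return bad
```

Running `find_undecodable` over the dataset before training should list the files that will trip the loader later, and those can then be re-saved as UTF-8. If the reading happens inside a library call you can't change, passing `errors="ignore"` or `errors="replace"` to the built-in `open()` at the point where the file is opened is another option, at the cost of silently losing the bad characters.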