Have you ever tried using Qwen as the base model? When I use Qwen3, it doesn't work. It can't even learn to output only the answer.