-
Notifications
You must be signed in to change notification settings - Fork 1
Observations
Nicholas Guttenberg edited this page Feb 21, 2025
·
2 revisions
See something interesting or unusual, or have a small result that doesn't quite rise to the level of a full project? Document it here! If it's complicated to describe, we recommend creating a new wiki page and linking to it from here, with a brief summary here.
Failure to learn CoT with tsumego problems/Qwen-2.5-3b-Instruct - Training methodology that was confirmed to learn CoT on the countdown game did not learn CoT for a different problem domain (tsumego problems for the game of Go)