Improving LLM reasoning abilities by disentangling its representation extraction abilities from reasoning on these representations. Contains:
- classification on full dataset of inner states (with or without few-shot examples, with or without CoT)
- classification on small sequences of inner states
Currently supports MCQA datasets only.
Overwrite constants.py, and run the numbered python files 1-compute-states.py, etc. in order.