LLM Representation Learning

Improving LLM reasoning abilities by disentangling its representation extraction abilities from reasoning on these representations. Contains:

classification on full dataset of inner states (with or without few-shot examples, with or without CoT)
classification on small sequences of inner states

Currently supports MCQA datasets only.

Overwrite constants.py, and run the numbered python files 1-compute-states.py, etc. in order.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
0-split_data.py		0-split_data.py
1-fine-tuning.py		1-fine-tuning.py
2-compute-states.py		2-compute-states.py
3-direct-results.py		3-direct-results.py
4-neural-network.py		4-neural-network.py
README.md		README.md
dataset.py		dataset.py
llm.py		llm.py
utils.py		utils.py

Provide feedback