SPAR: Self-Forecasting
Repositories

- joe-self-prediction-scheming-em (Public): Disentangling incapability from scheming in LLMs self-predicting their agentic trajectories; measuring self-prediction capabilities on emergent-misalignment failures.
- emanuel-ai-psychosis-self-prediction (Public, HTML): Self-prediction vs. cross-prediction experiment on AI psychosis red-teaming scores.
- gemma2-boolq-calibration (Public, Python): RL training of Gemma 2 2B IT for calibrated YES/NO probability estimates on BoolQ using GRPO.
- andrew-bloom-self-prediction (Public)
- emanuel-infra-competitive-programming (Public, Python): Framework for testing LLMs' ability to predict their own behavior in multi-turn and agentic scenarios.
- lydia-demo-first-token (Public, Python): We take a list of base prompts (e.g. "What is 2+2?") and a prefix wrapper, WRAPPER = 'What would you say in response to this prompt: "{p}"', then compare top-1 agreement and JS divergence between the first-token output distributions with and without the wrapper. Llama-4-Maverick performs particularly well.
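The comparison described for lydia-demo-first-token can be sketched as follows. This is a minimal illustration, not code from the repo: the vocabulary and the two first-token distributions are made-up numbers standing in for model logprobs on a prompt asked directly vs. inside the wrapper.

```python
import math

def js_divergence(p, q):
    """Jensen-Shannon divergence (base 2, so bounded in [0, 1])
    between two distributions over the same vocabulary."""
    def kl(a, b):
        return sum(ai * math.log2(ai / bi) for ai, bi in zip(a, b) if ai > 0)
    m = [(ai + bi) / 2 for ai, bi in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

def top1_agree(p, q):
    """True if both distributions put the most mass on the same token."""
    return max(range(len(p)), key=p.__getitem__) == max(range(len(q)), key=q.__getitem__)

# Hypothetical first-token distributions over the vocab ["4", "2", "The"]
# for "What is 2+2?" asked directly vs. wrapped in the prefix.
direct  = [0.90, 0.05, 0.05]
wrapped = [0.70, 0.10, 0.20]

print(top1_agree(direct, wrapped))   # both argmax to "4"
print(js_divergence(direct, wrapped))
```

In the actual experiment these distributions would come from the model's first-token logprobs, with top-1 agreement and JS divergence averaged over the list of base prompts.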