Skip to content
@SPAR-Self-Forecasting

SPAR: Self-Forecasting

Popular repositories Loading

  1. andrew-introspection-steering andrew-introspection-steering Public

    Python 1

  2. emanuel-ai-psychosis-self-prediction emanuel-ai-psychosis-self-prediction Public

    Self-prediction vs cross-prediction experiment on AI psychosis red-teaming scores

    HTML 1

  3. gemma2-boolq-calibration gemma2-boolq-calibration Public

    RL training of Gemma 2 2B IT for calibrated YES/NO probability estimates on BoolQ using GRPO

    Python 1

  4. lydia-demo-first-token lydia-demo-first-token Public

    We have a list of base_prompts, e.g. "What is 2+2?". We have a prefix wrapper: "WRAPPER = 'What would you say in response to this prompt: "{p}"'. We compare the top-1 agreement, JS-divergence betwe…

    Python

  5. emanuel-infra-competitive-programming emanuel-infra-competitive-programming Public

    Framework for testing LLMs' ability to predict their own behavior in multi-turn and agentic scenarios

    Python

  6. andrew-bloom-self-prediction andrew-bloom-self-prediction Public

    Python

Repositories

Showing 7 of 7 repositories

Top languages

Loading…

Most used topics

Loading…