Personalized Decision Modeling: Utility Optimization or Textualized-Symbolic Reasoning

🧩 Overview

Traditional utility-based models explain what people would do; ATHENA aims to explain what people actually do.

The framework has two key stages:

Group-Level Symbolic Utility Discovery — LLM-guided symbolic regression discovers interpretable, group-level utility functions.
Individual-Level Semantic Adaptation — personalized templates are optimized with textual gradients to capture individual preferences and constraints.
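
To make the first stage more concrete, the sketch below shows how a symbolic utility function can be turned into choice probabilities with a standard multinomial logit (softmax). The functional form, the coefficients, and the attribute names (time_min, cost) are illustrative assumptions, not utilities discovered by ATHENA.

```python
import math

def utility(time_min: float, cost: float) -> float:
    # Hypothetical group-level utility: longer and costlier trips are less attractive.
    return -0.02 * time_min - 0.05 * cost

def choice_probabilities(alternatives):
    # Multinomial logit: P(i) = exp(U_i) / sum_j exp(U_j).
    utils = {name: utility(t, c) for name, (t, c) in alternatives.items()}
    u_max = max(utils.values())  # subtract the maximum for numerical stability
    exp_u = {name: math.exp(u - u_max) for name, u in utils.items()}
    total = sum(exp_u.values())
    return {name: e / total for name, e in exp_u.items()}

# Made-up (travel time in minutes, cost) pairs for three travel modes.
probs = choice_probabilities({
    "train": (120.0, 40.0),
    "swissmetro": (60.0, 60.0),
    "car": (100.0, 50.0),
})
print(max(probs, key=probs.get))
```

With these made-up numbers the alternative with the highest utility receives the largest probability; the second stage then adapts such group-level behaviour to individuals.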

Getting Started

Prerequisites

Python 3.10 or newer
Optional local LLM runtime (e.g. Ollama) if you prefer not to use hosted APIs

Setup

git clone https://github.com/your-org/UtilitySymReg.git
cd UtilitySymReg
conda create -n athena python=3.10 -y
conda activate athena
pip install -r requirements.txt

Environment variables (export the ones that match the backends you plan to call):

| Provider/Backend | Variable | Notes |
| --- | --- | --- |
| OpenAI (`--chatbot openai`) | `OPENAI_API_KEY` | Required for GPT-4o/mini and any TextGrad run using the OpenAI endpoint. |
| Google Gemini (`--chatbot gemini`) | `GEMINI_API_KEY` | Required for Gemini 2.0 Flash. |
| Lambda Labs (`--chatbot lambda`) | `LAMBDA_API_KEY` | Needed for Lambda-hosted DeepSeek models and TextGrad with `base_url=LAMBDA_BASE_URL`. |
| DeepInfra (`--chatbot deepinfra`) | `DEEPINFRA_API_KEY` | Needed for DeepInfra-hosted DeepSeek models and TextGrad with `base_url=DEEPINFRA_BASE_URL`. |

Example setup on macOS/Linux:

export OPENAI_API_KEY="sk-..."
export GEMINI_API_KEY="AIza..."
export LAMBDA_API_KEY="lam-..."
export DEEPINFRA_API_KEY="din-..."
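
Before launching a long run it can help to confirm that the key for the chosen backend is actually set. The following is a minimal sketch, not part of the repository: the helper name missing_key is hypothetical, and the mapping simply mirrors the table above.

```python
import os
from typing import Mapping, Optional

# Mapping copied from the environment-variable table: --chatbot value -> variable.
REQUIRED_KEY = {
    "openai": "OPENAI_API_KEY",
    "gemini": "GEMINI_API_KEY",
    "lambda": "LAMBDA_API_KEY",
    "deepinfra": "DEEPINFRA_API_KEY",
}

def missing_key(chatbot: str, env: Optional[Mapping[str, str]] = None) -> Optional[str]:
    """Return the name of the unset variable for `chatbot`, or None if nothing is missing."""
    env = os.environ if env is None else env
    var = REQUIRED_KEY.get(chatbot)  # backends absent from the table (e.g. ollama) need no key
    if var is None or env.get(var):
        return None
    return var

if __name__ == "__main__":
    for backend in REQUIRED_KEY:
        name = missing_key(backend)
        print(backend, "ok" if name is None else f"missing {name}")
```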

You can store these exports in ~/.zshrc or your preferred shell profile so they persist across sessions. The local Ollama backend expects a server listening at http://localhost:11434 (see the OllamaAPI wrapper); update the wrapper if your host or port differs.
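
If you use the local backend, a quick reachability check can save a failed run. This is a hedged sketch using only the standard library. The helper name ollama_is_up is hypothetical, and the check assumes the server answers a plain GET on its root URL with HTTP 200, which a running Ollama server typically does.

```python
import urllib.request

def ollama_is_up(base_url: str = "http://localhost:11434", timeout: float = 2.0) -> bool:
    """Return True if an HTTP server answers with status 200 at base_url."""
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as resp:
            return resp.status == 200
    except OSError:
        # URLError, refused connections, and timeouts all derive from OSError.
        return False

if __name__ == "__main__":
    print("Ollama reachable:", ollama_is_up())
```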

Running ATHENA Pipelines

ATHENA exposes a CLI via main.py. Key parameters include the task (travel-mode or vaccine), chatbot backend, model identifier, optimisation iterations, and TextGrad steps.

Travel-mode (SwissMetro)

python main.py \
  --task travel-mode \
  --chatbot ollama \
  --model deepseek-r1:32b \
  --results-dir results-travel \
  --persona-csv cache/persona_travel.csv \
  --selected-pids-file data/swissmetro/selected_pids_subset_100.txt

The command performs group-level symbolic utility discovery, saves intermediate CSVs under results-travel, and optimises personalised prompts with TextGrad. Persona strings are cached to cache/persona_travel.csv to allow resumable runs.
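
When scripting several runs from Python, the same invocation can be assembled programmatically. The sketch below uses only flags that appear in this README; the helper name build_command is hypothetical, and the subprocess call is left commented out because it starts a full pipeline run.

```python
import sys

def build_command(task, chatbot, model, results_dir, selected_pids_file, persona_csv=None):
    """Assemble the argv list for main.py from flags documented in this README."""
    cmd = [
        sys.executable, "main.py",
        "--task", task,
        "--chatbot", chatbot,
        "--model", model,
        "--results-dir", results_dir,
        "--selected-pids-file", selected_pids_file,
    ]
    if persona_csv is not None:  # only the travel-mode example passes this flag
        cmd += ["--persona-csv", persona_csv]
    return cmd

cmd = build_command(
    task="travel-mode",
    chatbot="ollama",
    model="deepseek-r1:32b",
    results_dir="results-travel",
    selected_pids_file="data/swissmetro/selected_pids_subset_100.txt",
    persona_csv="cache/persona_travel.csv",
)
# import subprocess; subprocess.run(cmd, check=True)  # run from the repository root
```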

Vaccine-choice

python main.py \
  --task vaccine \
  --chatbot deepinfra \
  --model deepseek-r1:32b \
  --results-dir results-vaccine \
  --selected-pids-file data/vaccine/selected_pids.txt

The vaccine pipeline reuses the same CLI knobs except --persona-csv, which is not required. By default the run resumes from the latest iteration found in results-vaccine and skips recomputation when final CSVs already exist.

Resuming & Customising

To pick up from previous progress, leave --results-dir pointing to an existing directory; the pipeline auto-detects the last completed iteration.
Adjust --top-k, --bottom-k, and --n-candidates to control exploration versus exploitation in symbolic search.
--tg-steps governs the number of TextGrad updates per individual.
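
To inspect how far a previous run got before resuming, you can scan the results directory for the group-level CSV files described under Logging & Outputs. The sketch below is an illustration of one way to find the latest iteration per group. It assumes that the group id and iteration in the file names are integers, and it is not necessarily how main.py performs its own detection.

```python
import re
import tempfile
from pathlib import Path

PATTERN = re.compile(r"utility_functions_results_group_(\d+)_iteration_(\d+)\.csv$")

def last_completed_iterations(results_dir):
    """Map each group id to the highest iteration number found on disk."""
    latest = {}
    for path in Path(results_dir).glob("utility_functions_results_group_*_iteration_*.csv"):
        m = PATTERN.match(path.name)
        if m:
            gid, k = int(m.group(1)), int(m.group(2))
            latest[gid] = max(k, latest.get(gid, k))
    return latest

# Demonstration on a temporary directory with empty placeholder files.
with tempfile.TemporaryDirectory() as tmp:
    for name in (
        "utility_functions_results_group_0_iteration_1.csv",
        "utility_functions_results_group_0_iteration_3.csv",
        "utility_functions_results_group_1_iteration_2.csv",
    ):
        (Path(tmp) / name).touch()
    latest = last_completed_iterations(tmp)

print(latest)
```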

Persona Cache Utility

For vaccine experiments you can pre-compute personas to avoid repeated API calls:

python save_persona.py --results-dir results-vaccine --selected-pids-file data/vaccine/selected_pids.txt

This populates results-vaccine/persona.csv, which main.py --task vaccine will reuse.
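
The idea behind the cache is a simple compute-once pattern: look up each participant in the CSV and generate only the personas that are missing. The sketch below illustrates that pattern under stated assumptions. The column names pid and persona, the helper names, and the fake_generate stand-in for an LLM call are all hypothetical, and the real persona.csv layout may differ.

```python
import csv
import tempfile
from pathlib import Path

def load_cache(path):
    """Read a pid -> persona mapping, or return an empty dict if no cache exists."""
    p = Path(path)
    if not p.exists():
        return {}
    with p.open(newline="", encoding="utf-8") as f:
        return {row["pid"]: row["persona"] for row in csv.DictReader(f)}

def get_personas(pids, path, generate):
    """Return personas for pids, calling generate() only for uncached ones."""
    cache = load_cache(path)
    missing = [pid for pid in pids if pid not in cache]
    for pid in missing:
        cache[pid] = generate(pid)  # the only place an API call would happen
    if missing:
        with Path(path).open("w", newline="", encoding="utf-8") as f:
            writer = csv.DictWriter(f, fieldnames=["pid", "persona"])
            writer.writeheader()
            writer.writerows({"pid": k, "persona": v} for k, v in cache.items())
    return {pid: cache[pid] for pid in pids}

calls = []
def fake_generate(pid):
    calls.append(pid)  # record which pids triggered a (fake) generation
    return f"persona for {pid}"

with tempfile.TemporaryDirectory() as tmp:
    cache_file = Path(tmp) / "persona.csv"
    first = get_personas(["1", "2"], cache_file, fake_generate)
    second = get_personas(["1", "2", "3"], cache_file, fake_generate)
```

In the demonstration, the second call generates a persona only for the new participant; the first two are read back from the cache file.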

Baseline Reproduction

The baseline_model/ directory includes reference implementations for zero-shot, few-shot, and TextGrad-only baselines.

Example (SwissMetro zero-shot):

python baseline_model/travel_mode/travel_mode_zero_shot.py --chatbot openai --model gpt-4o-mini

Example (Vaccine few-shot):

python baseline_model/vaccine/vaccine_fewshot.py --chatbot gemini --model gemini-2.0-flash

All baseline scripts share a similar CLI for selecting the chatbot provider and the output directory.

Logging & Outputs

Logs follow the configuration in athena/config/logging.json and are written to athena/config/logs/ by default.
Group-level optimisation artefacts are stored as CSV files named utility_functions_results_group_{gid}_iteration_{k}.csv inside the chosen results_dir.
Persona caches live inside results_dir/persona.csv.
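
If you write your own analysis scripts and want them to log the same way, one option is to load the same JSON file. The sketch below assumes that athena/config/logging.json follows the schema accepted by the standard library's logging.config.dictConfig, which this README does not confirm. The helper name setup_logging is hypothetical, and the function falls back to a basic configuration when the file is absent.

```python
import json
import logging
import logging.config
from pathlib import Path

def setup_logging(config_path="athena/config/logging.json"):
    """Apply the JSON logging config if present; return True when it was used."""
    path = Path(config_path)
    if path.exists():
        with path.open(encoding="utf-8") as f:
            # Assumption: the file is a dictConfig-style dictionary.
            logging.config.dictConfig(json.load(f))
        return True
    logging.basicConfig(level=logging.INFO)  # fallback when no config file is found
    return False

if __name__ == "__main__":
    used_file = setup_logging()
    logging.getLogger(__name__).info("logging configured (from file: %s)", used_file)
```

Run such a script from the repository root so that any relative log paths in the configuration resolve as intended.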

Citation

@inproceedings{zhao2025athena,
  title        = {Personalized Decision Modeling: Utility Optimization or Textualized-Symbolic Reasoning},
  author       = {Yibo Zhao and Yang Zhao and Hongru Du and Hao Frank Yang},
  booktitle    = {The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS)},
  year         = {2025}
}

License

This project is released under the MIT License.
