This is the official code repository for the paper *Shoot First, Ask Questions Later? Building Rational Agents That Explore and Act Like People* by Gabriel Grand, Valerio Pepe, Joshua B. Tenenbaum, and Jacob Andreas.
```bash
git clone git@github.com:gabegrand/battleship.git
```
This codebase uses Poetry to manage dependencies. If you don't have Poetry installed, you can do so by following the instructions here.
```bash
cd battleship
poetry install
```

> [!NOTE]
> If you also want to install optional development dependencies (e.g., for running unit tests, code formatting, and plotting), you can do so with:
>
> ```bash
> poetry install --with dev
> ```

Once the installation is complete, you can activate the virtual environment:
```bash
# Default with Poetry v2.0
eval $(poetry env activate)

# Alternative with Poetry v1.x or with the poetry-shell plugin
poetry shell
```

For convenience, we also provide a `[build-system]` section in `pyproject.toml`, so you can install the package with pip. We recommend using a virtual environment (e.g., via venv or conda) to avoid dependency conflicts.
```bash
cd battleship
pip install -e .
```

We're currently working on providing detailed documentation for running experiments. In the meantime, here are some example commands to get you started.
```bash
# Start new experiment
python run_spotter_benchmarks.py --llms gpt-4o-mini --spotter-models CodeSpotterModel

# Resume interrupted experiment
python run_spotter_benchmarks.py --resume --experiment-dir {EXPERIMENT_DIR}

# Force restart (clear existing results)
python run_spotter_benchmarks.py --force-restart --experiment-dir {EXPERIMENT_DIR}

# Resume with additional configurations
python run_spotter_benchmarks.py --resume --experiment-dir {EXPERIMENT_DIR} --llms gpt-4o gpt-4o-mini
```

```bash
# Start new captain experiment
python run_captain_benchmarks.py --captains LLMDecisionCaptain --captain-llm gpt-4o-mini --spotter-llm gpt-4o-mini

# Resume interrupted captain experiment
python run_captain_benchmarks.py --resume --experiment-dir {EXPERIMENT_DIR}

# Force restart captain experiment (clear existing results)
python run_captain_benchmarks.py --force-restart --experiment-dir {EXPERIMENT_DIR}

# Resume with additional configurations
python run_captain_benchmarks.py --resume --experiment-dir {EXPERIMENT_DIR} --captains LLMDecisionCaptain RandomCaptain --seeds 42 123
```

The BattleshipQA dataset can be found in `data/human-dataset.csv`. This file contains a compiled summary of all 126 human games.
The raw data, including individual message logs, can be found in `experiments/collaborative/data/battleship-final-data/`. The files `game.csv`, `round.csv`, and `stage.csv` store structured information about player interactions at various levels of granularity. More information about this data format can be found in the Empirica v2 docs.
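To illustrate the game → round → stage hierarchy these files encode, here is a minimal sketch of grouping stages under their parent rounds. The column names (`roundId`, `stageId`, `name`) and the miniature inline CSVs are hypothetical stand-ins, not verified against the actual export; the real files live in the directory above and may use different schemas.

```python
import csv
import io

# Hypothetical miniature stand-ins for round.csv and stage.csv.
round_csv = """roundId,gameId,index
r1,g1,0
r2,g1,1
"""
stage_csv = """stageId,roundId,name
s1,r1,question
s2,r1,move
s3,r2,question
"""

def load(text: str) -> list[dict]:
    """Parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))

rounds = load(round_csv)
stages = load(stage_csv)

# Group stage names under their parent round, mirroring the
# game -> round -> stage hierarchy of the Empirica export.
stages_by_round: dict[str, list[str]] = {}
for stage in stages:
    stages_by_round.setdefault(stage["roundId"], []).append(stage["name"])

print(stages_by_round)  # {'r1': ['question', 'move'], 'r2': ['question']}
```

To work with the real data, replace the inline strings with `open(...)` calls on the corresponding files and adjust the column names as needed.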
The data for the 18 randomly generated board contexts used in the experiments can be found in `experiments/collaborative/contexts/board_BXX.txt`, where `BXX` is the board ID.
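If you want to address these context files programmatically, a small helper like the following may be convenient. It assumes board IDs are zero-padded two-digit numbers (e.g., `B07`), which is an inference from the `BXX` pattern rather than something verified against the repository.

```python
from pathlib import Path

CONTEXTS_DIR = Path("experiments/collaborative/contexts")

def board_path(board_id: int) -> Path:
    """Build the path to a board context file, assuming IDs like 'B07'."""
    return CONTEXTS_DIR / f"board_B{board_id:02d}.txt"

# List whichever board context files are actually present on disk.
available = sorted(CONTEXTS_DIR.glob("board_B*.txt"))

print(board_path(7).name)  # board_B07.txt
```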
If you use this code or data in your research, please cite our paper:
```bibtex
@article{grand2025battleship,
  title={Shoot First, Ask Questions Later? Building Rational Agents That Explore and Act Like People},
  author={Grand, Gabriel and Pepe, Valerio and Tenenbaum, Joshua B. and Andreas, Jacob},
  journal={ArXiv},
  volume={abs/2510.20886},
  year={2025},
  url={https://arxiv.org/abs/2510.20886}
}
```