NNBellman

Description

This project addressed the need for a faster method of generating agent decisions for an agent-based model. The original policy iteration method, while sufficient for a small number of agents was computationally cost prohibitive at the population scales demanded by the ABM. Using scripts documented in this repository, an exhaustive search for suitable architecture and hyperparameters was conducted to train a feedforward multilayer perceptron (MLP) as a replacement for the original iterative method. All data used to train the model and the models and metrics resulting from the hyperparameter grid search are available at doi:10.5281/zenodo.14987728. The code structure for model training was based on the PyTorch Template Project.

Installation

git clone https://github.com/vmgaribay/nnbellman.git
conda env create -f environment.yml

Usage Notes

The code in this repository was admitedly not designed for general reuse but the main files of interest to others may be...

Iterative_vs_NN.py - Used to obtain the performance comparison values (set device to cpu if needed)
GenerateBellmanSample.ipynb - Used to generate the training and testing data
model.py - Experimental neural network achitectures
as_both_grid_search_S21gpu.sh - Example of script used to conduct architecture/hyperparameter grid search
nn_performance_comparison.ipynb - Used to extract best model performance from log files

If cuda is unavailable on your machine, the code may still be run on cpu.

Performance Comparison

The comparison of performance¹ for the original iterative method and the neural network equation mapping with chosen architecture and hyperparameters is listed in the following tables for n agents. The values and numbers in parenthesis respectively represent the mean and standard deviation² of 10 randomly seeded runs.

n=1

Metric	Iterative	MLP
Execution Time (s)	34.80 (5.30)	0.16 (0.15)
CPU Time (s)	34.70 (5.28)	0.12 (0.03)
CPU Memory Usage (MB)	0.05 (0.01)	18.56 (5.31)
GPU Time (s)	34.8 (5.30)	0.16 (0.15)
GPU Memory Usage (MB)	7.67 (2.69)	91.69 (2.69)

n=10

Metric	Iterative	MLP
Execution Time (s)	361.77 (29.53)	0.12 (0.01)
CPU Time (s)	360.77 (29.46)	0.12 (0.00)
CPU Memory Usage (MB)	0.08 (0.00)	16.88 (0.00)
GPU Time (s)	361.77 (29.53)	0.12 (0.01)
GPU Memory Usage (MB)	8.52 (0.00)	92.55 (0.00)

n=100

Metric	Iterative	MLP
Execution Time (s)	3831.86 (347.49)	0.18 (0.06)
CPU Time (s)	3821.24 (346.52)	0.13 (0.00)
CPU Memory Usage (MB)	0.23 (0.00)	16.88 (0.00)
GPU Time (s)	3831.87 (347.49)	0.18 (0.06)
GPU Memory Usage (MB)	8.52 (0.00)	92.55 (0.00)

n=1000

Metric	Iterative	MLP
Execution Time (s)	38957.39 (955.84)	0.53 (0.23)
CPU Time (s)	38852.38 (952.87)	0.19 (0.03)
CPU Memory Usage (MB)	3.01 (2.13)	30.32 (7.08)
GPU Time (s)	38957.58 (955.93)	0.53 (0.23)
GPU Memory Usage (MB)	1.72 (3.59)	85.75 (3.59)

¹ Run on Snellius gcn partition, established Q4 2022
Hardware Information: Lenovo ThinkSystem SD650-N v2
    OS: Red Hat Enterprise Linux 9.4 (Plow)
    CPU: Intel(R) Xeon(R) Platinum 8360Y CPU @ 2.40GHz
        CPU family: 6
        Model: 106
        Thread(s) per core: 1
        Core(s) per socket: 36
        Socket(s): 2
        Stepping: 6
        CPU(s) scaling MHz: 100%
        CPU max MHz: 2400.0000
        CPU min MHz: 800.0000
        DRAM GiB per core: 7.111

    GPU: NVIDIA A100-SXM4-40GB
        Driver Version: 565.57.01
        CUDA Version: 12.7
        Power Cap W: 400

² Caveat: Small/No variation on memory usage metrics may be sign of improper reset between runs; values should still be representative of peak useage.

Context

For more information on the dataset generation and training process please refer to the main manuscript, V. M. Garibay "Accelerated Approximation of Bellman Equation Solutions: Agent Policy Optimization With a Feedforward Neural Network". ICCS (Workshops 2) 2025: 224-238 doi:10.1007/978-3-031-97557-8_17.

Contact

Victoria Garibay, Ph.D. - Contact Form | GitHub Profile

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
BatchRuns		BatchRuns
__pycache__		__pycache__
base		base
data_loader		data_loader
logger		logger
model		model
nn_data		nn_data
saved		saved
trainer		trainer
utils		utils
GenerateBellmanSample.ipynb		GenerateBellmanSample.ipynb
Iterative_vs_NN.py		Iterative_vs_NN.py
LICENSE		LICENSE
Pipfile		Pipfile
PyRun.sh		PyRun.sh
PyTorchTemplateREADME.md		PyTorchTemplateREADME.md
README.md		README.md
environment.yml		environment.yml
model_loader.py		model_loader.py
new_project.py		new_project.py
nn_performance_comparison.ipynb		nn_performance_comparison.ipynb
parse_config.py		parse_config.py
requirements.txt		requirements.txt
tboard_log.py		tboard_log.py
test.py		test.py
testconfig.json		testconfig.json
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NNBellman

Description

Installation

Usage Notes

Performance Comparison

n=1

n=10

n=100

n=1000

Context

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

License

vmgaribay/nnbellman

Folders and files

Latest commit

History

Repository files navigation

NNBellman

Description

Installation

Usage Notes

Performance Comparison

n=1

n=10

n=100

n=1000

Context

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages