TransInferSim - fast analysis of Transformer Network inference

TransInferSim is a cycle-accurate simulator for analyzing the hardware performance of Transformer NN inference on custom systolic-array accelerators. Combined with Accelergy, it reports latency, energy, area, and other efficiency metrics, enabling cache-policy analysis, memory-hierarchy optimization, hardware design-space exploration, and exportable execution plans for RTL validation and deployment.

Features

Analyzes Transformer NN inference on hardware
Integrates with Accelergy for energy estimation
Includes various plugins for Accelergy's flexibility

Reference

If you find our work useful, please refer our paper.

J. Klhufek, A. Marchisio, V. Mrazek, L. Sekanina and M. Shafique, "TransInferSim: Toward Fast and Accurate Evaluation of Embedded Hardware Accelerators for Transformer Networks," in IEEE Access, vol. 13, pp. 177215-177226, 2025, doi: 10.1109/ACCESS.2025.3621062.

@ARTICLE{transinfersim,
  author={Klhufek, Jan and Marchisio, Alberto and Mrazek, Vojtech and Sekanina, Lukas and Shafique, Muhammad},
  journal={IEEE Access}, 
  title={TransInferSim: Toward Fast and Accurate Evaluation of Embedded Hardware Accelerators for Transformer Networks}, 
  year={2025},
  volume={13},
  number={},
  pages={177215-177226},
  keywords={Transformers;Accuracy;Hardware acceleration;Computational modeling;Schedules;Analytical models;Data models;Computer architecture;Memory management;Register transfer level;Transformers;hardware accelerators;modeling tools;memory subsystem;evaluation and optimizations},
  doi={10.1109/ACCESS.2025.3621062}}

Installation

To get started with TransInferSim, follow these steps:

Prerequisites

Python 3.8 or higher

This project requires Graphviz to be installed on your system. On Ubuntu/Debian, you can install it using:

sudo apt-get install graphviz

Clone and build the Repository

Clone the repository and its submodules and build using pip:

git clone --recurse-submodules https://github.com/ehw-fit/TransInferSim
cd TransInferSim
python3 -m venv venv
source venv/bin/activate
pip install --upgrade pip wheel setuptools
./scripts/setup_submodules.sh
pip install .

Usage

You can find an example run in the example.py script, which demonstrates how to instantiate a transformer model or layer of your choice along with a showcase of an example hardware specification. The script then runs an inference simulation, and the runtime performance statistics are saved to a stats_out.txt file.

Licence

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.github/workflows		.github/workflows
accelergy @ 6911d15		accelergy @ 6911d15
accelergy_plugins		accelergy_plugins
analyzer		analyzer
scripts		scripts
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
compound_components.yaml		compound_components.yaml
example.py		example.py
overall.jpg		overall.jpg
requirements.txt		requirements.txt
setup.py		setup.py
stats_out.txt		stats_out.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TransInferSim - fast analysis of Transformer Network inference

Features

Reference

Installation

Prerequisites

Clone and build the Repository

Usage

Licence

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

ehw-fit/TransInferSim

Folders and files

Latest commit

History

Repository files navigation

TransInferSim - fast analysis of Transformer Network inference

Features

Reference

Installation

Prerequisites

Clone and build the Repository

Usage

Licence

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages