Concurrent completion benchmarks for the autoagents framework alongside LangGraph and CrewAI agents.
All runners read their workload settings from benchmark.yaml (or from a path supplied via the BENCH_CONFIG environment variable). Update that single file to change the request count, concurrency, model, or prompt template, and the change applies to every language runner.
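For reference, a minimal sketch of how a Python runner might load that shared config. The key names (`requests`, `concurrency`, `model`, `prompt_template`) and defaults below are illustrative assumptions, not the actual `benchmark.yaml` schema.

```python
# Illustrative sketch only: key names and defaults are assumptions,
# not the real benchmark.yaml schema used by this repo.
import os
import yaml  # PyYAML

def load_bench_config(default_path: str = "benchmark.yaml") -> dict:
    """Read workload settings from benchmark.yaml, or from BENCH_CONFIG if set."""
    path = os.environ.get("BENCH_CONFIG", default_path)
    with open(path) as f:
        return yaml.safe_load(f)

cfg = load_bench_config()
num_requests = cfg.get("requests", 250)        # assumed key
concurrency = cfg.get("concurrency", 100)      # assumed key
model = cfg.get("model", "gpt-4o-mini")        # assumed key
prompt_template = cfg.get("prompt_template")   # assumed key
```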
All runners require an OPENAI_API_KEY that can call the configured models.
The benchmarks are written in Rust and Python. The Rust benchmarks use the autoagents framework, while the Python benchmarks use LangGraph and CrewAI agents. They are designed to measure how these agents perform when processing large amounts of data concurrently. If you think the benchmarks are inaccurate or you have suggestions, please open an issue or submit a pull request.
The benchmark below runs 250 parallel requests through a ReAct-style agent that processes a Parquet file and calculates the average duration.
The benchmark below runs 100 parallel requests through a ReAct-style agent that processes a Parquet file and calculates the average duration. This variant uses the agent's structured output to evaluate whether the generated value is correct.
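As a rough illustration of what these runs measure, here is a hedged sketch of the fan-out pattern in Python: launch N agent requests concurrently, bound them with a semaphore, time each one, and (for the structured-output variant) compare the parsed answer against a known ground truth. `run_react_agent` and `EXPECTED_AVERAGE` are placeholder stand-ins, not APIs from this repository.

```python
# Sketch of the concurrency pattern the benchmarks exercise; the agent call
# and ground-truth value are illustrative placeholders.
import asyncio
import time

EXPECTED_AVERAGE = 15.0  # assumed ground-truth value from the Parquet file

async def run_react_agent(request_id: int) -> dict:
    # Stand-in for the real ReAct agent call (autoagents / LangGraph / CrewAI).
    await asyncio.sleep(0.1)
    return {"average_duration": EXPECTED_AVERAGE}

async def run_one(request_id: int, sem: asyncio.Semaphore) -> tuple[float, bool]:
    async with sem:
        start = time.perf_counter()
        result = await run_react_agent(request_id)
        elapsed = time.perf_counter() - start
        # Structured-output variant: check the parsed answer against the
        # known ground truth instead of inspecting free-form text.
        correct = result["average_duration"] == EXPECTED_AVERAGE
        return elapsed, correct

async def main(n_requests: int = 100, concurrency: int = 100) -> None:
    sem = asyncio.Semaphore(concurrency)
    results = await asyncio.gather(*(run_one(i, sem) for i in range(n_requests)))
    latencies = [t for t, _ in results]
    accuracy = sum(ok for _, ok in results) / n_requests
    print(f"mean latency {sum(latencies) / len(latencies):.2f}s, accuracy {accuracy:.0%}")

if __name__ == "__main__":
    asyncio.run(main())
```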
Rust (autoagents):

```bash
export OPENAI_API_KEY=sk-your-key
cargo run --release
```

Python (LangGraph / CrewAI):

```bash
export OPENAI_API_KEY=sk-your-key
# Using uv (recommended) or your preferred Python runner
uv run main.py
```

Python files are in the `_src` folder and Rust files in `src`.
- Run the 300-request benchmark for AutoAgents
- Run the 300-request benchmark for LangGraph and CrewAI
- Add memory to the benchmarks

