Plasmate Benchmarks

Reproducible benchmarks for Plasmate - the browser engine for AI agents.

Run these yourself. Verify our claims. No trust required.

Quick Start

# Install Plasmate
pip install plasmate
# or: cargo install plasmate
# or: npm install -g plasmate

# Clone this repo
git clone https://github.com/plasmate-labs/plasmate-benchmarks
cd plasmate-benchmarks

# Run the full benchmark
./run-benchmark.sh

What We Claim

Metric	Claim	How to Verify
Token compression	16.6x fewer tokens vs raw HTML	`./run-benchmark.sh`
Cost savings	~94% reduction at GPT-4 rates	`./run-cost-analysis.sh`
Speed	50x faster than Chrome headless	`./compare-chrome.sh`
Memory	~30MB vs Chrome's 300-500MB	`./compare-chrome.sh`

Benchmark URLs

We test against 49 real-world websites spanning:

News sites (CNN, BBC, Reuters, TechCrunch)
E-commerce (Amazon, eBay, Etsy)
Documentation (MDN, Python docs, Rust docs)
Social platforms (Reddit, HN, GitHub)
SaaS dashboards (Stripe, Vercel, Netlify)
Government sites (USA.gov, NHS)
Wikipedia, Stack Overflow, and more

See urls.txt for the full list.

run-benchmark.sh

Runs Plasmate against all 49 URLs and compares SOM token count vs raw HTML token count.

Output:

Per-URL token counts (HTML vs SOM)
Compression ratios
Aggregate statistics (mean, median, total)
Results saved to results/benchmark-YYYY-MM-DD.json

run-cost-analysis.sh

Calculates the dollar cost of processing these pages with different LLMs:

GPT-4 ($30/1M input tokens)
GPT-4o ($2.50/1M input tokens)
Claude 3.5 Sonnet ($3/1M input tokens)

Shows the cost difference between Chrome (raw HTML tokens) and Plasmate (SOM tokens) at scale (1K, 10K, 100K, 1M pages/month).

compare-chrome.sh

Requires Chrome/Chromium installed. Runs the same URLs through both:

Chrome headless (dumps full DOM)
Plasmate (outputs SOM)

Measures and compares:

Wall clock time
Peak memory usage
Output size (bytes)
Token count

Methodology

Token counting uses tiktoken with the cl100k_base encoding (GPT-4 tokenizer)
Each URL is fetched fresh (no caching)
Chrome uses --headless=new --dump-dom
Plasmate uses plasmate fetch <url> (outputs SOM JSON to stdout)
Memory measured via /usr/bin/time -v (Linux) or command time -l (macOS)

Contributing

Found a URL that breaks Plasmate? Open an issue with the URL and we'll investigate.

Want to add URLs to the benchmark? PRs welcome - just add to urls.txt.

License

Apache 2.0 - same as Plasmate itself.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
results		results
scripts		scripts
LICENSE		LICENSE
README.md		README.md
compare-chrome.sh		compare-chrome.sh
run-benchmark.sh		run-benchmark.sh
run-cost-analysis.sh		run-cost-analysis.sh
urls.txt		urls.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Plasmate Benchmarks

Quick Start

What We Claim

Benchmark URLs

run-benchmark.sh

run-cost-analysis.sh

compare-chrome.sh

Methodology

Contributing

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Plasmate Benchmarks

Quick Start

What We Claim

Benchmark URLs

run-benchmark.sh

run-cost-analysis.sh

compare-chrome.sh

Methodology

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages