Auto Throttle is a lightweight, adaptive concurrency control library designed for Spring Boot applications. Unlike traditional rate limiters that require static configuration, Auto Throttle dynamically adjusts concurrency limits in real time based on observed request round-trip times (RTT), protecting the application from overload while maximizing throughput.
In distributed systems, static rate limiting is often insufficient because the capacity of a service fluctuates depending on downstream dependencies, database performance, and garbage collection pauses.
Auto Throttle implements the TCP Vegas congestion avoidance algorithm, adapted for application-layer concurrency control. It treats the service as a network pipe, measuring the round-trip time of requests to detect queuing delay. When latency increases, it gracefully reduces the concurrency limit; when latency recovers, it explores higher limits.
- Adaptive Control: Automatically finds the optimal concurrency limit without manual tuning.
- TCP Vegas Algorithm: Uses queue size estimation based on minimum RTT vs. current RTT.
- Zero-Overhead: Built on Java 21 Virtual Threads and lock-free atomic primitives for nanosecond-level performance.
- Fail-Fast: Immediately rejects excess traffic with HTTP 503 Service Unavailable to prevent cascading failures.
- Observability: Seamless integration with Spring Boot Actuator and Micrometer.
- Java 21 or later
- Spring Boot 3.2 or later
Add the dependency to your build.gradle.kts:
```kotlin
dependencies {
    implementation("io.github.tenoenc:auto-throttle-starter:1.0.0")
}
```

Once the dependency is added, Auto Throttle is active by default. It intercepts incoming HTTP requests using a Servlet Filter.
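Because the starter auto-configures its Servlet Filter, no throttling code appears in your application. The class below is a generic illustration of this zero-configuration setup; the class name is a placeholder, not part of the library:

```java
import org.springframework.boot.SpringApplication;
import org.springframework.boot.autoconfigure.SpringBootApplication;

// Hypothetical application class: the starter registers its filter
// automatically, so nothing Auto-Throttle-specific is written here.
@SpringBootApplication
public class DemoApplication {
    public static void main(String[] args) {
        SpringApplication.run(DemoApplication.class, args);
    }
}
```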
While the library is designed to work with zero configuration, you can tune the algorithm parameters in application.yml if necessary.
```yaml
auto-throttle:
  # Enable or disable the limiter (default: true)
  enabled: true
  # The time window for aggregating statistics (default: 100ms)
  window-size-ms: 100
  # TCP Vegas Alpha: The minimum expected queue size (default: 3)
  # If the estimated queue is smaller than this, the limit increases.
  alpha: 3
  # TCP Vegas Beta: The maximum expected queue size (default: 6)
  # If the estimated queue is larger than this, the limit decreases.
  beta: 6
```
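Since these keys are ordinary Spring Boot properties, they can also be supplied programmatically instead of via application.yml. A minimal sketch, with an illustrative class name and example values:

```java
import org.springframework.boot.autoconfigure.SpringBootApplication;
import org.springframework.boot.builder.SpringApplicationBuilder;

@SpringBootApplication
public class TunedApplication {
    public static void main(String[] args) {
        // Same keys as in application.yml; the values here are only examples.
        new SpringApplicationBuilder(TunedApplication.class)
                .properties("auto-throttle.window-size-ms=200",
                            "auto-throttle.alpha=3",
                            "auto-throttle.beta=6")
                .run(args);
    }
}
```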
Auto Throttle integrates with Spring Boot Actuator to provide real-time visibility.
Actuator Endpoint:
```
GET /actuator/autothrottle
```
Response:
```json
{
  "limit": 50,
  "inflight": 12
}
```
Micrometer Metrics: If you use Prometheus or other monitoring systems, the following metrics are exposed:
- `auto.throttle.limit`: The current dynamic concurrency limit.
- `auto.throttle.inflight`: The number of requests currently being processed.
To enable these endpoints, ensure your application.yml includes:
```yaml
management:
  endpoints:
    web:
      exposure:
        include: "autothrottle,prometheus,health"
```
Auto Throttle is designed to be Zero-Overhead. We verified the performance using two different methods: Microbenchmark (JMH) and Load Testing (k6).
How much time does it take to make a decision? Less than 20 nanoseconds.
We measured the core logic performance using JMH (Java Microbenchmark Harness).
| Operation | Throughput (ops/s) | Average Time (ns/op) | Note |
|---|---|---|---|
| Acquire (Decision) | ~160,000,000 | ~6.1 ns | Lock-Free / Zero-Allocation |
| Release (Feedback) | ~75,000,000 | ~13.3 ns | High-Performance RingBuffer |
Result: The overhead is negligible compared to typical HTTP request processing times (10ms+).
Does it actually protect the server under heavy load?
We verified the effectiveness of Auto Throttle using k6 load testing.
- Hardware: Local Dev Machine (Intel i7-8700, 32GB RAM)
- Environment: Spring Boot 3.2 + Java 21 (Virtual Threads)
- Traffic: Ramp up to 3,000 concurrent users (VUs)
- Endpoint: Simulated slow processing (100ms delay)
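For reference, the simulated slow endpoint in this kind of test can be as simple as a handler that sleeps for 100 ms. The controller below is a hypothetical sketch of such an endpoint, not the actual test code:

```java
import org.springframework.web.bind.annotation.GetMapping;
import org.springframework.web.bind.annotation.RestController;

@RestController
public class SlowEndpointController {

    // Simulates 100 ms of downstream work. With virtual threads enabled,
    // the blocking sleep does not tie up a platform thread for the duration.
    @GetMapping("/slow")
    public String slow() throws InterruptedException {
        Thread.sleep(100);
        return "ok";
    }
}
```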
The table below compares the server performance under extreme load.
| Metric | Without Auto-Throttle | With Auto-Throttle | Impact |
|---|---|---|---|
| P95 Latency | 995.64 ms (Severe Lag) | 148.66 ms (Stable) | 6.7x Faster Response |
| System State | Overloaded (Queue Buildup) | Healthy (Fast Failure) | Prevented Cascading Failure |
| Load Shedding | 0 requests rejected | ~46,000 requests rejected | Effective Protection |
Conclusion: Without Auto Throttle, the server suffered from a backlog, causing response times to skyrocket to ~1 second. With Auto Throttle enabled, the server maintained its optimal response time (~150ms) by intelligently shedding excess load (HTTP 503), protecting existing users from degradation.
- Measurement: The library measures the Round-Trip Time (RTT) of every request using a high-performance ring buffer.
- Aggregation: Every 100ms (configurable), it calculates the average RTT and tracks the minimum RTT (minRTT) seen so far.
- Estimation: It calculates the estimated queue size using Little's Law principles derived from TCP Vegas:
QueueSize = CurrentLimit * (1 - minRTT / CurrentRTT)
- Adjustment:
  - If QueueSize < Alpha: The system is underutilized. Increase the limit.
  - If QueueSize > Beta: The system is congested. Decrease the limit.
  - Otherwise: Maintain the current limit.
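To make the rule concrete with example numbers: with CurrentLimit = 50, minRTT = 100 ms, and CurrentRTT = 110 ms, the estimated queue size is 50 * (1 - 100/110) ≈ 4.5, which falls between alpha (3) and beta (6), so the limit is held. The snippet below is a simplified, standalone sketch of this control loop, not the library's actual implementation; the initial limit, bounds, and step sizes are illustrative assumptions.

```java
// Simplified sketch of a TCP-Vegas-style limit adjustment.
// Not the library's actual code; constants are illustrative only.
public final class VegasSketch {

    private static final int ALPHA = 3;        // minimum expected queue size
    private static final int BETA = 6;         // maximum expected queue size
    private static final int MIN_LIMIT = 1;
    private static final int MAX_LIMIT = 1000;

    private double minRttMs = Double.MAX_VALUE;
    private int limit = 20;                    // initial concurrency limit (assumed)

    /** Called once per aggregation window with that window's average RTT. */
    public int onWindow(double avgRttMs) {
        minRttMs = Math.min(minRttMs, avgRttMs);

        // Estimated number of requests queued beyond the no-load baseline.
        double queueSize = limit * (1.0 - minRttMs / avgRttMs);

        if (queueSize < ALPHA) {
            limit = Math.min(MAX_LIMIT, limit + 1);   // underutilized: probe upward
        } else if (queueSize > BETA) {
            limit = Math.max(MIN_LIMIT, limit - 1);   // congested: back off
        }
        // otherwise: keep the current limit
        return limit;
    }
}
```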
Auto Throttle is optimized for high-concurrency scenarios.
If your traffic is very low (e.g., < 50 RPS), the algorithm might not collect enough samples (MIN_SAMPLES) within the default window (100ms).
- Solution: Increase the aggregation window size (window-size-ms) in the configuration.
During the initial startup (Cold Start), request latencies may be higher due to JVM warmup (JIT compilation). The limiter might behave conservatively during this phase but will automatically adapt as the application stabilizes and latencies decrease.
This project is licensed under the MIT License. See the LICENSE file for details.