Token Tracker is a Python module for tracking token usage in Open Web UI. It provides comprehensive monitoring of token consumption, costs, and performance metrics across different LLM providers. The data can be exported to OpenTelemetry-compatible systems or logged to files for analysis.
This is still in development and should be considered early alpha. Some of the settings referenced below don't work yet, notably PII redaction (`TOKEN_TRACKER_REDACT_PII`) and user ID hashing (`TOKEN_TRACKER_HASH_USER_IDS`).
When the project is ready for alpha, there will be a release.
Copy the source into your environment. For Open Web UI, place the source in the backend directory, install Open Web UI first, and then install the package:

```dockerfile
RUN pip3 install --no-cache-dir -e ./token-tracker && \
    # Verify installation
    python3 -c "import token_tracker; print('Token tracker installed successfully')"
```

Then add Token Tracker to backend/open_webui/main.py:
```python
# Add after OPENTELEMETRY section
########################################
#
# TOKEN TRACKER
#
########################################
try:
    from token_tracker.middleware import TokenUsageMiddleware
    from token_tracker.config import TokenTrackerConfig

    config = TokenTrackerConfig.from_env()
    if config.enabled:
        app.add_middleware(TokenUsageMiddleware, config=config)
        log.info("Token tracking middleware enabled")
    else:
        log.info("Token tracking disabled")
except ImportError as e:
    log.error("Token tracker not available: %s", e)
```

Token Tracker is configurable through environment variables.
| Environment Variable | Description | Default |
|---|---|---|
| `TOKEN_TRACKER_ENABLED` | Enable/disable token tracking | `true` |
| `TOKEN_TRACKER_CLIENT_ID` | Client identifier for tracking | `None` |
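For example, these can be set before Open Web UI starts (a minimal sketch; the client ID value is made up):

```python
import os

# Export these in your shell or container in practice;
# TokenTrackerConfig.from_env() (see the main.py snippet above)
# reads them at startup.
os.environ["TOKEN_TRACKER_ENABLED"] = "true"
os.environ["TOKEN_TRACKER_CLIENT_ID"] = "my-deployment"  # example value
```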
OpenTelemetry settings:

| Environment Variable | Description | Default |
|---|---|---|
| `TOKEN_TRACKER_OTEL_SERVICE_NAME` | Service name for telemetry | `token-tracker` |
| `TOKEN_TRACKER_OTEL_ENDPOINT` | OTLP endpoint URL | `OTEL_EXPORTER_OTLP_ENDPOINT` |
| `TOKEN_TRACKER_OTEL_HEADERS` | Headers for OTLP in JSON format | `OTEL_EXPORTER_OTLP_HEADERS` |
| `TOKEN_TRACKER_OTEL_INSECURE` | Allow insecure connections | `true` |
| `TOKEN_TRACKER_OTEL_EXPORT_INTERVAL` | Export interval in seconds | `30` |
File logging settings:

| Environment Variable | Description | Default |
|---|---|---|
| `TOKEN_TRACKER_FILE_LOGGING` | Enable file logging | `false` |
| `TOKEN_TRACKER_LOG_FILE` | Path to log file | `token_usage.log` |
| `TOKEN_TRACKER_LOG_ROTATION` | Log rotation size | `100MB` |
| `TOKEN_TRACKER_LOG_LEVEL` | Logging level | `INFO` |
You can store user inputs and model answers by setting `TOKEN_TRACKER_STORE_SAMPLES` to `true`. Think about the privacy implications before doing so, and make sure you tell your users that you are doing this.
| Environment Variable | Description | Default |
|---|---|---|
| `TOKEN_TRACKER_MAX_BODY_SIZE` | Maximum request body size to process | `10485760` (10MB) |
| `TOKEN_TRACKER_STORE_SAMPLES` | Store prompt/completion samples | `false` |
| `TOKEN_TRACKER_SAMPLE_LENGTH` | Maximum length of stored samples | `500` |
Tracking features:

| Environment Variable | Description | Default |
|---|---|---|
| `TOKEN_TRACKER_ESTIMATE_TOKENS` | Estimate tokens when not provided by API | `true` |
| `TOKEN_TRACKER_TRACK_COSTS` | Track token costs | `true` |
| `TOKEN_TRACKER_TRACK_PERFORMANCE` | Track performance metrics | `true` |
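When `TOKEN_TRACKER_ESTIMATE_TOKENS` is enabled, token counts are estimated if the API response carries no usage data. The module's actual estimator is not documented here; a common heuristic looks roughly like this (illustrative sketch only):

```python
def estimate_tokens(text: str) -> int:
    # Rule of thumb: roughly 4 characters per token for English text.
    # This is NOT token_tracker's real estimator, just an illustration
    # of the kind of fallback such a setting implies.
    return max(1, len(text) // 4)

print(estimate_tokens("Hello, how are you today?"))  # -> 6
```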
If you provide a `TOKEN_TRACKER_PRICING_FILE`, it must be UTF-8 encoded.
| Environment Variable | Description | Default |
|---|---|---|
| `TOKEN_TRACKER_PRICING_JSON` | Pricing configuration in JSON format | `{}` |
| `TOKEN_TRACKER_PRICING_FILE` | Path to JSON file with pricing data | `None` |
Endpoint settings:

| Environment Variable | Description | Default |
|---|---|---|
| `TOKEN_TRACKER_MONITORED_ENDPOINTS` | List of endpoints to monitor (JSON array) | See default list below |
| `TOKEN_TRACKER_EXCLUDED_ENDPOINTS` | List of endpoints to exclude (JSON array) | `[]` |
Default monitored endpoints:
```json
[
    "/api/chat/completions",
    "/api/chat/completed",
    "/api/v1/chat/completions",
    "/chat/completions",
    "/v1/chat/completions",
    "/api/completions",
    "/v1/completions",
    "/ollama/api/chat",
    "/ollama/api/generate"
]
```
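To monitor a different set of endpoints, supply your own JSON array. A quick sketch (the two paths are taken from the default list above):

```python
import json
import os

# Restrict monitoring to two endpoints; the value must be a JSON array.
os.environ["TOKEN_TRACKER_MONITORED_ENDPOINTS"] = json.dumps(
    ["/api/chat/completions", "/ollama/api/chat"]
)
```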
Async logging settings:

| Environment Variable | Description | Default |
|---|---|---|
| `TOKEN_TRACKER_ASYNC_LOGGING` | Use async logging | `true` |
| `TOKEN_TRACKER_QUEUE_SIZE` | Maximum queue size for async logging | `10000` |
| `TOKEN_TRACKER_FLUSH_INTERVAL` | Flush interval in seconds | `5` |
These settings are placeholders and do not work yet.
| Environment Variable | Description | Default |
|---|---|---|
| `TOKEN_TRACKER_REDACT_PII` | Redact personally identifiable information | `false` |
| `TOKEN_TRACKER_HASH_USER_IDS` | Hash user IDs for privacy | `false` |
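For reference, user ID hashing usually means something like the following (illustrative only; the feature is not implemented yet and this is not the module's code):

```python
import hashlib

def hash_user_id(user_id: str) -> str:
    # One-way hash so usage can still be correlated per user without
    # storing the raw ID. Illustration of what TOKEN_TRACKER_HASH_USER_IDS
    # is intended to do once implemented.
    return hashlib.sha256(user_id.encode("utf-8")).hexdigest()[:16]

print(hash_user_id("user123"))
```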
Token Tracker supports cost tracking for different models and providers. There are three ways to configure pricing:
Create a JSON file with your pricing structure:
```json
{
    "openai": {
        "gpt-4": {"prompt": 0.03, "completion": 0.06},
        "gpt-3.5-turbo": {"prompt": 0.0005, "completion": 0.0015}
    },
    "anthropic": {
        "claude-3-opus": {"prompt": 0.015, "completion": 0.075},
        "claude-3-sonnet": {"prompt": 0.003, "completion": 0.015}
    }
}
```

Then set the environment variable:

```sh
TOKEN_TRACKER_PRICING_FILE=/path/to/your/pricing.json
```
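This README does not state the pricing unit; prices like these are commonly quoted per 1,000 tokens. Assuming that convention, the resulting cost arithmetic would look like this (illustrative sketch, not the module's actual code):

```python
# Assumption: prices are per 1,000 tokens (a common convention,
# not confirmed by this README).
gpt4 = {"prompt": 0.03, "completion": 0.06}  # gpt-4 rates from above

prompt_tokens, completion_tokens = 1200, 350
cost = (prompt_tokens / 1000) * gpt4["prompt"] \
     + (completion_tokens / 1000) * gpt4["completion"]
print(f"${cost:.4f}")  # $0.0570
```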
You can set pricing directly through environment variables:

```sh
TOKEN_TRACKER_PRICE_OPENAI_GPT4_PROMPT=0.03
TOKEN_TRACKER_PRICE_OPENAI_GPT4_COMPLETION=0.06
TOKEN_TRACKER_PRICE_ANTHROPIC_CLAUDE3OPUS_PROMPT=0.015
TOKEN_TRACKER_PRICE_ANTHROPIC_CLAUDE3OPUS_COMPLETION=0.075
```

You can also provide JSON directly, if you like to complicate things 🙂.
```sh
TOKEN_TRACKER_PRICING_JSON='{"openai":{"gpt-4":{"prompt":0.03,"completion":0.06}}}'
```

There are no guarantees, but Token Tracker could be added to projects other than Open WebUI, at least if they use FastAPI. This has not been tested, though.
```python
from fastapi import FastAPI
from token_tracker.middleware import TokenUsageMiddleware

app = FastAPI()
app.add_middleware(TokenUsageMiddleware)
```

You can also log token usage manually:

```python
from token_tracker.logger import get_token_logger
# Get the global logger instance
logger = get_token_logger()

# Log token usage
logger.log_token_usage(
    model="gpt-4",
    prompt_tokens=100,
    completion_tokens=50,
    user_id="user123",
    session_id="session456",
    endpoint="/api/chat/completions",
    duration_ms=1500,
)
```

You can register a custom token counter for a specific model:

```python
from token_tracker.logger import get_token_logger, TokenCounter
# Get components
logger = get_token_logger()
token_counter = TokenCounter(logger.config)

# Register a custom counter for a specific model
def count_my_model_tokens(text):
    # Your custom logic here
    return len(text.split())

token_counter.register_custom_counter("my-custom-model", count_my_model_tokens)
```

Token Tracker automatically exports metrics and traces to your configured OpenTelemetry endpoint. The following metrics are available:
- `token_usage_total`: Counter for token usage (prompt and completion)
- `token_usage_cost`: Counter for token costs
- `token_request_duration`: Histogram for request durations

Traces include detailed information about each request, including token counts, costs, and samples.
Example Docker Compose configuration:

```yaml
version: '3'
services:
  open-webui:
    image: openwebui/open-webui:latest
    environment:
      - TOKEN_TRACKER_ENABLED=true
      - TOKEN_TRACKER_FILE_LOGGING=true
      - TOKEN_TRACKER_LOG_FILE=/data/logs/token_usage.log
      - TOKEN_TRACKER_STORE_SAMPLES=true
      - TOKEN_TRACKER_OTEL_ENDPOINT=http://otel-collector:4317
      - TOKEN_TRACKER_PRICING_FILE=/data/config/pricing.json
```

If you encounter issues with token tracking:
- Check that `TOKEN_TRACKER_ENABLED` is set to `true` (see the sanity check after this list)
- Verify that your endpoints match the monitored endpoints list
- For OpenTelemetry issues, check connectivity to your OTLP endpoint
- Enable debug logging with `TOKEN_TRACKER_LOG_LEVEL=DEBUG`
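As a quick sanity check, you can load the configuration from a Python shell, using only names that appear earlier in this README:

```python
from token_tracker.config import TokenTrackerConfig

# from_env() reads the TOKEN_TRACKER_* variables described above.
config = TokenTrackerConfig.from_env()
print(config.enabled)  # should be truthy when TOKEN_TRACKER_ENABLED=true
```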
You can use regex patterns in your pricing configuration to match model families:
```json
{
    "openai": {
        "regex:gpt-4.*": {"prompt": 0.03, "completion": 0.06},
        "regex:gpt-3.*": {"prompt": 0.0005, "completion": 0.0015}
    }
}
```

Set default pricing for all models from a provider:
```json
{
    "ollama": {
        "*": {"prompt": 0.0001, "completion": 0.0001}
    }
}
```
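How exact names, `regex:` patterns, and the `*` wildcard interact is not specified here. A plausible resolution order might look like this (illustrative sketch, not token_tracker's actual implementation):

```python
import re

def resolve_price(pricing: dict, provider: str, model: str):
    """Resolve a model's price entry: exact name first, then "regex:"
    patterns, then the "*" wildcard. This lookup order is an assumption
    made for illustration."""
    models = pricing.get(provider, {})
    if model in models:
        return models[model]
    for key, price in models.items():
        if key.startswith("regex:") and re.fullmatch(key[len("regex:"):], model):
            return price
    return models.get("*")

pricing = {"openai": {"regex:gpt-4.*": {"prompt": 0.03, "completion": 0.06}}}
print(resolve_price(pricing, "openai", "gpt-4-turbo"))  # matches the regex entry
```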