[token] Reduce DB reads on tracked validations by mfittko · Pull Request #268 · sofatutor/llm-proxy

mfittko · 2026-01-06T20:53:40Z

Summary

Reduce unnecessary DB reads on the uncached request path by ensuring token caching applies to tracked validations, while keeping quota enforcement correct.

Changes

Populate token cache after successful ValidateTokenWithTracking() to avoid repeated token DB lookups.
For cached limited tokens, perform usage increment (quota enforcement) without an extra DB read.
Tighten IncrementTokenUsage DB update gating for is_active and expires_at, and return semantic errors.
Add/adjust tests for limited-token caching behavior and inactive/expired increment behavior.

Testing

make test
make lint

Notes

This targets proxy-side latency under concurrency for uncached calls; limited tokens still require a per-request DB write to enforce max_requests.

- Populate token cache after ValidateTokenWithTracking - For cached limited tokens: increment usage without extra DB read - Tighten DB IncrementTokenUsage gating (inactive/expired/quota) and return semantic errors Testing: make test, make lint

Copilot

Pull request overview

This PR optimizes token validation performance by extending cache usage to tracked validations (previously only used for non-tracking validation). The key insight is that for limited tokens, the cached metadata (active/expiry/project) can be reused while still enforcing max_requests through a synchronous DB write that avoids an extra DB read.

Caches tokens after successful tracked validation to avoid redundant DB reads on subsequent requests
For cached limited tokens, uses cached metadata but still calls IncrementTokenUsage for quota enforcement
Tightens DB-level enforcement by adding is_active and expires_at checks to the UPDATE's WHERE clause, returning semantic errors when these constraints aren't met

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
`internal/token/cache.go`	Implements limited-token caching by calling `IncrementTokenUsage` directly on cache hits; replaces `IsValid()` with new `isCacheableTokenValid()` helper that excludes quota checks; populates cache after successful tracking
`internal/database/token.go`	Adds `is_active` and `expires_at` guards to the UPDATE WHERE clause in `IncrementTokenUsage`; expands error-path SELECT to return `ErrTokenInactive` or `ErrTokenExpired` when appropriate
`internal/token/cache_test.go`	Adds `countingStore` mock and test verifying that limited tokens hit the cache on second call with only one increment and no extra DB read
`internal/token/token_integration_test.go`	Updates `MockStore.IncrementTokenUsage` to match new DB behavior (check active/expired/quota before incrementing)
`internal/database/token_test.go`	Adds test cases verifying `IncrementTokenUsage` returns `ErrTokenInactive` and `ErrTokenExpired` when appropriate

internal/token/cache_test.go

- Use SQL boolean literal TRUE for is_active check (works across sqlite/mysql/postgres) - Tighten cache test assertion per review

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

internal/token/cache.go

internal/database/token.go

- Invalidate cached limited-token entries when IncrementTokenUsage returns rate-limit/inactive/expired - Treat any non-NULL expires_at as authoritative (remove IsZero guard) Testing: go test ./internal/token ./internal/database

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

internal/token/cache_test.go

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings January 6, 2026 20:53

mfittko self-assigned this Jan 6, 2026

Copilot started reviewing on behalf of mfittko January 6, 2026 20:54 View session

Copilot AI reviewed Jan 6, 2026

View reviewed changes

internal/token/cache_test.go Outdated Show resolved Hide resolved

[db] Fix postgres IncrementTokenUsage active check

19f2c45

- Use SQL boolean literal TRUE for is_active check (works across sqlite/mysql/postgres) - Tighten cache test assertion per review

mfittko requested a review from Copilot January 6, 2026 21:18

Copilot started reviewing on behalf of mfittko January 6, 2026 21:19 View session

Copilot AI reviewed Jan 6, 2026

View reviewed changes

internal/token/cache.go Show resolved Hide resolved

internal/database/token.go Outdated Show resolved Hide resolved

[token] Invalidate cache on terminal tracking errors

f32634e

- Invalidate cached limited-token entries when IncrementTokenUsage returns rate-limit/inactive/expired - Treat any non-NULL expires_at as authoritative (remove IsZero guard) Testing: go test ./internal/token ./internal/database

mfittko requested a review from Copilot January 6, 2026 21:31

Copilot started reviewing on behalf of mfittko January 6, 2026 21:31 View session

Copilot AI reviewed Jan 6, 2026

View reviewed changes

internal/token/cache_test.go Outdated Show resolved Hide resolved

internal/token/cache_test.go Outdated Show resolved Hide resolved

Apply suggestions from code review

22e1f26

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

mfittko merged commit dcb6e33 into main Jan 6, 2026
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[token] Reduce DB reads on tracked validations#268

[token] Reduce DB reads on tracked validations#268
mfittko merged 4 commits intomainfrom
perf/reduce-db-reads-tracked-tokens

mfittko commented Jan 6, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

mfittko commented Jan 6, 2026

Summary

Changes

Testing

Notes

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants