Skip to content

Feat/litellm passthrough #199#201

Open
endre82 wants to merge 2 commits intorynfar:mainfrom
endre82:feat/litellm-passthrough
Open

Feat/litellm passthrough #199#201
endre82 wants to merge 2 commits intorynfar:mainfrom
endre82:feat/litellm-passthrough

Conversation

@endre82
Copy link
Copy Markdown

@endre82 endre82 commented Mar 31, 2026

#199

feat: add LiteLLM passthrough adapter with x-litellm-* header detection

  • Add tsx as dev dependency and update supervisor to prefer bun > tsx > npx
  • Detect LiteLLM by user-agent header (litellm/) in addition to x-litellm- headers
  • Force stream=false for all LiteLLM requests (healthchecks don't send x-litellm-* headers)
  • Increase MAX_CONCURRENT_SESSIONS default from 10 to 50
  • Increase rate-limit retry attempts (2→3) and base delay (1s→2s) with exponential backoff
  • Allow rate-limit retry even after partial content was yielded
  • Add DEBUG_PROXY=true flag for detailed error diagnosis
  • Add prefersStreaming() to AgentAdapter interface (unused but available)

endre82 added 2 commits March 31, 2026 10:41
… rate-limit retries

- Add tsx as dev dependency and update supervisor to prefer bun > tsx > npx
- Detect LiteLLM by user-agent header (litellm/*) in addition to x-litellm-* headers
- Force stream=false for all LiteLLM requests (healthchecks don't send x-litellm-* headers)
- Increase MAX_CONCURRENT_SESSIONS default from 10 to 50
- Increase rate-limit retry attempts (2→3) and base delay (1s→2s) with exponential backoff
- Allow rate-limit retry even after partial content was yielded
- Add DEBUG_PROXY=true flag for detailed error diagnosis
- Add prefersStreaming() to AgentAdapter interface (unused but available)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant