-
-
Notifications
You must be signed in to change notification settings - Fork 572
Open
Labels
enhancementNew feature or requestNew feature or request
Description
Routine Checklist
- I have checked for similar issues.
- I have updated to the latest version.
- GPT-Load, v1.4.1 - Latest Version
- I have read the README and confirmed that the current version does not meet my needs.
- I understand and am willing to follow up on this issue, assist with testing, and provide feedback.
- I understand and agree to the above, and I understand that the maintainers have limited time, so issues that do not follow the rules may be ignored or closed directly.
Description
Currently, the framework retries all available keys on any error response type from the backend.
Even then the code is valid and there is no limit issues, when the client sends a misconfigured request (e.g. wrong model name or incorrect parameters), the gpt-load framework sends retry requests using the same wrong payload.
Desired behaviour
The framework would have an additional input field with a list (CSV) of the backend response codes, which should be re-tried with different keys, e.g.:
- Section: "Settings" --> "Key Configuration" --> "Retry codes" (Retry on the following backend response codes only):
401, 403, 429401: Invalid or Missing API Key403: Forbidden or Insufficient Permissions429: Too Many Requests
FakerL and GuJi08233
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request