Skip to content

Implement Straming mode for Medium Model #4

@ASCII125

Description

@ASCII125

The Medium model is slower because it makes more LLM calls. However, this is not intended as a client-facing review—it’s an internal behavior that may seem unusual. A possible solution is to use an AsyncGenerator for the response in chunks.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions