This file provides a summary of each release. For detailed release notes with migration instructions, code examples, and full contributor credits, see the linked documents.
0.5.0 - 2026-02-05
Release 0.5 brings significant improvements to API consistency, OpenAI conformance, provider capabilities, and a major architectural refactoring of all APIs to use FastAPI routers.
- Connectors API for managing MCP server connections (#4263)
- Unified network configuration with TLS/mTLS, proxy, and timeout support for all remote providers (#4748)
- Endpoint authorization via YAML-based access control (#4448)
- Rerankers for hybrid search in vector stores (#4456)
- Response API enhancements including
reasoning.effort,max_output_tokens, andparallel_tool_calls - New providers: Elasticsearch and OCI 26ai vector stores
- PGVector improvements: HNSW/IVFFlat indexes, configurable distance metrics
- FastAPI router migration across all APIs for better OpenAPI docs and validation
- ARM64 container image support (#4474)
| Change | Type | PR |
|---|---|---|
| Post-Training API endpoints restructured (path params) | Hard | #4606 |
Embeddings API rejects explicit null for optional fields |
Hard | #4644 |
| Safety API provider interface changed to request objects | Hard | #4643 |
| Builtin GPU inference provider removed | Hard | #4828 |
| Scope-based endpoint authorization removed | Hard | #4734 |
image_name renamed to distro_name |
Deprecated | #4396 |
| Eval API calling convention uses request objects | Deprecated | #4425 |
vLLM tls_verify moved to network.tls.verify |
Deprecated | #4748 |
See the full release notes for migration instructions and detailed upgrade guide.