Skip to content

Latest commit

 

History

History
36 lines (27 loc) · 2.63 KB

File metadata and controls

36 lines (27 loc) · 2.63 KB

Llama Stack Release Notes

This file provides a summary of each release. For detailed release notes with migration instructions, code examples, and full contributor credits, see the linked documents.

0.5.0 - 2026-02-05

Release 0.5 brings significant improvements to API consistency, OpenAI conformance, provider capabilities, and a major architectural refactoring of all APIs to use FastAPI routers.

Highlights

  • Connectors API for managing MCP server connections (#4263)
  • Unified network configuration with TLS/mTLS, proxy, and timeout support for all remote providers (#4748)
  • Endpoint authorization via YAML-based access control (#4448)
  • Rerankers for hybrid search in vector stores (#4456)
  • Response API enhancements including reasoning.effort, max_output_tokens, and parallel_tool_calls
  • New providers: Elasticsearch and OCI 26ai vector stores
  • PGVector improvements: HNSW/IVFFlat indexes, configurable distance metrics
  • FastAPI router migration across all APIs for better OpenAPI docs and validation
  • ARM64 container image support (#4474)

Breaking Changes

Change Type PR
Post-Training API endpoints restructured (path params) Hard #4606
Embeddings API rejects explicit null for optional fields Hard #4644
Safety API provider interface changed to request objects Hard #4643
Builtin GPU inference provider removed Hard #4828
Scope-based endpoint authorization removed Hard #4734
image_name renamed to distro_name Deprecated #4396
Eval API calling convention uses request objects Deprecated #4425
vLLM tls_verify moved to network.tls.verify Deprecated #4748

See the full release notes for migration instructions and detailed upgrade guide.