This repository was archived by the owner on Jan 7, 2026. It is now read-only.
Releases: notsyncing/azarrot
Releases · notsyncing/azarrot
0.3.0
0.2.0
- Add IPEX-LLM backend
- Support InternVL2 on IPEX-LLM backend with OpenAI chat completion image input
- Support Qwen2 tool calling on IPEX-LLM and OpenVINO backend with OpenAI chat completion tools input
- Support embedding models on IPEX-LLM and OpenVINO backend with OpenAI embedding API
- Support parallel completion requests: concurrent completion requests can be submit on both OpenVINO and IPEX-LLM backends (not batching)
- Add README and changelog