
Conversation


@zmsdev zmsdev commented Aug 27, 2025

LLM_Chat2

ZMS now supports multiple Large Language Model (LLM) providers through an abstract interface. This allows you to use different AI backends including OpenAI, local Ollama deployments, and RAG (Retrieval-Augmented Generation) with Qdrant vector database.

  1. OpenAI - Cloud-based GPT models
  2. Ollama - Local LLM deployment
  3. RAG - Retrieval-Augmented Generation with Qdrant vector database

The interface lives in the new module src/Products/zms/llmapi.py, with its ZMI GUI in src/Products/zms/zpt/object/manage_llm.zpt. For testing on a local system, a simple Docker Compose file is included: src/docker/addons/llm/docker-compose.llm.yml
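The abstract provider interface described above can be sketched roughly as follows. This is an illustrative minimal version, not the actual llmapi.py API; the class and method names (`LLMProvider`, `chat`, `EchoProvider`) are assumptions for the sake of the example.

```python
# Hypothetical sketch of the abstract LLM provider interface.
# Names are illustrative, not taken from llmapi.py.
from abc import ABC, abstractmethod

class LLMProvider(ABC):
    """Common interface each backend (OpenAI, Ollama, RAG) would implement."""

    @abstractmethod
    def chat(self, messages, **options):
        """Send a list of chat messages and return the model's reply text."""

class EchoProvider(LLMProvider):
    """Trivial stand-in backend, useful for testing the plumbing offline."""

    def chat(self, messages, **options):
        # Return the last user message unchanged.
        return messages[-1]["content"]

provider = EchoProvider()
reply = provider.chat([{"role": "user", "content": "Hello ZMS"}])
```

A concrete OpenAI, Ollama, or RAG provider would subclass the same base, so the REST layer can stay backend-agnostic.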

Architecture

User Interface (manage_llm.zpt)
         ↓
REST API (rest_api.py)
         ↓
LLM API (llmapi.py)
         ↓
Provider Factory (_get_provider)
         ↓
    ┌────┴───┬─────────┐
    ↓        ↓         ↓
OpenAI    Ollama      RAG
Provider  Provider    Provider

Configuration guide: https://github.com/zms-publishing/ZMS/blob/fb_chatgpt/docs/llm_configuration.md

@zmsdev zmsdev marked this pull request as draft August 27, 2025 13:45
@zmsdev zmsdev force-pushed the fb_chatgpt branch 2 times, most recently from 3433988 to 03fd17c on August 27, 2025 20:22

zmsdev commented Aug 27, 2025

The implementation is ready for testing.
Unfortunately, any OpenAI API key currently returns an error:

[insufficient_quota] You exceeded your current quota, please check your plan and billing details. For more information on this error, read the docs: https://platform.openai.com/docs/guides/error-codes/api-errors.
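For surfacing errors like the one above to ZMS users, the client side could translate OpenAI error payloads into readable messages. The payload shape below follows OpenAI's documented error format (`{"error": {"code": ..., "message": ...}}`); the `describe_openai_error` helper and its hint table are assumptions, not existing ZMS code.

```python
# Sketch of client-side handling for OpenAI API error payloads, such as
# the insufficient_quota error quoted above. Illustrative only.
def describe_openai_error(payload):
    """Turn an OpenAI error payload into a short, user-facing message."""
    err = payload.get("error", {})
    code = err.get("code", "unknown_error")
    hints = {
        "insufficient_quota": "Check your plan and billing details.",
        "invalid_api_key": "Verify the configured API key.",
        "rate_limit_exceeded": "Slow down requests or raise your limit.",
    }
    hint = hints.get(code, "See the OpenAI error-code docs.")
    return ("[%s] %s %s" % (code, err.get("message", ""), hint)).strip()

example = {"error": {"code": "insufficient_quota",
                     "message": "You exceeded your current quota."}}
message = describe_openai_error(example)
```

Note that insufficient_quota is a billing-side condition, so it appears even with a syntactically valid key.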

@drfho drfho changed the title OpenAI integration LLM integration Jan 18, 2026