Python client for high-performance LLM inference on Apple Silicon.
## Installation

```bash
pip install orchard
```

## Quickstart

```python
from orchard import Client

client = Client()
response = client.chat(
model="meta-llama/Llama-3.1-8B-Instruct",
messages=[{"role": "user", "content": "Hello!"}],
)
print(response.text)
```

## Streaming

```python
for delta in client.chat(model="...", messages=[...], stream=True):
    print(delta.content, end="", flush=True)
```
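
If you also need the complete reply after the stream finishes, a minimal sketch (using only the streaming API shown above) is to accumulate the deltas as they arrive:

```python
# Sketch: print tokens live while also collecting the full reply.
# Assumes delta.content is always a string, as in the example above.
chunks = []
for delta in client.chat(model="...", messages=[...], stream=True):
    chunks.append(delta.content)
    print(delta.content, end="", flush=True)

full_text = "".join(chunks)
```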
model="...",
conversations=[
[{"role": "user", "content": "Question 1"}],
[{"role": "user", "content": "Question 2"}],
],
)
```

## Chat templates

Chat templates and control tokens are loaded from the Pantheon submodule at `orchard/formatter/profiles/`. This provides a single source of truth shared across all Orchard SDKs (Python, Rust, Swift). See that repo for the list of supported model families.
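
As a quick way to see which profiles your installed copy ships with, one option is to list the bundled profile files. This is a sketch, not a documented API: it assumes the profiles directory is packaged as importable data under `orchard.formatter`, and the file names are whatever Pantheon provides.

```python
# Sketch: list the Pantheon profile files bundled with the installed package.
# Assumes orchard/formatter/profiles/ ships as package data; the exact file
# names and extensions are an assumption, not part of the public API.
from importlib import resources

profiles_dir = resources.files("orchard.formatter") / "profiles"
for profile in sorted(profiles_dir.iterdir(), key=lambda p: p.name):
    print(profile.name)
```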

## Requirements

- Python 3.10+
- macOS 14+ (Apple Silicon)
- PIE (Proxy Inference Engine)

## Related projects

- orchard-rs - Rust client
- orchard-swift - Swift client
- Pantheon - Model profiles

## License

Apache-2.0