PyTEI is an unofficial, minimal Python interface for Hugging Face's Text Embeddings Inference (TEI).
PyTEI supports in-memory and persistent caching for text embeddings.
Install the package through pip:
```
pip install pytei-client
```

Alternatively, install the package from source. First, clone the git repository by running:
```
git clone https://github.com/daniel-gomm/PyTEI.git
```

Next, install the repository as a Python package using pip by running the following command from the root directory of the repository:
```
pip install -e .
```

Omit the `-e` flag if you do not want to modify the code.
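To confirm the installation worked, a minimal sanity check is to import the package:

```python
# Minimal sanity check: the import should succeed without errors.
import pytei

print("PyTEI installed successfully")
```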
A prerequisite for using PyTEI is a running Text Embeddings Inference instance, for example a local Docker container running TEI. Such a container can be spun up by running:
```
docker run --gpus all -p 8080:80 \
    -v $PWD/data:/data \
    --pull always ghcr.io/huggingface/text-embeddings-inference:1.6 \
    --model-id Alibaba-NLP/gte-large-en-v1.5
```

For more details, check out the Documentation.
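Before wiring up PyTEI, you can verify that the TEI instance is reachable. Below is a small smoke test using the `requests` library, assuming TEI's standard `/embed` endpoint and the host/port mapping from the command above:

```python
import requests

# POST a single input to TEI's /embed endpoint and inspect the response.
response = requests.post(
    "http://127.0.0.1:8080/embed",
    json={"inputs": "What is Deep Learning?"},
    timeout=10,
)
response.raise_for_status()

# TEI returns a list of embedding vectors, one per input.
embedding = response.json()[0]
print(f"TEI is up. Embedding dimension: {len(embedding)}")
```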
Establish a connection to TEI through a `TEIClient`. The client gives you access to the text-embedding API of the TEI instance:
```python
from pytei import TEIClient

client = TEIClient(url="127.0.0.1:8080/embed")

text_embedding = client.embed("Lorem Ipsum")
text_embedding_batch = client.embed(["Lorem Ipsum", "dolor sit amet", "consectetur adipiscing elit"])
denormalized_embedding = client.embed("Lorem Ipsum", normalize=False)
```
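The returned embeddings can be used directly for similarity computations. Here is a small sketch, assuming the returned embeddings can be converted to NumPy arrays (consult the API documentation for the exact return type):

```python
import numpy as np

from pytei import TEIClient

client = TEIClient(url="127.0.0.1:8080/embed")

vec_a = np.asarray(client.embed("Lorem Ipsum"))
vec_b = np.asarray(client.embed("dolor sit amet"))

# Cosine similarity between the two embedding vectors.
similarity = float(vec_a @ vec_b / (np.linalg.norm(vec_a) * np.linalg.norm(vec_b)))
print(f"Cosine similarity: {similarity:.3f}")
```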
The default configuration uses in-memory caching of embeddings. For persistent caching, use the `DuckDBEmbeddingStore` or implement your own caching solution by extending the `DataStore` base class (see the sketch at the end of this section).

```python
from pytei import TEIClient
from pytei.store import DuckDBEmbeddingStore

persistent_data_store = DuckDBEmbeddingStore(db_path="data/embedding_database.duckdb")
client = TEIClient(embedding_store=persistent_data_store, url="127.0.0.1:8080/embed")

text = "Lorem Ipsum"
# Embeddings are cached the first time a given text is embedded
text_embedding = client.embed(text)
# Previously cached embedding is retrieved from cache
cached_text_embedding = client.embed(text)
# You can explicitly specify to skip writing to cache
skipped_cache_embedding = client.embed("This will not be cached", skip_cache=True)
# PyTEI supports retrieval from cache for partly cached batches
embeddings = client.embed([text, "Previously un-cached text"])
```

For a more detailed description and the full API reference, check out the Documentation.
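To roll your own caching backend, extend the `DataStore` base class. The sketch below is illustrative only: the import path and the `get`/`put` method names are assumptions, so check the Documentation for the actual abstract interface to implement.

```python
import numpy as np

from pytei.store import DataStore  # assumed import path; see the Documentation


class DictEmbeddingStore(DataStore):
    """Toy in-memory cache backed by a plain dict (illustrative sketch)."""

    def __init__(self) -> None:
        self._cache: dict[str, np.ndarray] = {}

    def get(self, key: str) -> np.ndarray:
        # Assumed interface: raise KeyError on a cache miss.
        return self._cache[key]

    def put(self, key: str, embedding: np.ndarray) -> None:
        self._cache[key] = embedding
```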