GitHub - hlquery/python-api: Python client library for hlquery with modular APIs, auth support, and type- safe responses.

A modular Python client library for hlquery, designed with a familiar and intuitive API structure.

Features

Modular Architecture: Clean separation of concerns with organized classes
Intuitive API: Familiar and easy-to-use structure
Authentication Support: Bearer token and X-API-Key authentication
Flexible Parameters: Support for multiple parameter formats
Auto-detection: Automatically detects searchable fields when not specified
Type-safe Responses: Response objects with helper methods
Comprehensive Validation: Input validation for all operations
No External Dependencies: Uses Python's built-in urllib and json modules
Professional Structure: Well-organized and modular architecture

Installation

Core client usage has no dependencies beyond Python's standard library. PDF parsing uses the optional PyPDF2 package.

Simply import the client:

from lib import Client

Or install as a package:

pip install -e .

Optional PDF Support

Install the optional PDF parser when you want to index local PDF files directly through the Python client:

pip install PyPDF2

CSV support is built in and does not require any extra package.

Quick Start

Basic Usage

from lib import Client

# Initialize client
client = Client('http://localhost:9200')

# Health check
health = client.health()
print(f"Status: {health.get_status_code()}")

# List collections
collections = client.list_collections(0, 10)
if collections.is_success():
    body = collections.get_body()
    print(f"Found {len(body.get('collections', []))} collections")

With Authentication

# Method 1: Set token in constructor
client = Client('http://localhost:9200', {
    'token': 'your_token_here',
    'auth_method': 'bearer'  # or 'api-key'
})

# Method 2: Set token dynamically
client = Client('http://localhost:9200')
client.set_auth_token('your_token_here', 'bearer')

# Method 3: Use X-API-Key
client.set_auth_token('your_token_here', 'api-key')

Reduce Text Example

If the ai_search module is enabled, you can use the raw request helper to ask hlquery to summarize a stored document:

summary = client.execute_request(
    "GET",
    "/modules/ai_search/talk",
    query_params={
        "q": "summarize onboarding guide in docs",
        "run": "true",
    },
)

print(summary.get_body())

Architecture

Core Classes

`Client`

Main client class that provides access to all API operations.

`Request`

Handles HTTP requests, authentication, and error handling.

`Response`

Response wrapper with helper methods:

get_status_code() - Get HTTP status code
get_body() - Get response body
is_success() - Check if request was successful
is_error() - Check if request failed
get_error() - Get error message
to_dict() - Convert to dictionary format (for backward compatibility)

API Classes

Collections manages collections. Documents handles document CRUD and import. Search handles search requests and flexible parameter input.

Utilities

Auth provides token helpers and validation. Config handles defaults and URL/config normalization. Validator validates request input before it is sent.

Exceptions

HlqueryException is the base type. Use AuthenticationException, RequestException, ValidationException, CollectionException, DocumentException, and SearchException for specific failures.

API Methods

System APIs

`health()`

Check server health status.

health = client.health()
if health.is_success():
    body = health.get_body()
    print(f"Status: {body.get('status')}")

`stats()`

Get server statistics.

stats = client.stats()

`info()`

Get server information.

info = client.info()

Collections API

Using the Collections API Object

collections = client.collections_api()

# List collections
result = collections.list(0, 10)

# Get collection
result = collections.get('my_collection')

# Create collection
result = collections.create('new_collection', schema)

# Delete collection
result = collections.delete('collection_name')

# Get formatted fields
result = collections.get_fields('my_collection')

Convenience Methods

# List collections
collections = client.list_collections(0, 10)

# Get collection details
collection = client.get_collection('my_collection')

# Get collection fields (formatted)
fields = client.get_collection_fields('my_collection')

Documents API

Using the Documents API Object

documents = client.documents_api()

# List documents
result = documents.list('collection', {'offset': 0, 'limit': 10})

# Get document
result = documents.get('collection', 'doc_id')

# Add document
result = documents.add('collection', document)

# Update document
result = documents.update('collection', 'doc_id', document)

# Delete document
result = documents.delete('collection', 'doc_id')

# Bulk import
result = documents.import_documents('collection', [doc1, doc2, doc3])

# Parse and add a local PDF file
result = documents.add_pdf('collection', './files/report.pdf')

# Parse and add a local CSV file
result = documents.add_csv('collection', './files/report.csv')

PDF Parsing

documents.add_pdf(collection, file_path, options=None) parses a local PDF file, normalizes the extracted text, and submits it as a regular document to hlquery.

result = client.documents_api().add_pdf('reports', './files/q1-report.pdf', {
    'id': 'report_q1',
    'document': {
        'source_type': 'pdf'
    }
})

The generated document includes:

title
content
file_name
file_path
mime_type
page_count
file_size_bytes
normalized PDF metadata fields such as pdf_meta_author

Note: hlquery currently rejects commas in string field values. The PDF helper replaces commas with spaces before indexing so extracted text can be stored successfully.

CSV Parsing

documents.add_csv(collection, file_path, options=None) parses a local CSV file, flattens rows into normalized text, and submits it as a regular document to hlquery.

result = client.documents_api().add_csv('reports', './files/metrics.csv', {
    'id': 'metrics_q1',
    'document': {
        'source_type': 'csv'
    }
})

The generated document includes:

title
content
file_name
file_path
mime_type
row_count
column_count
columns

CSV parsing uses the Python standard library only. Cells are flattened into plain text, and commas are replaced with spaces before indexing to satisfy hlquery field restrictions.

Search API

Using the Search API Object

search = client.search_api()

# Simple search
results = search.search('collection', {
    'q': 'search query',
    'query_by': 'title,content',
    'limit': 10
})

# Multi-search
results = search.multi_search([
    {'collection': 'col1', 'q': 'query1'},
    {'collection': 'col2', 'q': 'query2'}
])

Convenience Method

# Search documents
results = client.search('collection', {
    'q': 'search query',
    'query_by': 'title,content',
    'limit': 10
})

Ranking helpers

hlquery.utils.ranker exposes a reusable compute_rank_signal helper (for rank_signal, popularity_score, hit_log) plus attach_rank_sort so you can sort by the boosted field across all clients.

from hlquery.utils import compute_rank_signal, attach_rank_sort

params = {'q': 'guide'}
signal = compute_rank_signal(popularity_score, hit_log)
params['rank_signal'] = signal
attach_rank_sort(params)  # ensures sort_by=rank_signal:desc
results = client.search('collection', params)

Search Parameters

The search() method accepts flexible parameters:

Query Parameters

q - Query string (supports field clauses, OR, NOT, phrases, and wildcards)
- Examples: q="title:laptop", q="title:laptop OR title:notebook", q="title:laptop NOT title:refurbished", q="\"wireless keyboard\"", q="laptop*"
query_by - Fields to search in (string or list)
query - Structured query object

Pagination

from / offset - Starting offset
size / limit - Number of results
page - Page number (alternative to offset)
per_page - Results per page

Filtering & Sorting

filter_by - Filter conditions (string), for example filter_by="price:>100&&category:electronics"
filter - Filter object
sort_by - Sort fields (string or list)
sort - Sort specification

Faceting

facet_by - Facet fields (string or list)
facets - Facet specification

API Aliases

`indices(params=None)`

Alias for list_collections().

collections = client.indices({'offset': 0, 'limit': 10})

`get(params)`

Alias for get_collection() or get_document().

# Get document
doc = client.get({'index': 'collection', 'id': 'doc_id'})

# Get collection
collection = client.get({'index': 'collection'})

`cat(cat_type='indices', params=None)`

Cat API for listing collections and system information.

indices = client.cat('indices', {'limit': 10})

Requirements

Python >= 3.6
No external dependencies (uses built-in urllib and json)

Installation

This package can be installed via pip:

pip install -e .

Or add to your requirements.txt:

# No external dependencies required

Architecture

For detailed information about the library architecture, design decisions, and internal structure, see STRUCTURE.md.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
__pycache__		__pycache__
examples		examples
lib		lib
utils		utils
LICENSE		LICENSE
README.md		README.md
example.py		example.py
requirements.txt		requirements.txt
setup.py		setup.py

Folders and files

Latest commit

History

Repository files navigation

Features

Installation

Optional PDF Support

Quick Start

Basic Usage

With Authentication

Reduce Text Example

Architecture

Core Classes

Client

Request

Response

API Classes

Utilities

Exceptions

API Methods

System APIs

health()

stats()

info()

Collections API

Using the Collections API Object

Convenience Methods

Documents API

Using the Documents API Object

PDF Parsing

CSV Parsing

Search API

Using the Search API Object

Convenience Method

Ranking helpers

Search Parameters

Query Parameters

Pagination

Filtering & Sorting

Faceting

API Aliases

indices(params=None)

get(params)

cat(cat_type='indices', params=None)

Requirements

Installation

Architecture

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`Client`

`Request`

`Response`

`health()`

`stats()`

`info()`

`indices(params=None)`

`get(params)`

`cat(cat_type='indices', params=None)`

Packages