Graphora API

License: MIT · Python · FastAPI · Neo4j · PRs Welcome

Transform documents into knowledge graphs with AI

Quick Start (5 minutes)

Get started with Graphora in minutes, not hours. Choose your preferred option:

Option 1: Try the Demo (30 seconds)

Visit demo.graphora.io - no signup required. Upload a document and see the knowledge graph extraction in action.

Option 2: Google Colab (2 minutes)

Open our quickstart notebook in Google Colab and run it in your browser.

Option 3: CLI (5 minutes)

Extract knowledge graphs from the command line:

# Install
pip install graphora[cli]

# Extract
graphora extract document.pdf --output graph.json

That's it! No database setup, no LLM keys required to get started.
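Once the extraction finishes, the result can be inspected like any other JSON file. A minimal sketch: the `{"nodes": [...], "edges": [...]}` shape shown here is an assumption for illustration, so check the actual `graph.json` from your own run.

```python
import json

def summarize(graph: dict) -> tuple[int, int]:
    """Return (node_count, edge_count) for a loaded graph."""
    return len(graph.get("nodes", [])), len(graph.get("edges", []))

# In practice: graph = json.load(open("graph.json"))
sample = {
    "nodes": [{"id": "n1", "label": "Person", "name": "Ada"}],
    "edges": [],
}
print(summarize(sample))  # (1, 0)
```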

Why Graphora?

| Feature | Graphora | LangChain GraphTransformer | Microsoft GraphRAG |
| --- | --- | --- | --- |
| Zero-config start | ✅ Yes | ⚠️ Partial | ❌ No |
| Auto schema inference | ✅ Yes | ❌ No | ❌ No |
| Quality validation | ✅ Yes | ❌ No | ❌ No |
| Human review workflow | ✅ Yes | ❌ No | ❌ No |
| Visual schema builder | ✅ Yes | ❌ No | ❌ No |
| Schema chat copilot | ✅ Yes | ❌ No | ❌ No |
| Entity deduplication | ✅ Yes (Splink) | ⚠️ Partial | ✅ Yes |

Features

  • AI-powered extraction: Advanced LLM-driven entity and relationship extraction from unstructured documents
  • Multi-format support: Process PDFs, Word docs, text files, and more
  • Visual schema builder: Design your ontology with an intuitive drag-and-drop interface
  • Schema chat copilot: Natural language conversations with streaming responses to refine your schema
  • Auto schema inference: Let AI suggest schemas from your documents
  • Entity deduplication: Powered by Splink for accurate entity resolution
  • Human-in-the-loop: Review and refine extractions before final graph integration
  • Quality validation: Built-in validation to ensure extraction completeness and accuracy
  • Flexible storage: In-memory mode for quick starts, Neo4j for production
  • Real-time tracking: Monitor preprocessing and extraction progress
  • Scalable architecture: Microservice design optimized for document intelligence

Self-Hosting

Want to run Graphora on your infrastructure? Here's how to get started.

Prerequisites

  • Python 3.11 or higher
  • uv package manager

Quick Start

# Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh

# Clone and setup
git clone https://github.com/graphora/graphora-api.git
cd graphora-api
uv sync

# Start development server
make dev

The API will be available at the address printed by the development server (commonly http://localhost:8000 for FastAPI apps).

Zero-Config Mode

Perfect for local development and testing:

# Set environment variables
export AUTH_BYPASS_ENABLED=true
export STORAGE_TYPE=memory
export LLM_PROVIDER=openai
export OPENAI_API_KEY=your-key-here

# Start server
make dev

No database setup required. The system will use in-memory storage and auto-infer schemas.
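The same settings can also live in a `.env` file instead of shell exports; a minimal sketch (variable names taken from the exports above, values are placeholders):

```shell
# .env — zero-config local development
AUTH_BYPASS_ENABLED=true
STORAGE_TYPE=memory
LLM_PROVIDER=openai
OPENAI_API_KEY=your-key-here
```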

Full Setup with Neo4j

For production deployments:

# Start all services (Postgres, Neo4j, Prefect, Redis)
make compose-up

# Run migrations
make migrate

# Start server
make dev

See Local Development Guide for detailed setup instructions.

Developer Shortcuts

Setup:

  • make install - sync Python dependencies via uv
  • make install-dev - install with dev dependencies

Development:

  • make dev - start development server with auto-reload
  • make start - start production server

Local Services:

  • make compose-up - start Postgres, Neo4j, Prefect, and Redis containers
  • make compose-down - stop local services
  • make compose-logs - tail logs from local services
  • make compose-status - show status of local services

Testing:

  • make test - run all tests (emits coverage stats and writes coverage.xml)
  • make test-unit - run unit tests only
  • make test-integration - run integration tests only
  • make test-cov - run tests with HTML coverage report

Code Quality:

  • make lint - run Ruff + Black checks
  • make lint-fix - run Ruff + Black with auto-fix
  • make format - apply Black formatting
  • make typecheck - run mypy type checking
  • make deadcode - run Vulture to surface unused definitions
  • make pre-commit - run all pre-commit checks (lint, test, deadcode)

Database:

  • make migrate - run database migrations
  • make dev-reset-postgres - delete local Postgres data
  • make dev-reset-neo4j - delete local Neo4j data
  • make dev-reset-redis - delete local Redis data

Other:

  • make openapi-snapshot - regenerate tests/snapshots/openapi.json
  • make clean - remove build artifacts and cache files
  • make help - show all available commands

Authentication

All API requests must include a Clerk-issued bearer token:

Authorization: Bearer <token>

Configure the backend with Clerk credentials via .env:

  • CLERK_JWKS_URL
  • CLERK_ISSUER
  • CLERK_AUDIENCE
  • CLERK_API_KEY (if server-to-server calls are required)

Clients no longer send the legacy user-id header; the backend derives the user from the JWT subject claim.

Service-to-service / pipeline tokens

Call the API from CI jobs or data pipelines without extra backend code. Mint short-lived Clerk JWTs on demand:

  1. Create (or reuse) a Clerk user that represents the pipeline and add a JWT template (e.g. graphora_pipeline) whose aud value matches CLERK_AUDIENCE.

  2. When the pipeline starts, create a token via Clerk's backend API:

    curl -X POST "https://api.clerk.com/v1/users/<USER_ID>/tokens/graphora_pipeline" \
      -H "Authorization: Bearer $CLERK_API_KEY" \
      -H "Content-Type: application/json" \
      -d '{"expires_in_seconds": 3600}'
  3. Export the returned token before invoking the client:

    export GRAPHORA_AUTH_TOKEN="<clerk-jwt-from-step-2>"
    python pipeline.py
  4. Repeat the minting step whenever the token expires (keep TTLs short and rotate the Clerk API key like any other secret).

The graphora Python package automatically reads GRAPHORA_AUTH_TOKEN, so no application changes are required.
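Any plain HTTP client can attach the same token by hand. A minimal sketch of the standard bearer-auth header (the helper name here is ours, not part of the `graphora` package):

```python
import os

def auth_headers(fallback: str = "") -> dict:
    # Mirrors what the graphora client does: read GRAPHORA_AUTH_TOKEN from the
    # environment and send it as a standard bearer token.
    token = os.environ.get("GRAPHORA_AUTH_TOKEN", fallback)
    return {"Authorization": f"Bearer {token}"}

print(auth_headers("example-jwt"))
```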

Project Structure

app/
├── agents/      # AI agents for workflow and feedback
├── api/         # API endpoints (REST and GraphQL)
├── services/    # Core business logic services
├── schemas/     # Pydantic models and schemas
├── utils/       # Utility functions and helpers
└── main.py      # Application entry point

Key Components

  1. Preprocessing Service

    • Handles multi-step document preprocessing
    • Provides real-time status updates
    • Implements robust error handling
  2. Extraction Service

    • Manages entity and relationship extraction
    • Integrates with LLM providers for intelligent processing
    • Handles temporary graph creation
  3. Graph Service

    • Manages Neo4j database operations
    • Handles subgraph creation and updates
    • Processes user feedback

API Endpoints

REST API

  1. Document Upload

    POST /api/v1/documents/upload
    Content-Type: multipart/form-data
  2. Submit Feedback

    POST /api/v1/feedback/{document_id}
    Content-Type: application/json
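The upload endpoint expects `multipart/form-data`. As a sketch of what any client must send, here is the body layout built by hand; the form field name `"file"` is an assumption, and real clients would normally use their HTTP library's multipart support instead.

```python
import uuid

def multipart_body(field: str, filename: str, data: bytes) -> tuple[str, bytes]:
    """Build a multipart/form-data body; return (content_type, body)."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode()
    tail = f"\r\n--{boundary}--\r\n".encode()
    return f"multipart/form-data; boundary={boundary}", head + data + tail

ctype, body = multipart_body("file", "document.pdf", b"%PDF-1.7 ...")
print(ctype.split(";")[0])  # multipart/form-data
```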

See the API Documentation for the complete endpoint reference.

Documentation

Related Repositories

Contributing

We welcome contributions! Please see our Contributing Guide for details.

Before Contributing

  1. Read the Code of Conduct
  2. Sign the Contributor License Agreement
  3. Check out good first issues

License

This project is licensed under the MIT License.

  • ✅ Use freely in personal and commercial projects
  • ✅ Modify and distribute with or without source code
  • ✅ Use in closed-source and SaaS products

See LICENSE for full terms.

Commercial Support

  • Enterprise Support: SLA-backed support for production deployments
  • Consulting: Custom integrations, training, architecture design
  • Commercial Licensing: Closed-source and SaaS deployments
  • Database Vendor Partnerships: OEM licensing for database companies

Contact: support@graphora.io

Community

Security

Please report security vulnerabilities to support@graphora.io

See SECURITY.md for details.


Made with ❤️ by Arivan Labs
