Perseus

A documentation extraction and generation system that converts structured comment blocks and narrative .pdoc templates into complete Markdown documentation. Perseus centralizes context, vocabulary, and narrative explanation across codebases, keeping documentation maintainable, contextual, and consistent.

Overview

Perseus extracts structured @pdoc blocks from code comments, merges them with additional narrative documents, and compiles them into Markdown or JSON outputs using Jinja-style templating.

Demo Video: https://www.youtube.com/watch?v=JB6BY2yXOlQ

Sample Generated Documentation:

View a sample project here

Perseus

Key design goals:

Co-locate documentation context directly with code where it is created.
Provide richer metadata: intentions, tags, vocabulary definitions, tickets, business requirements, and more.
Enable narrative documentation that links to code-derived context.
Support reusable glossary terms across the entire repository.

Core Features

Build docs with one command:
- Run perseus build to scan your codebase and .pdoc files, extract context, and generate Markdown docs.
Single config file:
- Use perseus.yaml to set source paths, output location, templates, and format (Markdown, JSON, HTML).
Narrative .pdoc templates:
- Write reusable documentation using Jinja-style syntax, referencing code context and glossary terms.
Glossary merging:
- All vocabulary from code is merged and available in every doc.

Typical workflow:

Add @pdoc blocks to your code.
Write .pdoc templates for narrative docs.
Run the build command to generate output.

Components

1. Context Blocks (`@pdoc ... @endp`)

Embedded in code comments. These define structured metadata.

Ideal Supported fields (only some implemented currently):

intention – high-level purpose of the unit
tickets – JIRA or other ticket links
tags – searchable identifiers
vocabulary – domain terminology definitions
feature_flags
experiments
business_requirement
high_level_intention
prerequisite
approvers
to_do
legacy / migration
maintainers
cross_functional_teams
watch – track file changes (optional)
code – embed relevant code regions (optional)

These fields form the underlying data accessible to .pdoc templates.

2. Perseus Files (`.pdoc`)

Markdown-like documents with Jinja syntax to assemble final output.

Capabilities:

Link to context from any file
Use global glossary
Render code blocks
Combine many contexts into a single final document

3. Builder / Compiler

The pers build compiler performs:

Extraction
Parsing (YAML)
Context storage
Template rendering
Export to Markdown/JSON/HTML

Technical Implementation

Directory Scanner

Traverses source paths and locates:

Code files with @pdoc blocks
.pdoc narrative files

YAML Parser

Uses a fast and accurate YAML parser for blocks within comments.

Context Repository

A storage layer that collects parsed context and exposes it to the templating engine.

Template Engine

Uses a Jinja2-compatible engine to render Markdown, JSON, or HTML.

CLI Interface

The pers binary provides:

pers build
Future commands: pers watch, pers list, pers glossary, etc.

Additional Feature Concepts

Rich Ticket Context

Supports structured ticket metadata beyond simple strings.

Tagging

Allows tagging functions, methods, classes, or files for search, indexing, and pattern detection.

Documentation Design

Teams may choose between:

Co-located documentation (context embedded next to code)
Centralized .pdoc narrative documents
A hybrid model

Output Formats

Perseus supports the following output types:

Markdown (primary)
JSON (intermediate representation)

Markdown is typically generated from .pdoc templates.

Glossary

All vocabulary from every @pdoc block is merged into a global glossary. Every documentation file can reference this glossary.

Quickstart

Install dependencies (recommended: uv):

uv pip install -r requirements.txt

Run the demo:

uv run perseus/main.py --config test_projects/simple_python/perseus.yaml build --format md

or

uv run perseus/main.py --root test_projects/simple_python build --format md

This will produce Markdown in test_projects/simple_python/docs/build.

.env

Change the .env for debugging and log levels, use the logger library for logging information over printing

Jira Integration (Optional)

Perseus can automatically enrich ticket references with live data from Jira. To enable this feature:

Configure your .env file:

JIRA_BASE_URL=https://your-company.atlassian.net
JIRA_EMAIL=your-email@company.com
JIRA_API_TOKEN=your_api_token_here

Generate a Jira API token:
- Go to https://id.atlassian.com/manage-profile/security/api-tokens
- Create a new API token
- Copy the token to your .env file
Reference tickets in your code:

"""
@pdoc
id: example_function
tickets:
  - KAN-1
  - KAN-4
@endp
"""

Access enriched data in templates:

{% if block.tickets_enriched %}
| Key | Title | Status | Assignee | Priority |
|-----|-------|--------|----------|----------|
{% for t in block.tickets_enriched %}
| [{{ t.key }}]({{ t.url }}) | {{ t.title }} | {{ t.status }} | {{ t.assignee }} | {{ t.priority }} |
{% endfor %}
{% endif %}

The system automatically fetches ticket details (title, status, assignee, priority, URL) during the build process. Results are cached to minimize API calls. If credentials are not configured, tickets will be displayed as simple text.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
perseus		perseus
test_projects		test_projects
.gitignore		.gitignore
.python-version		.python-version
PERSEUS_AI_USAGE.txt		PERSEUS_AI_USAGE.txt
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Overview

Perseus

Core Features

Components

1. Context Blocks (`@pdoc ... @endp`)

2. Perseus Files (`.pdoc`)

3. Builder / Compiler

Technical Implementation

Directory Scanner

YAML Parser

Context Repository

Template Engine

CLI Interface

Additional Feature Concepts

Rich Ticket Context

Tagging

Documentation Design

Output Formats

Glossary

Quickstart

.env

Jira Integration (Optional)

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

AllanKoder/Perseus

Folders and files

Latest commit

History

Repository files navigation

Overview

Perseus

Core Features

Components

1. Context Blocks (@pdoc ... @endp)

2. Perseus Files (.pdoc)

3. Builder / Compiler

Technical Implementation

Directory Scanner

YAML Parser

Context Repository

Template Engine

CLI Interface

Additional Feature Concepts

Rich Ticket Context

Tagging

Documentation Design

Output Formats

Glossary

Quickstart

.env

Jira Integration (Optional)

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

1. Context Blocks (`@pdoc ... @endp`)

2. Perseus Files (`.pdoc`)

Packages