A system that transforms unstructured information into structured, evolving knowledge assets using LLMs.
Knowledge Compiler is a deterministic pipeline that converts raw content (articles, notes, PDFs) into structured Markdown-based knowledge units.
Instead of ephemeral AI outputs, this system produces persistent, versioned, and continuously improving knowledge.
The system is designed as a headless knowledge engine, with tools like Obsidian acting as the visualization layer.
- Structured over freeform — every output follows a strict schema
- Deterministic pipelines — no ad-hoc prompting
- Markdown as source of truth — portable, versionable, and human-readable
- Composable system — simple primitives over complex infrastructure
- Incremental refinement — knowledge improves over time
Input Sources → LLM Processing → Structured Markdown → Retrieval & Refinement
- **Input Layer**: raw text, PDFs, notes, articles
- **Processing Layer**: LLM transforms input into structured knowledge units
- **Storage Layer**: Markdown files (`/knowledge`) as the source of truth
- **Consumption Layer**: Obsidian or any Markdown-compatible viewer
- **Refinement Layer** (future): improves, links, and updates existing knowledge
```
knowledge-compiler/
│
├── knowledge/    # Compiled knowledge (Markdown files / Obsidian vault)
│   ├── backend/
│
├── pipelines/    # LLM processing logic
├── schemas/      # Knowledge contracts (zod / types)
├── scripts/      # CLI / execution scripts
│
├── README.md
```
Each knowledge unit follows a strict structure:
```markdown
---
id: rate-limiting
title: Rate Limiting
tags: [backend, distributed-systems]
created_at: YYYY-MM-DD
updated_at: YYYY-MM-DD
source: article | pdf | manual
---

## Summary
...

## Key Concepts
...

## Deep Dive
...

## Related
- [[Token Bucket]]
- [[Leaky Bucket]]

## Open Questions
...
```

The system treats each Markdown file as a node in a knowledge graph:
- Files → nodes
- `[[links]]` → edges
- Tags → semantic grouping
This enables:
- Graph-based navigation (via Obsidian)
- Context-aware refinement
- Future semantic retrieval
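For illustration, extracting graph edges from a unit's `[[links]]` can be a simple regex scan (a sketch; `extractEdges` is a hypothetical helper, not part of the repo's current code):

```typescript
// Extract outgoing [[wiki links]] from a knowledge unit's Markdown body.
// Each captured target becomes an edge in the knowledge graph.
function extractEdges(markdown: string): string[] {
  const edges: string[] = [];
  const linkPattern = /\[\[([^\]]+)\]\]/g; // matches [[Target Note]]
  for (const match of markdown.matchAll(linkPattern)) {
    edges.push(match[1]);
  }
  return edges;
}

// Example: the "Related" section of rate-limiting.md
console.log(extractEdges("- [[Token Bucket]]\n- [[Leaky Bucket]]"));
// → [ 'Token Bucket', 'Leaky Bucket' ]
```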
- Read raw input
- Send to LLM with structured prompt
- Validate output format
- Save as Markdown file
- (Future) Refine and link with existing knowledge
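The steps above can be sketched in TypeScript, with the LLM call stubbed out (function names and the section check here are illustrative, not the repo's actual API):

```typescript
import * as fs from "node:fs";

// Stand-in type for the real OpenAI call.
type LLM = (prompt: string) => string;

// Minimal structural check; the real contract lives in schemas/.
const REQUIRED_SECTIONS = ["## Summary", "## Key Concepts", "## Deep Dive"];

function compile(inputPath: string, outputPath: string, callLLM: LLM): void {
  const raw = fs.readFileSync(inputPath, "utf8");                 // 1. read raw input
  const unit = callLLM(`Compile into a knowledge unit:\n${raw}`); // 2. structured prompt
  for (const section of REQUIRED_SECTIONS) {                      // 3. validate output format
    if (!unit.includes(section)) {
      throw new Error(`LLM output missing ${section}`);
    }
  }
  fs.writeFileSync(outputPath, unit);                             // 4. save as Markdown
}
```

Keeping the LLM behind a function type makes the pipeline deterministic to test: swap in a stub and the rest of the flow is plain file I/O plus validation.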
Install dependencies:

```shell
npm install
```

Set your API key:

```shell
OPENAI_API_KEY=your_api_key
```

Run the pipeline:

```shell
npm run generate
```

Example:

```shell
echo "Rate limiting prevents abuse in distributed systems..." > input.txt
npm run generate
```

Output:

```
/knowledge/backend/rate-limiting.md
```
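The output path appears to be derived from the unit's title and tag; a hypothetical sketch of that mapping (the repo's actual naming logic may differ):

```typescript
// Derive a knowledge-unit path from a title and its primary tag.
// Hypothetical helper mirroring the rate-limiting example above.
function unitPath(title: string, tag: string): string {
  const slug = title
    .toLowerCase()
    .replace(/[^a-z0-9]+/g, "-") // runs of non-alphanumerics → single hyphen
    .replace(/^-|-$/g, "");      // trim stray leading/trailing hyphens
  return `/knowledge/${tag}/${slug}.md`;
}

console.log(unitPath("Rate Limiting", "backend")); // → /knowledge/backend/rate-limiting.md
```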
- Schema validation with zod
- Auto-linking between knowledge units
- Incremental refinement pipeline
- Full-text search
- Semantic search (optional)
- Version diffing and rollback
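The planned zod validation would enforce the frontmatter contract shown earlier. A dependency-free sketch of the same check (field names follow the example unit; date fields omitted for brevity):

```typescript
// Validate a knowledge unit's frontmatter against the contract.
// Hand-rolled for illustration; the repo plans zod for this.
interface Frontmatter {
  id: string;
  title: string;
  tags: string[];
  source: "article" | "pdf" | "manual";
}

function validateFrontmatter(data: Record<string, unknown>): Frontmatter {
  const sources = ["article", "pdf", "manual"];
  if (typeof data.id !== "string") throw new Error("id must be a string");
  if (typeof data.title !== "string") throw new Error("title must be a string");
  if (!Array.isArray(data.tags) || !data.tags.every((t) => typeof t === "string")) {
    throw new Error("tags must be a string array");
  }
  if (typeof data.source !== "string" || !sources.includes(data.source)) {
    throw new Error("source must be article | pdf | manual");
  }
  return data as unknown as Frontmatter;
}
```

Failing fast here is what keeps the Markdown store trustworthy: anything the LLM emits that breaks the contract never reaches `/knowledge`.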
This project intentionally avoids:
- Heavy RAG pipelines
- Vector databases (early stage)
- Over-engineered abstractions
The focus is on clarity, determinism, and long-term knowledge quality.