Persistent 4-tier memory for AI agents. Weighted retrieval. Vector search. Progressive disclosure.
Your agent remembers what it learned. Across sessions. Forever.
Most AI agents forget everything between sessions. The few that don't typically rely on flat files that grow without bound: no ranking, no decay, no structure.
AOMS models how memory actually works:
- Important things surface first — weighted retrieval with reinforcement learning
- Old things naturally fade — time-based weight decay
- Similar things consolidate — automatic clustering and summarization
- Context stays efficient — progressive disclosure (L0/L1/L2) gives 98% token reduction
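
The decay behavior can be sketched as exponential down-weighting. This is a minimal illustration using the `decay_rate` and `min_weight` values from the default config; the service's actual scoring internals live in `service/storage.py`:

```python
import time

DECAY_RATE = 0.995   # daily multiplier (default config value)
MIN_WEIGHT = 0.1     # floor below which entries become consolidation candidates

def effective_weight(weight: float, last_access_ts: float, now: float) -> float:
    """Exponential time decay: weight * DECAY_RATE ** idle_days, clamped to a floor."""
    idle_days = (now - last_access_ts) / 86400
    return max(MIN_WEIGHT, weight * DECAY_RATE ** idle_days)

now = time.time()
stale = effective_weight(2.0, now - 365 * 86400, now)   # untouched for a year
fresh = effective_weight(1.0, now - 86400, now)         # touched yesterday
# stale ends up below fresh despite the higher starting weight
```

Reinforcement pushes a weight back up (see `/memory/weight` below), so memories that keep proving useful resist the decay.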
Running on a live autonomous agent stack with 63,000+ memories and counting.
```bash
# Install
git clone https://github.com/dhawalc/cortex-mem.git
cd cortex-mem
pip install -e .

# Start
cortex-mem start --daemon

# Check health
cortex-mem status

# Search memory
cortex-mem search "deployment"
```

Or via Docker:

```bash
docker build -t aoms .
docker run -p 9100:9100 -v aoms-data:/app/modules aoms
```

API docs at http://localhost:9100/docs.
| Tier | Stores | Example |
|---|---|---|
| Episodic | Experiences, decisions, failures | "Deployed v2 — rollback needed due to missing migration" |
| Semantic | Facts, relations, knowledge graphs | "Project uses pnpm, not npm" |
| Procedural | Skills, patterns, workflows | "To deploy: run migrations first, then build, then push" |
| Working | Active tasks, current context | "Currently debugging auth token refresh" |
```bash
# Write a memory
curl -X POST http://localhost:9100/memory/episodic \
  -H "Content-Type: application/json" \
  -d '{"type": "experience", "payload": {"title": "Fixed auth bug", "outcome": "Token refresh was missing retry logic"}, "weight": 1.3}'

# Search
curl -X POST http://localhost:9100/memory/search \
  -H "Content-Type: application/json" \
  -d '{"query": "auth", "limit": 5}'

# Agent recall (formatted context for prompt injection)
curl -X POST http://localhost:9100/recall \
  -H "Content-Type: application/json" \
  -d '{"task": "deploy the API", "token_budget": 500, "format": "markdown"}'

# Reinforce useful memory
curl -X POST http://localhost:9100/memory/weight \
  -H "Content-Type: application/json" \
  -d '{"entry_id": "abc123", "tier": "episodic", "task_score": 0.9}'

# Progressive disclosure query
curl -X POST http://localhost:9100/cortex/query \
  -H "Content-Type: application/json" \
  -d '{"query": "deployment process", "token_budget": 1000}'
```

| Endpoint | Method | Description |
|---|---|---|
| /memory/{tier} | POST | Write a memory entry |
| /memory/search | POST | Keyword search with weighted scoring |
| /memory/semantic-search | POST | Vector search (requires Ollama) |
| /memory/weight | POST | Reinforce/decay entry weight |
| /memory/decay | POST | Time-based weight decay |
| /memory/consolidate | POST | Merge similar old memories |
| /memory/deduplicate | POST | Find and merge duplicates |
| /recall | POST | Agent context recall (formatted) |
| /cortex/query | POST | Smart L0/L1/L2 query |
| /cortex/ingest | POST | Ingest document with tier generation |
| /entities/extract | POST | Extract entities from text |
| /stats | GET | Memory analytics |
| /health | GET | Service health |
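
The `/memory/weight` endpoint maps a task outcome score onto a weight update. The exact rule is internal to the service; a plausible sketch, assuming `task_score` in [0, 1] and the clamp bounds from the default config (the learning rate `lr` is invented for this illustration):

```python
MIN_WEIGHT, MAX_WEIGHT = 0.1, 5.0  # clamp bounds from the default config

def reinforce(weight: float, task_score: float, lr: float = 0.2) -> float:
    """Illustrative update rule: scores above 0.5 strengthen a memory,
    scores below 0.5 weaken it, and the result stays within the clamp bounds."""
    delta = lr * (task_score - 0.5)
    return min(MAX_WEIGHT, max(MIN_WEIGHT, weight + delta))

# A memory that helped (task_score=0.9) drifts up; one that misled drifts down.
```

Whatever the precise rule, the effect is the one described above: frequently useful memories accumulate weight and surface first in retrieval.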
```bash
clawhub install aoms
```

AOMS auto-configures when installed alongside OpenClaw. The pip package includes an OpenClaw plugin that starts the service, configures the memory backend, and migrates existing workspace memory.
```python
import httpx

# Recall relevant context at session start
resp = httpx.post("http://localhost:9100/recall", json={
    "task": "working on auth module",
    "token_budget": 500,
    "format": "markdown",
})
context = resp.json()["context"]

# Log what you learned
httpx.post("http://localhost:9100/memory/episodic", json={
    "type": "experience",
    "payload": {"title": "pnpm not npm", "outcome": "Project uses pnpm workspaces"},
    "weight": 1.5,
})
```

```
cortex-mem/
├── service/                 # FastAPI application
│   ├── api.py               # All endpoints
│   ├── storage.py           # JSONL engine + weighted scoring
│   └── models.py            # Pydantic schemas
├── cortex/                  # Progressive disclosure engine
│   ├── tiered_retrieval.py  # L0/L1/L2 query with auto-escalation
│   └── tier_generator.py    # Document ingestion + summary generation
├── cortex_mem/              # Python package + CLI
│   ├── cli.py               # Click CLI
│   └── openclaw_plugin.py   # Auto-integration
├── modules/                 # JSONL memory data
│   └── memory/
│       ├── episodic/
│       ├── semantic/
│       └── procedural/
├── Dockerfile
├── pyproject.toml
└── run.py
```
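
The L0/L1/L2 auto-escalation in `cortex/tiered_retrieval.py` can be sketched as budget-gated escalation: start at the most compressed level and only return more detail while it still fits the caller's token budget. The token estimator and level contents here are illustrative, not the service's actual implementation:

```python
def estimate_tokens(text: str) -> int:
    # Crude heuristic: roughly 4 characters per token.
    return len(text) // 4

def tiered_answer(levels: list[str], token_budget: int) -> str:
    """Return the most detailed level (L0 -> L1 -> L2) that fits the budget."""
    answer = levels[0]  # L0 is the headline summary; assume it always fits
    for detail in levels[1:]:
        if estimate_tokens(detail) > token_budget:
            break  # next level would blow the budget; stop escalating
        answer = detail
    return answer

l0 = "Deploy: migrate, build, push."
l1 = l0 + " Run migrations first; the build step bakes the schema version."
l2 = l1 + " Full runbook: backup, migrate, build, canary, push, verify, rollback plan."
```

A tight budget gets the one-line L0 summary; a generous one gets the full L2 detail. This is what makes the ~98% token reduction possible without losing access to depth.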
```
cortex-mem start [--port 9100] [--daemon]   Start service
cortex-mem stop                             Stop service
cortex-mem status                           Health check
cortex-mem search QUERY [--limit 5]         Search memory
cortex-mem migrate SOURCE                   Import workspace data
```
```yaml
# service/config.yaml
service:
  port: 9100
  host: localhost       # 0.0.0.0 for Docker

weights:
  decay_rate: 0.995     # Daily decay multiplier
  min_weight: 0.1
  max_weight: 5.0
```

- Python 3.10+
- Optional: Ollama with `nomic-embed-text` for vector search
- Optional: Ollama with any chat model for consolidation/entity extraction
MIT
