Short Term Memory is Live! 🚀 #73
cmac86
announced in
00-Announcements
Replies: 1 comment 2 replies
-
|
Hello, Do you mind showing me the proper commands to get the latest version of CAAL? cd CAAL Not sure if this is still the best way to update my code. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
CAAL now remembers context between tool calls.
When you ask "where is my package?" CAAL can pull your package tracking number from memory and chain it into the
package_trackertool - no need to repeat yourself. And when a tool returns useful data, it can store it automatically for later use.What's New
Three Ways to Store Memories
Tool hints - n8n workflows can return a
memory_hintin their response. CAAL stores it automatically. Your flight tracker returns UA1234, and next time you ask about your flight, CAAL already knows.Explicit storage - Tell CAAL to remember something. "Remember my flight is UA1234" stores it via the
memory_shorttool. Store, get, delete, list - all through voice.HTTP API -
POST /memoryfor external systems. Your n8n workflows or other services can push context into CAAL's memory.Memory Panel
New Brain icon in the frontend. View all stored memories with source badges (tool/voice/api), timestamps, and TTL countdown. Inline edit and clear all.
TTL-Based Expiry
Memories expire automatically. Flight numbers, package tracking, delivery updates - useful for a few days, then gone. No manual cleanup needed. Editable in the UI and a parameter in
memory_shorttool so you can tell CAAL how long to remember it for.How It Works
Currently, short-term memory is injected into context so the model is aware of what's stored. The model also has a
memory_shorttool to explicitly fetch, store, and manage memories. Once caal-ministral is trained on memory tool calls, the auto-injection will be removed to reduce context bloat. The model will learn to fetch from memory when it needs to, rather than having everything injected on every call.This is short-term memory - transient data that helps the model make better routing decisions. It's not learning new skills or accumulating intelligence. It gets enough context to call the right tool with the right parameters.
What's Next
Thinking about long-term memory as the next layer. Some early ideas we're exploring:
If you have thoughts on how memory should work, share them in this thread.
Try It
Pull the latest and start telling CAAL things to remember.
Video
A deeper walkthrough video is in the works covering the full memory system and the architecture thinking behind it.
Beta Was this translation helpful? Give feedback.
All reactions