MnemeBrain Benchmark Suite (BMB): 48 tasks evaluating belief dynamics in AI memory systems — contradiction detection, revision, provenance, and temporal reasoning.
benchmark knowledge-graph graph-database cognitive-science knowledge-representation evaluation-framework ai-agents cognitive-architecture memory-systems belief-revision ai-research symbolic-ai reasoning-systems llm-evaluation llm-agents ai-benchmark agent-memory agent-evaluation belief-graphs
-
Updated
Mar 13, 2026 - Python