
Agentic Memorizer


A knowledge graph-based memorization tool for AI agents.

Current Version: N/A


Overview

Agentic Memorizer is an automated knowledge graph builder designed to give AI assistants persistent, queryable memory of filesystem content. Users register directories they want the system to "remember," and a background daemon takes over from there.

The daemon continuously watches registered directories for file changes and periodically walks them to ensure completeness. When files are added, modified, or removed, the system automatically:

  1. Filters content based on configurable skip/include rules (extensions, directories, hidden files)
  2. Chunks files using format-specific parsers that preserve semantic structure
  3. Analyzes chunks via AI providers to extract topics, entities, summaries, and tags
  4. Generates embeddings for semantic similarity search
  5. Persists everything to a FalkorDB knowledge graph with typed relationships
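
Step 2's structure-preserving chunking can be illustrated with a minimal sketch. The real chunkers are format-specific and Tree-sitter-backed; this paragraph-boundary packer is only an illustrative analogue, and its name and size limit are assumptions, not the project's API:

```go
package main

import (
	"fmt"
	"strings"
)

// chunkByParagraph splits text on blank-line paragraph boundaries and packs
// whole paragraphs into chunks of at most maxLen bytes, so semantic units
// are never cut mid-paragraph (unless a single paragraph exceeds the limit).
func chunkByParagraph(text string, maxLen int) []string {
	paras := strings.Split(text, "\n\n")
	var chunks []string
	var cur strings.Builder
	for _, p := range paras {
		p = strings.TrimSpace(p)
		if p == "" {
			continue
		}
		// Flush the current chunk if adding this paragraph would overflow it.
		if cur.Len() > 0 && cur.Len()+len(p)+2 > maxLen {
			chunks = append(chunks, cur.String())
			cur.Reset()
		}
		if cur.Len() > 0 {
			cur.WriteString("\n\n")
		}
		cur.WriteString(p)
	}
	if cur.Len() > 0 {
		chunks = append(chunks, cur.String())
	}
	return chunks
}

func main() {
	doc := "First paragraph.\n\nSecond paragraph.\n\nThird paragraph."
	for i, c := range chunkByParagraph(doc, 40) {
		fmt.Printf("chunk %d: %q\n", i, c)
	}
}
```

The same packing idea generalizes to code, where the boundaries come from AST nodes (functions, types) instead of blank lines.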

The resulting knowledge graph is exposed to AI coding assistants through multiple integration methods: the Model Context Protocol (MCP) for standards-based access, hooks for injecting context at session start, and plugins for native tool integration. This enables AI assistants to understand and query any content you point them at: codebases, documentation, research notes, configuration repositories, or any other file-based knowledge.

Key capabilities:

  • Intelligent Chunking - 22 format-specific chunkers with language-aware semantic splitting using Tree-sitter AST parsing for code (8 languages) and structure-preserving chunking for documents
  • Semantic Analysis - Pluggable providers (Anthropic, OpenAI, Google) extract topics, entities, and summaries from content
  • Vector Embeddings - OpenAI, Voyage AI, and Google providers generate embeddings for semantic similarity search
  • Knowledge Graph - FalkorDB (Redis Graph) backend stores files, chunks, metadata, and relationships
  • Real-time Monitoring - Filesystem watcher with event coalescing detects changes and triggers analysis
  • MCP Integration - Standards-based protocol exposes knowledge graph to AI tools

Quick Start

  1. Build and install the binary

    git clone https://github.com/leefowlercu/agentic-memorizer.git
    cd agentic-memorizer
    make install

  2. Run the setup wizard

    memorizer initialize

  3. Start the daemon

    memorizer daemon start

  4. Register a directory to monitor

    memorizer remember ~/projects/my-codebase

  5. List remembered directories

    memorizer list

Architecture

┌─────────────────────────────────────────────────────────────────────┐
│                         CLI Layer (Cobra)                           │
│  [version] [initialize] [daemon] [remember] [forget] [list] [read]  │
│  [integrations] [providers] [config]                                │
└──────────────────┬──────────────────────────────────────────────────┘
                   │
┌──────────────────▼──────────────────────────────────────────────────┐
│                       Daemon Core                                   │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐               │
│  │  Component   │  │  Health      │  │  HTTP Server │               │
│  │  Lifecycle   │  │  Manager     │  │  (7600)      │               │
│  └──────────────┘  └──────────────┘  └──────────────┘               │
└──────────────────┬──────────────────────────────────────────────────┘
                   │
┌──────────────────▼──────────────────────────────────────────────────┐
│                        Event Bus                                    │
│                  Async pub/sub backbone                             │
└───┬────────────────────────┬────────────────────────┬───────────────┘
    │                        │                        │
┌───▼───────────┐   ┌────────▼────────┐   ┌───────────▼──────────┐
│  Filesystem   │   │    Analysis     │   │      Cleaner         │
│  Watcher      │   │    Pipeline     │   │  (stale removal)     │
└───────────────┘   └────────┬────────┘   └──────────────────────┘
                             │
           ┌─────────────────┼─────────────────┐
           │                 │                 │
    ┌──────▼──────┐   ┌──────▼──────┐   ┌──────▼──────┐
    │  Chunkers   │   │  Semantic   │   │ Embeddings  │
    │    (22)     │   │  Providers  │   │  Providers  │
    └─────────────┘   └─────────────┘   └─────────────┘
                             │
                     ┌───────▼───────┐
                     │  Knowledge    │
                     │  Graph        │
                     │  (FalkorDB)   │
                     └───────────────┘

Data Flow:

  1. Filesystem watcher detects changes in registered directories
  2. Events are published to the Event Bus (async pub/sub)
  3. Analysis Pipeline subscribes and processes queued events
  4. Format-specific chunkers split files and extract metadata
  5. Semantic providers analyze content for topics, entities, and summaries
  6. Embeddings providers generate vector representations
  7. Results are stored in the FalkorDB knowledge graph
  8. Cleaner subscribes to deletion events to remove stale graph entries
  9. CLI and MCP server provide query interfaces
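
The Event Bus at the center of this flow is an async pub/sub component with per-subscriber buffers. A minimal Go sketch of the pattern (the types, method names, and buffer handling here are illustrative assumptions, not the daemon's actual API):

```go
package main

import (
	"fmt"
	"sync"
)

// Event is a simplified filesystem event.
type Event struct {
	Op   string // "create", "modify", "delete"
	Path string
}

// Bus is a minimal async pub/sub bus: each subscriber gets its own buffered
// channel, so a slow consumer does not block publishers until its buffer fills.
type Bus struct {
	mu   sync.Mutex
	subs []chan Event
}

// Subscribe registers a new consumer with the given buffer size.
func (b *Bus) Subscribe(buffer int) <-chan Event {
	b.mu.Lock()
	defer b.mu.Unlock()
	ch := make(chan Event, buffer)
	b.subs = append(b.subs, ch)
	return ch
}

// Publish fans the event out to every subscriber.
func (b *Bus) Publish(e Event) {
	b.mu.Lock()
	defer b.mu.Unlock()
	for _, ch := range b.subs {
		ch <- e // blocks only if this subscriber's buffer is full
	}
}

func main() {
	bus := &Bus{}
	pipeline := bus.Subscribe(100) // analysis pipeline
	cleaner := bus.Subscribe(100)  // stale-entry cleaner

	bus.Publish(Event{Op: "modify", Path: "/repo/main.go"})
	bus.Publish(Event{Op: "delete", Path: "/repo/old.txt"})

	fmt.Println((<-pipeline).Path) // /repo/main.go
	fmt.Println((<-cleaner).Op)    // modify
}
```

In the real daemon, the buffer size corresponds to the `event_bus.buffer_size` setting shown in the Configuration section.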

CLI Commands

Command                     Description
version                     Display build information
initialize                  Run the interactive setup wizard
daemon start                Start the daemon in foreground mode
daemon stop                 Stop the running daemon gracefully
daemon status               Show daemon status and health metrics
daemon rebuild              Rebuild the knowledge graph
remember <path>             Register a directory for tracking
forget <path>               Unregister a directory
list                        List all remembered directories (requires daemon)
read                        Export the knowledge graph (requires daemon)
integrations list           List available integrations
integrations setup <name>   Configure an integration
integrations status         Show integration status
integrations remove <name>  Remove an integration
providers list              List semantic/embeddings providers
providers test <name>       Test provider connectivity
config show                 Display current configuration
config edit                 Open configuration in editor
config validate             Validate configuration file
config reset                Reset to default configuration

Note: list and read query the running daemon. Start it with memorizer daemon start first.

Configuration

Configuration is stored at ~/.config/memorizer/config.yaml with environment variable overrides using the MEMORIZER_ prefix. See config.yaml.example for the complete reference with detailed comments.

log_level: info
log_file: ~/.config/memorizer/memorizer.log

daemon:
  http_port: 7600
  http_bind: 127.0.0.1
  shutdown_timeout: 30
  pid_file: ~/.config/memorizer/daemon.pid
  rebuild_interval: 3600
  metrics:
    collection_interval: 15
  event_bus:
    buffer_size: 100
    critical_queue_capacity: 1000

storage:
  database_path: ~/.config/memorizer/memorizer.db

graph:
  host: localhost
  port: 6379
  name: memorizer
  password_env: MEMORIZER_GRAPH_PASSWORD
  max_retries: 3
  retry_delay_ms: 1000
  write_queue_size: 1000

semantic:
  enabled: true
  provider: anthropic
  model: claude-sonnet-4-5-20250929
  rate_limit: 10
  api_key_env: ANTHROPIC_API_KEY

embeddings:
  enabled: true
  provider: openai
  model: text-embedding-3-large
  dimensions: 3072
  api_key_env: OPENAI_API_KEY

defaults:
  skip:
    extensions: [".exe", ".dll", ".so", ".dylib", ".bin", ...]
    directories: [".git", "node_modules", "__pycache__", "dist", ...]
    files: [".DS_Store", "package-lock.json", "*.min.js", ...]
    hidden: true
  include:
    extensions: []
    directories: []
    files: []
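
The defaults.skip/include rules above amount to a path filter applied before any chunking or analysis. A sketch of how such rules might be evaluated, with include overriding skip (field names and precedence are assumptions, not the daemon's implementation):

```go
package main

import (
	"fmt"
	"path/filepath"
	"strings"
)

// FilterRules mirrors the defaults.skip/include shape from the config;
// the field names here are illustrative only.
type FilterRules struct {
	SkipExts    map[string]bool
	SkipDirs    map[string]bool
	SkipHidden  bool
	IncludeExts map[string]bool // an include rule overrides a skip-by-extension
}

// ShouldSkip decides whether a slash-separated relative path is filtered out.
// Include overrides are checked first, then directory, hidden, and extension skips.
func (r FilterRules) ShouldSkip(rel string) bool {
	ext := strings.ToLower(filepath.Ext(rel))
	if r.IncludeExts[ext] {
		return false
	}
	for _, part := range strings.Split(rel, "/") {
		if r.SkipDirs[part] {
			return true
		}
		if r.SkipHidden && strings.HasPrefix(part, ".") {
			return true
		}
	}
	return r.SkipExts[ext]
}

func main() {
	rules := FilterRules{
		SkipExts:   map[string]bool{".exe": true, ".bin": true},
		SkipDirs:   map[string]bool{".git": true, "node_modules": true},
		SkipHidden: true,
	}
	fmt.Println(rules.ShouldSkip("src/main.go"))             // false
	fmt.Println(rules.ShouldSkip("node_modules/x/index.js")) // true
	fmt.Println(rules.ShouldSkip("tools/build.exe"))         // true
}
```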

Environment variable examples:

  • MEMORIZER_DAEMON_HTTP_PORT=9000
  • MEMORIZER_GRAPH_HOST=redis.local
  • MEMORIZER_SEMANTIC_PROVIDER=google
  • MEMORIZER_SEMANTIC_ENABLED=false

Note: if you toggle semantic analysis on an existing dataset, run memorizer daemon rebuild (or restart the daemon to trigger the initial full walk) so previously discovered files are queued for semantic analysis.

Integrations

Agentic Memorizer integrates with AI coding assistants via hooks and MCP (Model Context Protocol):

Harness       Integrations
claude-code   claude-code-hook, claude-code-mcp
gemini-cli    gemini-cli-hook, gemini-cli-mcp
codex-cli     codex-cli-mcp
opencode      opencode-mcp, opencode-plugin

Set up an integration:

memorizer integrations setup claude-code-mcp

MCP Tools

The MCP server exposes a semantic search tool for harnesses:

  • search_memory - vector similarity search over stored chunk embeddings with optional filters (top_k, min_score, path_prefix, extension include/exclude) and optional snippet extraction.
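
Under assumed types, the tool's ranking step can be sketched as a brute-force cosine-similarity search with top_k, min_score, and path_prefix filters. This is only an analogue of what search_memory does; the server's actual storage and query path are not shown here:

```go
package main

import (
	"fmt"
	"math"
	"sort"
	"strings"
)

// Chunk and Hit are illustrative types, not the server's.
type Chunk struct {
	Path      string
	Embedding []float64
}

type Hit struct {
	Path  string
	Score float64
}

// cosine returns the cosine similarity of two equal-length vectors.
func cosine(a, b []float64) float64 {
	var dot, na, nb float64
	for i := range a {
		dot += a[i] * b[i]
		na += a[i] * a[i]
		nb += b[i] * b[i]
	}
	if na == 0 || nb == 0 {
		return 0
	}
	return dot / (math.Sqrt(na) * math.Sqrt(nb))
}

// searchMemory applies path-prefix and min-score filters, ranks the surviving
// chunks by similarity to the query embedding, and truncates to topK.
func searchMemory(chunks []Chunk, query []float64, topK int, minScore float64, pathPrefix string) []Hit {
	var hits []Hit
	for _, c := range chunks {
		if pathPrefix != "" && !strings.HasPrefix(c.Path, pathPrefix) {
			continue
		}
		if s := cosine(query, c.Embedding); s >= minScore {
			hits = append(hits, Hit{c.Path, s})
		}
	}
	sort.Slice(hits, func(i, j int) bool { return hits[i].Score > hits[j].Score })
	if len(hits) > topK {
		hits = hits[:topK]
	}
	return hits
}

func main() {
	chunks := []Chunk{
		{"docs/a.md", []float64{1, 0}},
		{"src/b.go", []float64{0, 1}},
		{"docs/c.md", []float64{0.7, 0.7}},
	}
	for _, h := range searchMemory(chunks, []float64{1, 0}, 2, 0.1, "docs/") {
		fmt.Printf("%s %.2f\n", h.Path, h.Score)
	}
}
```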

Prerequisites

  • Go 1.25.5 or later
  • FalkorDB (Redis Graph) instance
  • API keys for semantic/embeddings providers (as needed)

Installation

From source:

git clone https://github.com/leefowlercu/agentic-memorizer.git
cd agentic-memorizer
make build

Install to ~/.local/bin:

make install

License

This project is licensed under the MIT License. See LICENSE for details.

About

Agentic Memorization Utility for AI Agent Frameworks
