Arai — CLAUDE.md that actually works

Smart, not noisy

Intent-aware

"Never hand-write migrations" fires on Write but not Edit. Editing existing migrations is fine.

Code graph

tree-sitter scans your codebase. Writing to migrations/ triggers alembic rules even without "alembic" in the file.

Session tracking

"Never push without tests" silences after cargo test runs. Arai remembers what happened this session.

Block, don’t just advise

Rules with never/forbids/must_not emit permissionDecision: "deny" in Claude Code. always and prefers still advise. Incremental rollout via ARAI_DENY_MODE=off. Cursor and Windsurf get advisory enforcement via MCP — no other assistant exposes a deny hook today.

Per-rule rollout

arai severity alembic block pins one rule to deny while the rest of the set stays in advise. Survives arai scan. Ship the set in advise mode, watch which rules earn the trust, then flip them one at a time.

Compliance tracking

Every PostToolUse is correlated against its PreToolUse firings. Each rule gets an obeyed, ignored, or unclear verdict. arai audit --outcome=ignored tells you which rules the model keeps flouting; filter to a specific rule with --rule.

Derivation trace

Every firing carries source file, line, and parser layer. Hook output shows [CLAUDE.md:42 layer-1] — no more guessing why a rule fired.

Explain before you commit

arai why "git push --force" replays a hypothetical tool call through the live match pipeline. Read-only. Ship new rules with confidence.

Self-pruning rules

Annotate a rule with (expires 2026-12-31) or (until 2027-06-30). Arai filters it out after the date automatically — perfect for incident-driven rules that have a shelf life.

Audit log

Every firing is appended to a local JSONL you can query. See which rules fire, on which tools, for which prompts. Nothing leaves your machine.

Local-first, air-gap friendly

No network on the hook hot path. Enforcement, audit, compliance verdicts, and stats all run against the local SQLite + JSONL. Works offline, in restricted environments, and during outages — nothing to monitor, nothing to vendor-onboard.

Supply-chain hardened

Downloads verified against SHA-256 checksums.txt on every install path (curl, npm, cargo). arai:extends upstream policy fetches refuse loopback, RFC1918, link-local, cloud metadata, and redirects — and cached upstream files carry a SHA-256 sidecar so an at-rest tamper is detected before the rules reach the parser. MCP-source rules capped per project to bound a malfunctioning agent.

Hash-chained audit log

Every audit-log line carries prev_hash + hash (SHA-256 over canonical bytes); a per-day sidecar anchors the chain tip. arai audit --verify walks every day-bucket and exits non-zero on any tamper, reorder, or deletion. Owner-only on disk (0700/0600 on Unix; icacls-pinned on Windows). Day-buckets are retained indefinitely — bucketing is the segmentation, no auto-prune.

MCP authentication

The agent-facing MCP server supports an optional shared-secret via ARAI_MCP_AUTH_TOKEN. When set, initialize must present a matching token (constant-time compare) before any tool call succeeds. Open by default for backwards compatibility.

Per-rule compliance ratios

arai stats rolls up the audit log into fires / obeyed / ignored / ratio per rule. Now you can answer "is this rule actually working?" — not "is it firing?" The ⚠ flag highlights low-ratio rules with enough volume to mean it.

Token economics

Repeat firings of the same rule in a session emit a compact one-liner instead of re-injecting the full payload. arai stats surfaces a calibrated tokens saved estimate from suppressed repeats plus denied-and-honored mistakes — secondary signal, primary mission stays correctness.

Rule regression tests

arai test replays synthetic hook payloads through the live match pipeline. Catch rule behaviour drift before a real session does. CI-friendly JSON output.

Capture to replay

arai record turns real firings from the audit log into scenario fixtures. You don’t hand-write regression tests — you capture the ones that matter and pin them.

Preview before commit

arai lint CLAUDE.md shows exactly which rules a file produces with their classified intent. Iterate on wording without touching the DB.

Diff before save

arai diff CLAUDE.md shows what an edit would change in the live rule set — added, removed, moved — before you commit it. Pre-commit-hook fodder via --json.

Shared policies

Inherit org-wide rules with one directive: arai:extends https://.... Trusted per URL, HTTPS only, cached locally. No policy service — just a markdown file upstream.

Agent-authored guards

Runs as an MCP server. The agent can register new rules mid-session ("never touch /etc", "always run tests first") and Arai enforces them on the next tool call. arai_recent_decisions lets the agent self-check what it was just denied for — no more looping on the same refused action. MCP is also how Cursor and Windsurf integrate: same rule set, advisory only.

Zero noise

Only fires domain-specific rules. Principles already in CLAUDE.md stay silent.

Any LLM enrichment

Classify rules via Claude, Ollama, or any LLM CLI. Or use the built-in sentence transformer.

~30ms latency

End-to-end per tool call, dominated by binary launch. SQLite lookups on the hook path. No network calls. No LLM calls at runtime.

Audit & compliance — controls aligned with SOC 2 TSC

Arai gives you the evidence trail and the controls your InfoSec / procurement team will ask for. Arai itself is not a certified product — the certification is yours to pursue. The controls are designed to align with the SOC 2 Trust Service Criteria:

CC6.1 logical access — audit dir locked to owner (0700/0600 on Unix; icacls-pinned on Windows).
CC6.6 supply chain — SHA-256 binary verification on every install path; arai:extends SSRF-hardened transport plus cache-at-rest signature.
CC7.2 monitoring — per-firing JSONL audit log, hash-chained (prev_hash + hash) so tampering is detected by arai audit --verify.
CC7.3 detection — Pre/Post correlation produces obeyed / ignored / unclear verdicts per rule. Evidence of effectiveness, not just attempts.
CC8.1 change management — arai diff, arai test, arai record, and the synthetic parser-coverage corpus turn rule edits into CI assertions.
CC9.2 vendor management — MCP server supports an optional shared-secret (ARAI_MCP_AUTH_TOKEN); input caps bound a misbehaving agent.

Full TSC-mapped feature inventory in the compliance procurement doc.

CLAUDE.md that actually blocks.

Rules fire when they matter — and actually block

And then prove it — per-rule compliance, locally