Why Faramesh

What agent governance is for, who it helps, and when to reach for Faramesh before something goes wrong in production.

The 60-second version

An AI agent calls tools. Read mail, move money, open pull requests, talk to customers. The model picks which tool to call, with which arguments, and when.

The question worth answering before you ship: what prevents a bad prompt, a confused model step, or a stolen API key from running anything those tools can run?

If the answer is "we trust the prompt and the model," the agent does not have governance. It has hope. Faramesh replaces that with a contract.

You write one file, governance.fms, that declares what the agent is allowed to do. A small daemon runs alongside the agent and decides every tool call before it executes: permit, defer to a human, or deny. Each decision is signed and recorded. Long-lived secrets never enter the agent process; credentials are minted at the call site, when policy permits.

That is the product. The SDKs, the proxies, and the registry exist to put that decision in the right place.

What problem this solves

Agents are software that takes ambiguous natural-language input and turns it into authoritative side effects. The failures that matter in production are not "the model gave a bad answer". They look like this:

The agent ran a refund. The model thought the user said $8000. The user said $80.
The agent emailed a customer's social security number to support@public-domain.com.
A jailbroken prompt convinced the agent to call git push --force against main.
A leaked API key from the agent's environment showed up in someone else's product.
A junior engineer reverted a guardrail in governance.fms and nobody caught it for three weeks.

None of these are prevented by a smarter prompt. They are prevented by a deterministic check at the moment of action, plus evidence after the fact, plus secrets the agent does not hold. Faramesh provides all three.

What you get

Predictable behavior. Permit, deny, and defer are explicit. The decision engine is not an LLM. The same input always produces the same decision.
Evidence by default. Every call produces a Decision Provenance Record. The chain is hash-linked and, optionally, KMS-signed. Verifiable offline with faramesh audit verify.
Defense in depth. SDK shim for native agents, MCP proxy for Claude Code and Cursor, HTTP proxy for hosted runtimes. Pick the tier the agent needs.
Portable policy. Versioned imports from the public catalog: providers, policy packs, framework profiles. Pin them. Audit them. Mirror them.
Optional OS sandbox. On Linux (seccomp plus Landlock) and macOS (Seatbelt), runtime { os_tier = true } adds syscall-level enforcement so even a malicious tool cannot bypass the daemon.

The story in one scene

A payment agent tries to refund $8,000. Policy says anything over $500 requires a human:

governance.fms

agent "support-bot" {
  rules {
    permit stripe/refund if amount < $500
    defer  stripe/refund if amount >= $500
    deny   stripe/payouts
  }
}

What happens, in order:

The model decides to call stripe/refund with { amount: 8000 }.
The Faramesh SDK shim intercepts the call and sends it to the local daemon.
The daemon evaluates the rule. amount < $500 is false. amount >= $500 matches. Effect: defer.
The agent receives a ToolDeniedException carrying a defer token. The refund does not happen.
A notification fires on #payments-approvals, configured by an alert block in the policy.
An operator runs faramesh approvals approve <token> or clicks Approve in the UI.
The agent retries. The daemon now permits the call once, mints a Stripe key with a 30-second TTL, runs the refund, and writes a signed record naming the operator who approved it.

That loop, declare, enforce, record, review, is the entire product.

Who this is for

Role	Start here
Agent developer building tools	Quickstart, then Govern a LangGraph agent.
Security and GRC asking for evidence	Auditing agent decisions.
Platform and SRE operating it for many teams	Deploying at scale.
Evaluating tools	How Faramesh compares.

What Faramesh is not

Not a model gateway. It does not proxy LLM completions. Use a model gateway for that and place Faramesh on the tool side.
Not a prompt firewall. Prompt-injection defense is one input to a policy. It is not the policy itself.
Not a SaaS dependency. The daemon, SDKs, policy engine, and local audit path are open source and run without a hosted service in the enforcement path.
Not a heavyweight install. faramesh dev boots in-process stubs for Vault, SPIFFE, KMS, and the audit sink. The whole stack runs on a laptop with no network access.

Next steps

New here? Read Quickstart. Five minutes, no infrastructure required.
Want the mental model first? Read How Faramesh works.
Designing a deployment? Read Architecture and Topologies.
Ready for examples? Try the LangGraph tutorial or Write your first policy.