agent supervision & governance · LLM-agnostic

Q · Agent Board

The Kanban board to supervise and govern AI agents in real time. You see what every agent is doing, and irreversible actions stop at your gate until you approve — without relying on the model behaving.

▶ See the board demo How it works ◆ Capital

Zero dependenciesPython stdlibVerifiable auditOpen source · MIT

⚡ Out of budget

Audit deps

opus · throttled: budget €

Working2/5⚡€1.4/€1.5

Map the repo

mistral-local · €0.00

Refactor client

gpt-4o · €1.40

Needs Input

Delete resources

awaiting approval

Review

Verify fix

opus

Done

Docs written

sonnet ✓

There's a dedicated lane, ⚡ Out of budget: the expensive agent (opus) lands there because the foundational budget is maxed out (⚡€1.4/€1.5) — you can see why it's paused; the local one got in at no cost. It returns to Working once the window frees up.

01 · what it is

A control layer between your agents and what they can touch

Q · Agent Board doesn't orchestrate agents: it observes and governs them. Each agent is a card on a Kanban board; every effectful action —writing, deploying, deleting, calling an API— crosses a gate that decides allow · ask · deny and, when it matters, waits for your approval. Everything ends up in a verifiable audit trail.

who it's for

Teams launching fleets of agents

Dynamic workflows with dozens of agents running in parallel that need visibility and brakes, not just logs.

who it's for

Platforms with tools / MCP

Any stack where agents act through tools: the gate lives in the tool, no matter the LLM.

who it's for

Anyone who needs an audit trail and cost under control

Tamper-evident audit and budgets in € that separate the local (~free) from the foundational (expensive).

02 · the problem

Launching many agents is powerful. And opaque. And dangerous.

Dynamic workflows fire off dozens of agents in parallel. They do plenty of work, but you don't see what they touch, and one irreversible action —deleting data, deploying, rotating keys— can take down production.

The usual reflex is to ask the model, in the prompt, to "be careful" or "ask before deleting". That's a convention: the model can ignore it, get it wrong, or be manipulated by prompt injection. That's not control.

03 · the approach

Control where the agent acts, not where it thinks

Control doesn't go in the agent's brain (the LLM, infinite and unpredictable), but in the tool through which every effectful action passes. If the gate lives inside the tool, the effect is physically unreachable without crossing it — no matter which model invoked it.

Don't try to certify every driver. Put a checkpoint at the toll of the only road that leads to the dangerous operations. Whoever wants to act crosses the toll.

04 · how it works

From the agent's intent to the real effect

Every effectful action runs through four deterministic steps before touching anything. Only what's marked ask interrupts you; the rest flows through or gets blocked without bothering you.

RBAC

Can this role even attempt it?

→

Policy

Deterministic rules: allow · ask · deny.

→

Board

Only what's marked ask waits for your sign-off.

→

Audit

Verifiable chain: every decision is logged along with its origin.

allow → executes ask → waits for the human deny → never

See the adoption journey step by step →

05 · try it

The same board, three modes

It's all self-contained HTML: the simulated demo opens right in the browser, with nothing to install.

▶ simulated demo	Simulated demo. Fictional agents advancing through the columns. No backend — perfect for showing it off.
`?feed=/api/state`	Live against the broker. Real approvals (two-key). Requires `python3 hooks/broker.py`.
`?feed=board-state.json`	Offline observability (read-only) from the state the hooks write.
▶ demo · Claude Science	Profiles and units. The board adapts to the specific agents of your project and splits across departments: this one opens Claude Science (Genomics Analyst, Biophysics Engine, mRNA Designer…). Each unit sees its own with `?profile=claude-science&unit=Terapéutica`. Source of truth: `config.json`.
◆ capital board	Second view, same substrate. It reads the audit, the cost and the approvals as capital that accrues (the Token Capital concept): captured human judgment, data for evals and the moat. One system, two lenses.

06 · what's inside

The four pillars

Agent Kanban board

Each agent is a card that moves through columns. You see live what it's doing, its cost and time, and the results. Multidimensional WIP measured in cost: the Working limit is number of agents, total € and € on foundational models — a local token is ~free, an Opus/GPT-4o one is not.

Two-key approval

Effectful operations stop and wait for your Approve/Deny. Approving requires an operator token: an agent can't self-approve.

Deterministic policy gate

The decision is made by rules in code (policy.json): allow / ask / deny per tool and domain. Reproducible and audited — not negotiable by the model.

Multi-model roles

A different LLM per role: explorer, auditor, implementer, verifier, documenter. A verifier from a different family kills the self-approval bias.

07 · why you can trust it

LLM-agnostic, and aware of its limits

Because the gate lives in the tool (via MCP) and not in the client, the same control works for any agent — the ones we have today and the ones that ship tomorrow. Each one plugs in through its own surface:

Claude Code · native plugin

ChatGPT · MCP

Antigravity · MCP

Grok · orchestrator

Local LLMs · orchestrator

Deterministic: same tool + same arguments → always the same decision. It's the code that decides, not the model.
Auditable: hash-chained log, verifiable (audit.py verify); every decision, by rule or human, tied to the payload hash.
With resource and cost limits: aggregate windowed quotas (they shut down chunking) and budgets in € that distinguish local models (~free) from foundational ones (expensive).
Fail-closed: without approval, nothing runs. The system doesn't loosen up when something breaks.
With a threat model that enumerates where an action could slip through and how to close it off — including what it doesn't cover yet.

The gate governs the actions that pass through the tool, not what the model says or reasons about. Documenting the limit is part of closing it well: it avoids a false sense of security.