AdminGovernanceOps

Admin & Governance Guide

A practical operating guide for administrators: budgets, policies, usage, auditability, and incident response with request-id correlation.

Open Usage Open Org Settings

Recommended screen flow

Login → Org → Workstream → Mission → Run → Outcome → Policy Trace → Usage

Operators work left-to-right: navigation (left), execution (center), evidence (right).

Key governance controls

Daily USD budget
Daily token budget
Allowed model allowlist
PII egress allowlist

Operational endpoints

/healthzprocess up

/readyzdeps ready

/docsAPI console

/openapi.jsonspec

1) Roles and responsibilities

Owner / Admin

Set budgets and allowlists
Manage API keys
Review policy traces and audit logs
Respond to incidents

Member

Create missions
Run workflows within budgets
Produce outcomes and exports

Viewer

Read-only access
Audit review and evidence inspection

2) Configure budgets and model allowlists

Go to Org Settings and configure:

Daily cost cap (USD)

Hard limit. Requests are denied when the day’s total reaches the cap.

Daily token cap

Hard limit across input + output tokens for the org (UTC day).

Allowed models

Allowlist models permitted for this org (e.g., gpt-4.1, gpt-4o-mini).

PII allowed providers

Restrict which providers may receive PII. Use this for regulated environments.

Recommended: start conservative.

Example org_settings (settings JSONB):
{
  "max_cost_usd": 25,
  "max_tokens_day": 250000,
  "allowed_models": ["gpt-4o-mini", "gpt-4.1-mini"],
  "pii_allowed_providers": ["openai"]
}

Metering uses model pricing when configured; otherwise a conservative default cost-per-1k heuristic is used.

3) Understand policy trace and denials

Every governed request produces a Policy Decision. In the Outcome modal, open Policy Trace.

Allowed: policy checks passed; execution proceeded.
Denied: a policy check failed; request blocked before execution.
Policy decisions include the request id for correlation.

Common denial reasons:
- model not in allowlist
- daily cost cap exceeded
- daily token cap exceeded
- PII egress restricted (when enabled)

4) Usage and budgets (how to interpret)

Open Usage & Budgets to view real-time totals for the current day.

Cost used today

Sum of metered requests for 2026-01-21 (UTC day).

Tokens used today

Input + output tokens across chat, RAG and optimization endpoints.

If you hit budget caps: increase caps, narrow model allowlist, or reduce high-frequency workflows.

5) Incident response with request-id correlation

When users report an issue, capture the Request ID shown in error banners/toasts (or from the policy trace).

Triage flow:
1) Ask for Request ID
2) Search gateway logs for that request id (x-request-id)
3) Inspect policy decision record (allowed/denied + reasons)
4) Validate org settings at time of event
5) Check /readyz for dependency health

Suggested operator actions:

Denied by budget: raise cap or pause high-volume missions
Denied by allowlist: add model to allowlist or switch default model in UI
Repeated failures: check Supabase status, gateway env vars, and rate limiting

6) Operational checklist (production)

Verify /healthz returns 200
Verify /readyz returns 200 (Supabase reachable)
Ensure budgets + allowlists are set for each org
Confirm policy decisions are being recorded for governed endpoints
Confirm Usage totals reflect expected activity

For regulated workloads, enable strict allowlists and restrict PII provider egress.

Admin Guide • Print-friendly • Designed for operational handoff

User guide Usage Settings