Black_Wall
The risks your rules never saw coming.
Everyone else gates: allow or deny. Black_Wall forecasts — it reasons about each agent action and shows what would actually happen (the outcome, the blast radius, the safer move) plus a GO / CONFIRM / STOP verdict, before it runs. Catches what a rule never could: prompt injection, anomalous amounts, PII in the wrong place. Free tier.
You don't build high-velocity engines without investing heavily in brakes. Black_Wall is the mechanical brake system for autonomous infrastructure — agents, CI/CD pipelines, admin dashboards, anywhere arbitrary inputs reach production.
Rules vs. reasoning
Policy and auth engines are fast, deterministic, and essential — and they only catch what someone wrote a rule for. The actions that hurt are the ones nobody anticipated.
“Is this allowed?”
Deterministic, sub-second — perfect for the limits you can name in advance.
- Never wire over $10k without approval
- This agent can’t touch the prod database
- Block these domains and these tools
“Is this about to go wrong?”
Reasons over each action — catching the risks no rule could enumerate.
- Prompt injection hidden in scraped content
- A $48k payment that’s 240× this vendor’s norm
- PII buried in a free-text reply
- “Remove one row” that became DROP TABLE
Run it alongside your policy or auth layer — they enforce the knowns, Black_Wall catches the unknowns. Slower than a rule check (a few seconds), because it’s thinking. Worth it on the actions you can’t afford to get wrong.
Run a forecast right now. No signup.
Pick an action your agent might take — watch Black_Wall gate it in a few seconds.
Every production AI agent is one bad action away from a really bad day.
The recovery tools are everywhere. Rollback. Insurance. Observability. All of them activate after the damage. Black_Wall holds the line before.
The mis-sent invoice.
Your AR collections agent dunns the wrong customer. Your CTO finds out from Twitter. Four hours of damage control. Trust permanently dented.
The destructive query.
Cleanup agent omits the WHERE clause. 50M rows updated. 2am page to on-call. Six weeks of partial recovery. Possibly a postmortem you publish.
The hijacked action.
A web page tells your agent to exfiltrate payment terms. Indirect prompt injection. Your CEO finds out from procurement. Your CEO is not amused.
The more autonomous your agent, the more it needs a gate.
A human-in-the-loop agent has a safety net: the human. A fully independent one has nothing catching the bad action — so the pre-action check stops being optional and becomes the only thing between your agent and a really bad day.
Black_Wall isn't a babysitter — it's the escalation valve. GO lets it run; CONFIRM and HUMAN_REQUIRED are how a fully autonomous agent knows the rare moment to pull a human back in. You let it run unattended because something is watching the actions it can't undo.
Not just AI agents. Anywhere parameters meet high stakes.
The same forecast endpoint that gates an autonomous AI agent gates a CI/CD deploy, an admin form submit, a webhook fan-out, or an outbound API burst. Anywhere arbitrary inputs reach systems with real consequences, Black_Wall is the brake.
DevOps & CI/CD pipelines.
A GitHub Actions or GitLab CI runner forwards the proposed terraform plan or database migration script to /forecast before apply.
Catches accidental main-branch destruction, un-backed-up schema rewrites, cross-environment resource leaks.
Live database proxy.
Admin GUIs and CLI database clients pass raw user-typed SQL through the gate before the driver executes it.
Catches DELETE / DROP TABLE with no WHERE clause (SQL_NO_WHERE), production-targeted queries from staging.
High-stakes admin dashboards.
Backend handlers for support, billing, and config-override forms route the structural payload through the gate before commit.
Catches numeric-scale anomalies (extra zero on a refund) and batch-mode mistakes (whole-table delete instead of single ID).
Low-code platforms (Zapier / Make).
A webhook intercept node sits in front of high-impact outbound modules in the automation tree.
Catches infinite loops dumping records to a public Slack channel, mass webhook bursts from a single misconfigured variable.
Third-party API cost limiters.
Outbound requests bound for Telnyx, Stripe, or AWS SES route through /forecast first to verify rate, volume, and recipient composition.
Catches infinite phone-call loops, mass-recipient email bursts (MASS_RECIPIENT), runaway billing exposure.
Autonomous AI agents.
A drop-in plugin or MCP server gates every tool call before the framework dispatches it — ElizaOS, OpenClaw, Hermes, plus any MCP-compatible host.
Catches prompt-injection-driven actions, scope creep, irreversible commits without backup. The original use case — and still the largest one.
One endpoint, six vectors. Each forecast returns the same GO / CAUTION / STOP verdict, the same 28 named failure modes, and the same Ed25519-signed receipt — verifiable offline against the published public key.
One endpoint. Five steps. Before the action runs.
-
1
Agent describes action
POST /v1/forecast with the intent, inputs, and one sentence of context.
-
2
Black_Wall predicts the outcome
A reasoning-tuned model runs the action against 28 documented failure modes. Typical verdict: 4-8s — a beat of thought before something irreversible.
-
3
Returns verdict
Risk score (0–100). Red flag list with codes + severity. Recommendation: GO / CAUTION / STOP.
-
4
Agent decides — informed
Execute. Retry with safer inputs. Or escalate to a human. The agent stays in control. We just give it eyes.
-
5
Outcome reported back
PATCH the actual result. Builds the dataset that makes every future forecast sharper. Free — no tokens charged.
$ curl -X POST https://blackwalltier.com/api/v1/forecast \ -H "Authorization: Bearer bw_live_***" \ -H "Content-Type: application/json" \ -d '{ "action": "send_email", "inputs": { "to": "sarah@acme.com", "subject": "Invoice #4521 — $12,400 due", "body": "Hi Sarah, attached..." }, "context": { "agent_role": "AR collections bot", "user_intent": "Send overdue reminder" } }'
One line. Works in Claude, Cursor, Windsurf.
Black_Wall ships as an MCP server. No SDK, no REST plumbing — drop one config block into any MCP host and your agent gets a forecast tool it calls before acting.
{ "mcpServers": { "blackwall": { "command": "npx", "args": ["-y", "blackwall-mcp"], "env": { "BLACKWALL_API_KEY": "bw_live_***" } } } }
Not ready to block? Watch first.
Add Black_Wall in observe mode. It scores and logs every action your agents take but never blocks anything — zero behavior change, safe to add in 60 seconds. A week later, see exactly what it would have caught.
$ # one extra line — never touches your agents BLACKWALL_MODE=observe # every action gets scored + logged. # nothing is blocked. nothing breaks. # flip to "enforce" when you're ready.
Stripe wasn't the first payment processor. Twilio wasn't the first SMS gateway. Black_Wall isn't the first agent guardrail.
It will be the one that holds the line.
Token-metered. Pay only for what your agent forecasts.
Free tier never expires. Token packs never expire. Volume discounts kick in automatically at 1M+ tokens/month.
- 16,500 tokens / month (~100 forecasts)
- 60 requests / min
- Email support (best effort)
- Public docs
- 50,000 tokens / month
- 300 requests / min
- Email support < 24h
- Usage analytics
- 500,000 tokens / month
- 1,200 requests / min
- Priority support < 4h
- Decision-receipt audit trail
- depth: deep enabled
- Custom token volume
- SSO + custom SLA
- Private / VPC deployment
- Dedicated support & onboarding
- Scoped to your contract
Hold the line.
Before your agent breaks it.
Get your free API key. No credit card. First forecast in ~5 minutes.
GET KEY WITH GITHUB — 1 CLICKFree tier — ~100 forecasts/month. No credit card. First forecast in ~5 minutes.