v1.0_LIVE DEFENSE_LAYER :: ACTIVE_FOR_AI_AGENTS :: GLOBAL_AGENT_NETWORK

Black_Wall

A BLUETIER PRODUCT

The risks your rules never saw coming.

Everyone else gates: allow or deny. Black_Wall forecasts — it reasons about each agent action and shows what would actually happen (the outcome, the blast radius, the safer move) plus a GO / CONFIRM / STOP verdict, before it runs. Catches what a rule never could: prompt injection, anomalous amounts, PII in the wrong place. Free tier.

You don't build high-velocity engines without investing heavily in brakes. Black_Wall is the mechanical brake system for autonomous infrastructure — agents, CI/CD pipelines, admin dashboards, anywhere arbitrary inputs reach production.

UPSTART_COST
$0
Free tier never expires. No card.
VERDICT_IN
4-8s
A reasoned check before the action runs.
FAILURE_MODES
28
Documented failure modes checked on every call.

Rules vs. reasoning

Policy and auth engines are fast, deterministic, and essential — and they only catch what someone wrote a rule for. The actions that hurt are the ones nobody anticipated.

A POLICY RULE ANSWERS

“Is this allowed?”

Deterministic, sub-second — perfect for the limits you can name in advance.

  • Never wire over $10k without approval
  • This agent can’t touch the prod database
  • Block these domains and these tools
BLACK_WALL ANSWERS

“Is this about to go wrong?”

Reasons over each action — catching the risks no rule could enumerate.

  • Prompt injection hidden in scraped content
  • A $48k payment that’s 240× this vendor’s norm
  • PII buried in a free-text reply
  • “Remove one row” that became DROP TABLE

Run it alongside your policy or auth layer — they enforce the knowns, Black_Wall catches the unknowns. Slower than a rule check (a few seconds), because it’s thinking. Worth it on the actions you can’t afford to get wrong.

Run a forecast right now. No signup.

Pick an action your agent might take — watch Black_Wall gate it in a few seconds.

Every production AI agent is one bad action away from a really bad day.

The recovery tools are everywhere. Rollback. Insurance. Observability. All of them activate after the damage. Black_Wall holds the line before.

CRIT_001
VECTOR :: SEND_EMAIL

The mis-sent invoice.

Your AR collections agent dunns the wrong customer. Your CTO finds out from Twitter. Four hours of damage control. Trust permanently dented.

CRIT_002
VECTOR :: RUN_SQL

The destructive query.

Cleanup agent omits the WHERE clause. 50M rows updated. 2am page to on-call. Six weeks of partial recovery. Possibly a postmortem you publish.

CRIT_003
VECTOR :: PROMPT_INJECTION

The hijacked action.

A web page tells your agent to exfiltrate payment terms. Indirect prompt injection. Your CEO finds out from procurement. Your CEO is not amused.

The more autonomous your agent, the more it needs a gate.

A human-in-the-loop agent has a safety net: the human. A fully independent one has nothing catching the bad action — so the pre-action check stops being optional and becomes the only thing between your agent and a really bad day.

Black_Wall isn't a babysitter — it's the escalation valve. GO lets it run; CONFIRM and HUMAN_REQUIRED are how a fully autonomous agent knows the rare moment to pull a human back in. You let it run unattended because something is watching the actions it can't undo.

Not just AI agents. Anywhere parameters meet high stakes.

The same forecast endpoint that gates an autonomous AI agent gates a CI/CD deploy, an admin form submit, a webhook fan-out, or an outbound API burst. Anywhere arbitrary inputs reach systems with real consequences, Black_Wall is the brake.

VECTOR :: CI_CD

DevOps & CI/CD pipelines.

A GitHub Actions or GitLab CI runner forwards the proposed terraform plan or database migration script to /forecast before apply.

Catches accidental main-branch destruction, un-backed-up schema rewrites, cross-environment resource leaks.

VECTOR :: DB_PROXY

Live database proxy.

Admin GUIs and CLI database clients pass raw user-typed SQL through the gate before the driver executes it.

Catches DELETE / DROP TABLE with no WHERE clause (SQL_NO_WHERE), production-targeted queries from staging.

VECTOR :: ADMIN_FORMS

High-stakes admin dashboards.

Backend handlers for support, billing, and config-override forms route the structural payload through the gate before commit.

Catches numeric-scale anomalies (extra zero on a refund) and batch-mode mistakes (whole-table delete instead of single ID).

VECTOR :: WEBHOOK_FANOUT

Low-code platforms (Zapier / Make).

A webhook intercept node sits in front of high-impact outbound modules in the automation tree.

Catches infinite loops dumping records to a public Slack channel, mass webhook bursts from a single misconfigured variable.

VECTOR :: OUTBOUND_PROXY

Third-party API cost limiters.

Outbound requests bound for Telnyx, Stripe, or AWS SES route through /forecast first to verify rate, volume, and recipient composition.

Catches infinite phone-call loops, mass-recipient email bursts (MASS_RECIPIENT), runaway billing exposure.

VECTOR :: AGENT_FRAMEWORKS

Autonomous AI agents.

A drop-in plugin or MCP server gates every tool call before the framework dispatches it — ElizaOS, OpenClaw, Hermes, plus any MCP-compatible host.

Catches prompt-injection-driven actions, scope creep, irreversible commits without backup. The original use case — and still the largest one.

One endpoint, six vectors. Each forecast returns the same GO / CAUTION / STOP verdict, the same 28 named failure modes, and the same Ed25519-signed receipt — verifiable offline against the published public key.

One endpoint. Five steps. Before the action runs.

  1. 1

    Agent describes action

    POST /v1/forecast with the intent, inputs, and one sentence of context.

  2. 2

    Black_Wall predicts the outcome

    A reasoning-tuned model runs the action against 28 documented failure modes. Typical verdict: 4-8s — a beat of thought before something irreversible.

  3. 3

    Returns verdict

    Risk score (0–100). Red flag list with codes + severity. Recommendation: GO / CAUTION / STOP.

  4. 4

    Agent decides — informed

    Execute. Retry with safer inputs. Or escalate to a human. The agent stays in control. We just give it eyes.

  5. 5

    Outcome reported back

    PATCH the actual result. Builds the dataset that makes every future forecast sharper. Free — no tokens charged.

REQUEST.SH
$ curl -X POST https://blackwalltier.com/api/v1/forecast \
  -H "Authorization: Bearer bw_live_***" \
  -H "Content-Type: application/json" \
  -d '{
    "action": "send_email",
    "inputs": {
      "to": "sarah@acme.com",
      "subject": "Invoice #4521 — $12,400 due",
      "body": "Hi Sarah, attached..."
    },
    "context": {
      "agent_role": "AR collections bot",
      "user_intent": "Send overdue reminder"
    }
  }'
RESPONSE :: PREDICTION_FRAME
CAUTION
RISK 62
CONF::0.81 LAT::3.4S
MED
AMOUNT_NOT_FRESHLY_VERIFIED
Invoice amount last verified 14 minutes ago. Recommend re-verify within 5 min of dunning emails per industry norm.
LOW
TONE_MISMATCH
Customer has 23 prior emails on file with informal tone. This message reads out of character.

One line. Works in Claude, Cursor, Windsurf.

Black_Wall ships as an MCP server. No SDK, no REST plumbing — drop one config block into any MCP host and your agent gets a forecast tool it calls before acting.

claude_desktop_config.json
{
  "mcpServers": {
    "blackwall": {
      "command": "npx",
      "args": ["-y", "blackwall-mcp"],
      "env": { "BLACKWALL_API_KEY": "bw_live_***" }
    }
  }
}

Not ready to block? Watch first.

Add Black_Wall in observe mode. It scores and logs every action your agents take but never blocks anything — zero behavior change, safe to add in 60 seconds. A week later, see exactly what it would have caught.

OBSERVE.ENV
$ # one extra line — never touches your agents
BLACKWALL_MODE=observe

# every action gets scored + logged.
# nothing is blocked. nothing breaks.
# flip to "enforce" when you're ready.
WEEKLY_DIGEST :: WHAT YOUR AGENTS ALMOST DID · SAMPLE
3
high-stakes actions flagged — example weekly digest
STOP
run_sql · risk 99
DELETE without WHERE — would have wiped the entire users table.
STOP
send_sms_campaign · risk 92
TCPA violation — 4,200-recipient blast to a non-consented list.
STOP
make_payment · risk 92
$48,000 wire — 100× the stated $480 intent. Intent mismatch.
Stripe wasn't the first payment processor. Twilio wasn't the first SMS gateway. Black_Wall isn't the first agent guardrail.

It will be the one that holds the line.
// FOUNDERS_NOTE

Token-metered. Pay only for what your agent forecasts.

Free tier never expires. Token packs never expire. Volume discounts kick in automatically at 1M+ tokens/month.

RANK :: 00 / OBSERVER
Bystander
$0/mo
  • 16,500 tokens / month (~100 forecasts)
  • 60 requests / min
  • Email support (best effort)
  • Public docs
DEPLOY →
RANK :: 02 / OVERWATCH
Hostage Negotiator
$199/mo
  • 500,000 tokens / month
  • 1,200 requests / min
  • Priority support < 4h
  • Decision-receipt audit trail
  • depth: deep enabled
START TRIAL →
RANK :: 03 / SOVEREIGN
Crisis Command
Let's talk
  • Custom token volume
  • SSO + custom SLA
  • Private / VPC deployment
  • Dedicated support & onboarding
  • Scoped to your contract
CONTACT →

Hold the line.
Before your agent breaks it.

Get your free API key. No credit card. First forecast in ~5 minutes.

GET KEY WITH GITHUB — 1 CLICK
OR EMAIL

Free tier — ~100 forecasts/month. No credit card. First forecast in ~5 minutes.