Pipeline gateway · multi-agent fleets

Your agents
can't burn more
than you let them.

One gateway. Fleet cost caps, loop detection, dedup cache, audit logs. Drop in front of any LLM API — five lines, two minutes.

Start free →read the docs

proxy mode · drop-in

Live simulation · loop detection

Agents hit the loop. Pipeline pauses.

4 agents chain a task. The pipeline budget caps at $3.00. When the Writer ↔ Reviewer loop pushes cumulative spend past the cap, the gateway pauses mid-flight — before the bill compounds.

Monthly cap

$5.00

of $5.00 / mo remaining

Pipeline budget

$0.00 / $3.00

Researcher· Gather 5 sources

$0.80queued

Writer· Draft summary

$1.50queued

Reviewer· Request rewrite

$0.60queued

Writer retry· Re-draft (loop detected)

$1.20queued

client-side · no signup

Six primitives, one gateway

Everything you need to hold the gate.

From a five-line drop-in to fleet-wide audit logs. Each primitive ships independently — pick what you need, ignore the rest.

01◇

Drop-in Proxy

No decorator, no callback handler, no framework adapter. Point your agent at the gateway URL with a bearer token and you get budget, loop detection, and cache automatically. Works with LangChain, CrewAI, AutoGen, LlamaIndex, or any framework that makes HTTP calls.

02◈

Fleet Budget Enforcement

Cap how much an entire pipeline can spend — not per-agent, per-fleet. When the ceiling hits, every agent in the pipeline pauses automatically. Monitoring tells you what went wrong; enforcement stops it from continuing.

03◉

Multi-Agent Dedup Cache

Five agents in the same pipeline call the same upstream API. The first request hits upstream; the next four are served from cache. Dedup happens across the pipeline, not per-agent — savings stack quietly while your monthly burn drops without code changes.

04◎

Runaway Loop Detection

When two agents enter an infinite back-and-forth, the gateway catches the cycle by tracking call patterns at the pipeline level. The pipeline pauses, a webhook fires, the team wakes up to a flagged session instead of a five-figure invoice.

05▣

Per-Agent Attribution

Every call routed through the gateway is tagged with the originating agent. The dashboard shows which agent burned what — so when something runs hot, you know exactly which agent caused the bottleneck and how much that specific run cost.

06◆

Optional: x402 Settlement

If your agents pay for APIs on-chain instead of through API keys, Gatewards implements the open x402 protocol end-to-end — USDC settlement on Base, EIP-3009 transfers, JWT receipts. Independent of proxy mode; opt-in only when you need it.

Cache moat · pipeline-wide

Watch dedup happen.

First call from the pipeline hits upstream. Every other agent making the same call is served from cache — across the whole pipeline, not per-agent.

● first · upstream

● next · cache hit

savings stack

Integration · drop-in

One URL change. Zero agent code.

01Proxy mode (default)

Drop-in HTTP proxy

Point your agent at the gateway URL. Pipeline budget, loop detection, and shared cache happen automatically — your agent code is unchanged. Works with any framework that makes HTTP calls.

// Drop in. No decorator, no callback handler.
// Point your agent at the Gatewards gateway.

const upstream = 'https://api.coingecko.com/api/v3/simple/price?ids=bitcoin'

const data = await fetch(
  `${GATEWAY}/api/v1/proxy/${encodeURIComponent(upstream)}`,
  { headers: { Authorization: `Bearer ${process.env.GATEWARDS_API_KEY}` } }
).then(r => r.json())

// Pipeline budget, loop detection, and dedup cache happen
// at the gateway. Your agent code is unchanged.

02x402 mode (optional)

Charge for your API on-chain

If you sell access to a paid API, Gatewards also implements the open x402 protocol. Set a price, receive USDC on Base, the SDK handles 402 responses + JWT receipts.

// Optional: charge for your API on-chain via x402.
import { createPaymentRequiredMiddleware } from '@gatewards/merchant-sdk'

app.get('/api/data',
  createPaymentRequiredMiddleware({
    price: '0.10',
    wallet: '0xYourWallet',
    network: 'base',
    gatewayPublicKey: process.env.JWT_SECRET,
  }),
  (req, res) => res.json({ data: 'premium' })
)

Live Settlements

No per-seat tax · switch tier any month

Pay for the fleet, not the seat.

Self-host free under MIT, or use the hosted gateway with simple usage caps. Every tier feature ships in the binary — managed plans add infra, dashboard, and support.

Free

Hobby + evaluation

✓1 pipeline
✓3 agents
✓100K events / month
✓Self-host the gateway free under MIT

Start free

Solo

Indie devs

$29/ mo

✓3 pipelines
✓10 agents
✓500K events / month
✓Email support

Start Solo

Most picked

Team

Small team

$99/ mo

✓10 pipelines
✓50 agents
✓5M events / month
✓Priority support + Slack

Start Team

Pro

Mid-market

$299/ mo

✓Unlimited pipelines
✓200 agents
✓50M events / month
✓SSO + audit log

Start Pro

Enterprise

Large fleets

Custom

✓Unlimited everything
✓Compliance export + SLA
✓Priority routing + dedicated CSM
✓On-prem deployment option

Talk to us

Self-host · MIT · every tier feature · no caps

Optional: x402 on-chain settlement available on every tier — opt in when you want agents paying for APIs in USDC on Base. 1% router fee on settled volume; tier subscription unaffected.

Your agentscan't burn morethan you let them.