Untangled API

How Untangled bills and rate-limits — Heuristic Effort Units (HEU) and the plan ladder.

Heuristic Effort Units

Untangled does not bill per request, per token, or per page — those metrics are fragile. We bill per Heuristic Effort Unit (HEU), a single number that scales with how much actual work each call took. A short receipt costs less than a 40-page contract, and you can see exactly how much before each call is committed.

The HEU formula is internal and stable across model upgrades — it is not the underlying provider's token count. We absorb the variance so your bill doesn't spike when we change models.

The response includes the HEU charge for transparency:

{
  "vendor": "Acme Co.",
  "total": 1250.00,
  "_meta": {
    "heu": 12,
    "plan_remaining": 99988
  }
}

Plan ladder

Plan	HEU / month	Concurrent	Overage
Free	500K	2	hard stop
Starter	8M	8	$3 / MTok Haiku
Pro	30M	32	$3 / MTok Haiku · $9 / MTok Sonnet
Scale	200M	128	$3 / MTok Haiku · $9 / MTok Sonnet

Free stops returning new responses once the bucket hits zero. There is no overage path.
Paid plans keep returning responses past the included bucket — overage is billed monthly at the model rates above.
Concurrent is the cap on simultaneous in-flight requests; exceeding returns 429 too_many_requests.

Buckets reset on the first of each month at 00:00 UTC.

Pre-flight HEU estimate

You can estimate cost before committing — pass ?estimate=true to any /v1/extract/* endpoint. The request returns the projected HEU charge without producing a structured response. Useful for batch jobs where you want to budget upfront.

curl "https://api.untangledapi.com/v1/extract/invoice?estimate=true" \
  -H "x-api-key: $UNTANGLED_KEY" \
  -H "content-type: application/json" \
  -d '{ "document": { "text": "..." } }'

{ "estimate": { "heu": 12 } }

Estimate calls do not consume your HEU bucket and are not rate-limited the same way live calls are.

Models

Heuristic Effort Units

Plan ladder

Pre-flight HEU estimate

On this page