> ## Documentation Index
> Fetch the complete documentation index at: https://help.pantaos.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Token Limits and Budgets

> Workspace and per model spend control in the Admin Panel Token Limits tab. Plan budget, overage allowance, and automatic fallback.

## What It Is

Token limits and budgets is the section of the Admin Panel where administrators see and control workspace spend on AI usage. It opens under **Admin Panel → Token Limits** and is the only place in PANTA OS where token related configuration happens.

The page has three sections: Spending & Budget (the workspace level view), Per Model Cost Caps, and Spend By Model (a breakdown of spend by model in the current cycle).

<Tip>
  This page handles the day to day administration of budgets. For the strategic overview of how PANTA OS approaches token management, see [Token Management](/platform/featured/token-management).
</Tip>

<Frame>
  <img src="https://mintcdn.com/panta/FVby5FpMAMdfhV10/images/English-Images-Documentations-(2)-1.png?fit=max&auto=format&n=FVby5FpMAMdfhV10&q=85&s=b827169ec6d23a768df09f3c97903f50" alt="English Images Documentations (2) 1" width="1920" height="1080" data-path="images/English-Images-Documentations-(2)-1.png" />
</Frame>

## Why It Matters

AI cost grows with usage. Without limits, the growth is invisible until the bill arrives. The Token Limits tab gives administrators a fixed envelope to work inside, plus the optional tools to shape consumption by model.

<CardGroup cols={2}>
  <Card title="Predictability" icon="shield">
    The plan budget is fixed by your PANTA OS plan, so spend cannot exceed the agreed envelope by surprise.
  </Card>

  <Card title="Visibility per cycle" icon="chart-area">
    The page shows what has been spent in the current billing cycle, against what is allowed, with a progress indicator.
  </Card>

  <Card title="Per model control" icon="brain">
    Optional Model limits cap consumption on individual models. Useful when one expensive model is driving the bill.
  </Card>

  <Card title="Automatic fallback" icon="rotate-cw">
    When a model limit is reached, requests can route automatically to a cheaper model so work continues without manual intervention.
  </Card>
</CardGroup>

## How To Use It

<Steps>
  <Step title="Open the Admin Panel" icon="shield">
    Click Admin at the bottom of the sidebar. Visible to administrators only.
  </Step>

  <Step title="Open Token Limits" icon="coins">
    Switch to the Token Limits tab in the Admin Panel tab bar.
  </Step>

  <Step title="Read the current cycle status" icon="calendar">
    The Spending & Budget section at the top shows the active billing period, the amount spent so far in the cycle, the plan budget, and the overage allowance.
  </Step>

  <Step title="Decide on automatic fallback" icon="rotate-cw">
    Toggle "Automatically switch to a cheaper model when a model limit is reached" on to keep work flowing when a model limit is reached. Save the change with Save.
  </Step>

  <Step title="Set per model limits if needed" icon="brain">
    In the Model limits section, enter a euro limit and activate the toggle for any specific model whose consumption you want to cap.
  </Step>

  <Step title="Review actual model spend" icon="globe">
    The Spend by model section breaks down the current cycle by model, in euros and as a percentage of total spend.
  </Step>
</Steps>

## Spending & Budget

The top section is the workspace level overview. It is the single source of truth for cycle spend.

<CardGroup cols={2}>
  <Card title="Current billing period" icon="calendar">
    The active billing cycle, shown as a date range at the top right of the section.
  </Card>

  <Card title="Spent in this cycle" icon="trending-up">
    The amount actually consumed so far in the cycle, in euros.
  </Card>

  <Card title="Plan budget" icon="lock">
    The workspace budget for the cycle, set by the PANTA OS plan. Read only; the lock icon indicates that this value is not editable in the UI.
  </Card>

  <Card title="Overage limit" icon="lock">
    An additional overage allowance on top of the plan budget. Also read only; the lock icon indicates that this value is not editable in the UI.
  </Card>

  <Card title="Progress bar" icon="chart-bar-big">
    A visual indicator of cycle progress with the spent over allowed ratio and a percentage. The status below ("Within plan budget") shows whether you are still inside the plan or have moved into the overage allowance.
  </Card>

  <Card title="Resets monthly" icon="rotate-ccw">
    The billing cycle resets monthly on the calendar date your plan started.
  </Card>
</CardGroup>

### The automatic fallback toggle

Below the budget overview sits a single toggle: **"Automatically switch to a cheaper model when a model limit is reached"**.

<CardGroup cols={1}>
  <Card title="Automatic fallback" icon="rotate-cw">
    When a model limit is reached and this toggle is on, PANTA OS routes new requests to a cheaper model automatically. Users keep working without seeing an error. When the toggle is off, requests on a capped model fail when the limit is reached. Save changes with Save at the right of the section.
  </Card>
</CardGroup>

## Model Limits

The Model limits section sets optional caps per model. It is described in the UI as: *"Optional: set a euro limit per cycle for a specific model. When the limit is reached, requests are automatically routed to a cheaper model (or blocked if none is available). Leave empty to use the shared org budget pool."*

The table lists every available model in the workspace with four columns:

<CardGroup cols={2}>
  <Card title="Model" icon="brain">
    The model name (for example Claude Sonnet 4.5, GPT-4o, GPT-5, GPT-5 Mini, GPT-5.4 Mini).
  </Card>

  <Card title="In this cycle" icon="trending-up">
    The amount the workspace has spent on this model in the current cycle, in euros.
  </Card>

  <Card title="Limit" icon="coins">
    An input field for the euro limit per cycle for this model. Leave empty to use the shared workspace budget pool without a model specific cap.
  </Card>

  <Card title="Enabled" icon="toggle-right">
    Toggle that activates the model limit. Use it together with the Limit value.
  </Card>
</CardGroup>

## Spend By Model (This Cycle)

The bottom section breaks down actual cycle spend by model.

<CardGroup cols={2}>
  <Card title="Per model spend" icon="recycle">
    Each model that has produced consumption in the cycle is listed with the Euro amount and its percentage of total cycle spend.
  </Card>

  <Card title="Use it to find cost drivers" icon="search">
    Models at the top of the list are the largest contributors to your bill. Compare against the Model limits table to decide which models should be capped.
  </Card>
</CardGroup>

## Key Settings or Options

<CardGroup cols={2}>
  <Card title="Plan budget" icon="lock">
    Workspace cycle budget. Set by PANTA OS plan; not editable in the UI.
  </Card>

  <Card title="Overage limit" icon="lock">
    Additional overage allowance. Set by PANTA OS plan; not editable in the UI.
  </Card>

  <Card title="Automatic fallback toggle" icon="rotate-cw">
    Single workspace level toggle to route requests to a cheaper model when a model limit is reached.
  </Card>

  <Card title="Per model euro caps" icon="brain">
    Optional cap per model. Leave empty to use the shared workspace pool.
  </Card>

  <Card title="Monthly reset" icon="rotate-ccw">
    The billing cycle resets monthly on the calendar date your plan started.
  </Card>

  <Card title="Per model spend breakdown" icon="key-round">
    Euro and percentage per model in the current cycle, for direct visibility on cost drivers.
  </Card>
</CardGroup>

## Tips and Best Practices

* Keep automatic fallback on by default. It is the difference between users hitting an error and users continuing to work, and the platform always picks a model that is allowed.
* Cap only the models that drive cost. Use Spend by model to identify them; setting limits on cheap models adds friction without saving money.
* Watch the progress bar near the end of the cycle. If you are close to the plan budget, expect the overage allowance to be touched in the last days.
* Review the page weekly during rollout. Spotting an unexpected model spike early is cheaper than discovering it later.
* If the plan budget feels tight, the conversation is with your PANTA OS account contact rather than a UI change. Plan budget and Overage limit are plan level decisions.

<Tip>
  Token limits in PANTA OS are workspace and per model only. There are no per team, per user, or per assistant token caps. Spend control happens through the plan envelope, model limits, and the automatic fallback.
</Tip>

## Help Center

<AccordionGroup>
  <Accordion title="Where are token limits configured" icon="map-pin">
    Open the Admin Panel from the bottom of the sidebar, then switch to the Token Limits tab. This is the only place in PANTA OS where token configuration happens.
  </Accordion>

  <Accordion title="Can I change the plan budget myself" icon="lock">
    No. The budget and overage limits are set by your PANTA OS plan and are read only in the UI (indicated by the lock icon). To raise either value, contact your PANTA OS account contact about a plan change.
  </Accordion>

  <Accordion title="Can I set a budget for a single user or team" icon="euro">
    No. Token limits exist only at the workspace level (plan budget plus overage) and the per model level. There is no per team, per user, or per assistant cap.
  </Accordion>

  <Accordion title="What happens when a model limit is reached" icon="rotate-cw">
    If the automatic fallback toggle is on, new requests route to a cheaper available model and work continues. If the toggle is off, requests on the capped model fail until the cycle resets.
  </Accordion>

  <Accordion title="What happens when the plan budget and overage are both fully consumed" icon="hand-helping">
    The limit is reached and new requests are blocked. To continue working in the same cycle, the plan envelope has to be raised with your PANTA OS account contact.
  </Accordion>

  <Accordion title="When does the cycle reset" icon="rotate-ccw">
    The billing cycle resets monthly on the calendar date your plan started.
  </Accordion>

  <Accordion title="How do I see which model is costing the most" icon="search">
    Scroll to Spend by model at the bottom of the Token limits tab. Each model that produced consumption in the current cycle is listed with its Euro amount and percentage of total spend.
  </Accordion>
</AccordionGroup>
