Cost Management

The sidebar's Cost section groups six FinOps surfaces: the live cost dashboard, token usage telemetry, model pricing, budget caps, chargeback reports, and compliance reports.

Cost dashboard

Open Cost → Cost Dashboard in the sidebar.

Figure 1. Cost dashboard with summary cards, charts, budget status, and cost records

Cost records

A table of cost records with tenant, API key, model, provider, input / output tokens, input / output cost, total cost, and timestamp. Auto-refreshes every 5 seconds.

Filters: narrow by tenant, model, provider, and date range.

Summary cards

Aggregated totals at the top of the page: total cost, total requests, total input tokens, total output tokens.

Charts

Two interactive visualizations:

  • Cost by Provider — donut chart breaking down total spend by provider
  • Cost by Model — horizontal bar chart showing spend per model
Figure 2. Cost dashboard with cost-by-provider donut and cost-by-model bar chart

Budget status panel

Shows active budget caps with progress bars of current spend against the limit. Bars are color-coded: green below the soft limit, amber from the soft limit up to the hard limit, and red once the hard limit is reached.
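
The color rule can be sketched in a few lines. This is a minimal illustration, not the Console's actual implementation: the function name is hypothetical, the 80% default mirrors the Soft Limit % default on the create-budget form, and treating "at the limit" as "at or above" is an assumption.

```python
def budget_color(spend: float, limit: float, soft_limit_pct: float = 80.0) -> str:
    """Map current spend to a status color (hypothetical helper).

    Assumes the soft limit is a percentage of the hard limit and that
    reaching a threshold exactly already counts as crossing it.
    """
    soft_threshold = limit * soft_limit_pct / 100.0
    if spend >= limit:
        return "red"      # hard limit reached
    if spend >= soft_threshold:
        return "amber"    # soft limit reached, hard limit not yet
    return "green"        # below the soft limit


if __name__ == "__main__":
    for spend in (50.0, 80.0, 100.0):
        print(spend, budget_color(spend, limit=100.0))
```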

Forecast cards

Shows 7-day trailing cost forecasts per model with projected month-end spend and trend indicators (increasing / decreasing / stable).
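
The forecasting method is not documented here. As a rough mental model only, a trailing-window projection could look like the sketch below. Everything in it is an assumption: the function names, the use of a plain 7-day average, and the 5% tolerance band used to label the trend.

```python
import calendar
from datetime import date
from typing import Sequence


def project_month_end(month_to_date: float, trailing_daily: Sequence[float], today: date) -> float:
    """Month-to-date spend plus the trailing daily average for each remaining day."""
    if not trailing_daily:
        raise ValueError("need at least one day of trailing cost data")
    days_in_month = calendar.monthrange(today.year, today.month)[1]
    days_left = days_in_month - today.day
    daily_avg = sum(trailing_daily) / len(trailing_daily)
    return month_to_date + daily_avg * days_left


def trend(trailing_daily: Sequence[float], tolerance_pct: float = 5.0) -> str:
    """Compare the newer half of the window with the older half (assumed rule)."""
    half = len(trailing_daily) // 2
    if half == 0:
        return "stable"
    older = sum(trailing_daily[:half]) / half
    newer = sum(trailing_daily[-half:]) / half
    if older == 0:
        return "stable" if newer == 0 else "increasing"
    change_pct = (newer - older) / older * 100.0
    if change_pct > tolerance_pct:
        return "increasing"
    if change_pct < -tolerance_pct:
        return "decreasing"
    return "stable"
```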

Anomaly alerts

Lists detected cost anomalies with tenant, model, current daily rate vs. baseline, deviation percentage, and detection timestamp. The detection threshold is configurable globally via dvara.llm-gateway.finops.anomaly-threshold-pct and per tenant by setting cost.anomaly-threshold-pct in the tenant metadata.
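
To make the deviation percentage and the two threshold settings concrete, here is a small sketch. It assumes that a per-tenant value overrides the global one, that deviation is measured relative to the baseline, and that only upward deviations count. None of these are confirmed by this page, and the function names are hypothetical.

```python
from typing import Mapping, Optional

# Metadata key named in the text above.
TENANT_THRESHOLD_KEY = "cost.anomaly-threshold-pct"


def effective_threshold_pct(global_pct: float, tenant_metadata: Mapping[str, str]) -> float:
    """Use the tenant's threshold if set, else the global one (assumed precedence)."""
    raw = tenant_metadata.get(TENANT_THRESHOLD_KEY)
    return float(raw) if raw is not None else global_pct


def deviation_pct(current_daily_rate: float, baseline: float) -> Optional[float]:
    """Percentage by which the current daily rate differs from the baseline."""
    if baseline <= 0:
        return None  # no meaningful baseline to compare against
    return (current_daily_rate - baseline) / baseline * 100.0


def is_anomaly(current_daily_rate: float, baseline: float, threshold_pct: float) -> bool:
    """Flag only upward deviations at or above the threshold (assumed rule)."""
    dev = deviation_pct(current_daily_rate, baseline)
    return dev is not None and dev >= threshold_pct
```

For example, a tenant spending 150 USD/day against a 100 USD/day baseline deviates by 50%, which would be flagged under a 25% threshold.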

Token Usage page

For per-request token detail (the raw records underneath the dollar aggregates), open Cost → Token Usage in the sidebar. The page wraps the /v1/admin/token-usage REST surface with a filter form (tenant, API key, model, date range) that drives two views. A summary KPI card shows input / output / total tokens, request count, and average tokens per request. A records table lists records newest first, with masked API keys and an estimated-flag tooltip on rows where the upstream didn't return exact counts. Page size defaults to 50 and is capped at 1000.

The Cost Dashboard is the dollar lens of the same data — tokens rolled up with pricing into spend figures grouped by tenant / model / provider / time range. Use the Cost Dashboard for spend visibility and budgets; use the Token Usage page for pricing audits, cost-per-token analysis, or reconciling a disputed tenant invoice.

See the Admin API reference for the list and summary endpoints with filter parameters (tenantId, apiKey, model, from, to) and response shape.
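
For scripting against the same data, the filter parameters can be assembled into a query string as sketched below. The path and parameter names come from this page. The host name, the helper name, and the ISO date format are assumptions, and the exact list and summary sub-paths are left to the Admin API reference.

```python
from typing import Optional
from urllib.parse import urlencode


def token_usage_url(
    base_url: str,
    tenant_id: Optional[str] = None,
    api_key: Optional[str] = None,
    model: Optional[str] = None,
    from_: Optional[str] = None,  # assumed ISO-8601 date string
    to: Optional[str] = None,     # assumed ISO-8601 date string
) -> str:
    """Build a token-usage URL, omitting filters that are not set."""
    params = {
        "tenantId": tenant_id,
        "apiKey": api_key,
        "model": model,
        "from": from_,
        "to": to,
    }
    query = urlencode({k: v for k, v in params.items() if v is not None})
    url = base_url.rstrip("/") + "/v1/admin/token-usage"
    return f"{url}?{query}" if query else url


if __name__ == "__main__":
    # gateway.example.com is a placeholder host.
    print(token_usage_url("https://gateway.example.com", tenant_id="acme", model="gpt-4o"))
```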

Model pricing

Open Cost → Model Pricing in the sidebar.

Pricing list

Lists every pricing entry with model pattern (glob), provider, input price per million tokens, output price per million tokens, effective date, and row actions.

Figure 3. Model pricing list with seeded entries

Create / edit pricing

Click New Pricing to create an entry with:

  • Model Pattern — glob pattern (e.g. gpt-4o*, claude-3-sonnet*)
  • Provider — optional provider scope
  • Input Price / Million — USD per 1 M input tokens
  • Output Price / Million — USD per 1 M output tokens
  • Effective Date — when this pricing takes effect
Figure 4. Create-pricing form
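
The sketch below shows how a pricing entry with these fields could turn token counts into a dollar cost. The per-million arithmetic follows directly from the field definitions. The selection rule (latest effective date not in the future, among entries whose glob and provider scope match) is an assumption, and the class name, function names, and prices are illustrative only.

```python
from dataclasses import dataclass
from datetime import date
from fnmatch import fnmatchcase
from typing import Optional, Sequence


@dataclass(frozen=True)
class Pricing:
    model_pattern: str          # glob, e.g. "gpt-4o*"
    provider: Optional[str]     # None = any provider
    input_per_million: float    # USD per 1 M input tokens
    output_per_million: float   # USD per 1 M output tokens
    effective: date


def select_pricing(entries: Sequence[Pricing], model: str, provider: str, on: date) -> Optional[Pricing]:
    """Pick the most recent matching entry already in effect (assumed rule)."""
    candidates = [
        e for e in entries
        if fnmatchcase(model, e.model_pattern)
        and (e.provider is None or e.provider == provider)
        and e.effective <= on
    ]
    return max(candidates, key=lambda e: e.effective, default=None)


def request_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Total cost in USD for one request."""
    total = input_tokens * pricing.input_per_million + output_tokens * pricing.output_per_million
    return total / 1_000_000


if __name__ == "__main__":
    # Prices below are made up for the example.
    entries = [Pricing("gpt-4o*", None, 2.0, 8.0, date(2025, 1, 1))]
    p = select_pricing(entries, "gpt-4o-mini", "openai", date(2025, 3, 1))
    if p is not None:
        print(request_cost(p, input_tokens=1_000_000, output_tokens=500_000))
```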

Click Delete to remove a pricing entry. Requires confirmation.

Budget caps

Open Cost → Budget Caps in the sidebar.

Budget list

Lists every budget cap with name, tenant, API key, period (DAILY / WEEKLY / MONTHLY), limit (USD), soft limit percentage, enabled status, and version. Auto-refreshes every 10 seconds. A Summary row at the top shows total budgets, enabled count, and disabled count. Use the Tenant dropdown filter to narrow the list.

Tenant self-service budgets

Tenants can create their own caps from /portal/budgets — these are stored with managedBy=TENANT and must sit at or below the effective platform ceiling (the tightest enabled PLATFORM-provenance cap that governs them). The Console's Budget list mixes both: platform-created caps are editable from here; tenant-created caps appear read-only with a lock badge. Budget evaluation is tightest-wins across every applicable cap.
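
One way to read "tightest-wins" is that, among all enabled caps whose scope covers a request, the cap with the least remaining headroom governs. The sketch below illustrates that reading only. The class, field, and function names are hypothetical, and period handling (DAILY / WEEKLY / MONTHLY) is left out for brevity.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Cap:
    name: str
    tenant_id: Optional[str]    # None = global
    api_key_id: Optional[str]   # None = tenant-wide
    limit_usd: float
    spent_usd: float            # spend in the cap's current period
    enabled: bool = True
    managed_by: str = "PLATFORM"  # or "TENANT"


def applies(cap: Cap, tenant_id: str, api_key_id: str) -> bool:
    """A cap applies if it is enabled and its scope covers the request."""
    return (
        cap.enabled
        and cap.tenant_id in (None, tenant_id)
        and cap.api_key_id in (None, api_key_id)
    )


def tightest_cap(caps: Sequence[Cap], tenant_id: str, api_key_id: str) -> Optional[Cap]:
    """Return the applicable cap with the least remaining headroom (assumed rule)."""
    applicable = [c for c in caps if applies(c, tenant_id, api_key_id)]
    return min(applicable, key=lambda c: c.limit_usd - c.spent_usd, default=None)
```

Under this reading, a tenant-created daily cap with 5 USD left governs ahead of a platform monthly cap with 600 USD left, regardless of who created it.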

Figure 5. Budget caps list with monthly and daily caps

Create / edit budget

Click New Budget to create a budget cap with:

  • Name — descriptive name
  • Tenant ID — scope to a tenant (blank = global)
  • API Key ID — scope to a specific API key (blank = tenant-wide)
  • Period — DAILY, WEEKLY, or MONTHLY
  • Limit (USD) — hard spending limit
  • Soft Limit % — percentage of limit at which soft alerts trigger (default: 80%)
Figure 6. Create-budget form

Budget status

Click the status icon on any budget row to view current period usage:

  • Current Spend — amount spent in the current period
  • Limit — configured limit
  • Period Start / End — boundaries of the current budget period
  • Utilization % — spend / limit percentage with a color-coded progress bar
  • Remaining — dollars remaining before the hard limit
Figure 7. Budget usage panel with spend vs limit progress
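
The two derived figures in the panel follow from simple arithmetic, sketched here. The function name is hypothetical, and clamping Remaining at zero once spend exceeds the limit is an assumption about how overspend is displayed.

```python
from typing import Tuple


def budget_usage(current_spend: float, limit: float) -> Tuple[float, float]:
    """Return (utilization %, remaining USD) for the current period."""
    utilization_pct = current_spend / limit * 100.0 if limit > 0 else 0.0
    remaining = max(limit - current_spend, 0.0)  # assumed: never shown as negative
    return utilization_pct, remaining


if __name__ == "__main__":
    pct, left = budget_usage(current_spend=45.0, limit=60.0)
    print(f"{pct:.1f}% used, {left:.2f} USD remaining")
```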

Click Delete to remove a budget cap. Requires confirmation.

Chargeback reports

Open Cost → Chargeback Reports in the sidebar. Requires an enterprise license.

Report list

Lists every generated chargeback report with ID, tenant, period (from–to), generated date, and generated by. Auto-refreshes every 10 seconds.

Figure 8. Chargeback reports list

Generate report

Click Generate to create a new chargeback report:

  • Tenant ID — scope to a specific tenant (blank = all tenants)
  • From / To — reporting period date range

The report includes seven sections: tenant summary, API key summary, model summary, provider summary, daily breakdown, forecasts, and anomalies.

Download

  • PDF — click the PDF icon to download a formatted PDF report
  • CSV — click the CSV icon to download raw data as CSV

Click Delete to remove a report. Requires confirmation.

Compliance reports

Open Cost → Compliance Reports in the sidebar. Requires an enterprise license.

Report list

Lists every compliance report with ID, type (SOC2 / HIPAA / GDPR), tenant, period, generated date, and actions. Auto-refreshes every 10 seconds. Use the Type and Tenant dropdown filters to narrow the list.

Figure 9. Compliance reports list

Generate report

Click Generate to create a new compliance report:

  • Type — SOC2 Type II, HIPAA, or GDPR
  • Tenant ID — scope to a specific tenant (blank = all tenants)
  • From / To — reporting period date range

Report contents by type:

SOC2                       | HIPAA                | GDPR
---------------------------|----------------------|---------------------------
Audit chain integrity      | Access log           | Data processing activities
Access control summary     | Authorization log    | Tenant data residency
Policy enforcement summary | Policy violations    | Consent and policy status
Data residency             | Data access patterns | Data retention
Token usage summary        | Audit integrity      | Audit integrity
Event count by type        |                      |

Compliance generate form: report type radio (SOC2 / HIPAA / GDPR), tenant filter, and date-range pickers
Figure 10. Generating a compliance report. The type selection drives which audit-event categories are aggregated (per the table above); the tenant and date range scope the evidence window. Generated PDFs are cryptographically signed end-to-end so an auditor can verify that the export hasn't been altered.

Click the PDF icon to download the compliance report as a formatted PDF. Click Delete to remove a report (requires confirmation).