Not Diamond · FinOps Profile

Notdiamond Finops

This profile gives a FinOps view of Not Diamond spend. Not Diamond charges a small fixed routing fee per million tokens routed and positions itself as a cost-optimization layer: it sends each prompt to the cheapest model that still meets quality targets, which it cites as reducing total LLM spend by 20-40%. Underlying model inference is billed by the selected providers, not by Not Diamond. The exact routing fee is not publicly listed, so fee amounts are left unreconciled in this profile.
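Because the routing fee itself is not public, one way to reason about the cited 20-40% savings is to compute the highest fee at which routing would still break even. The sketch below is illustrative only: the function name, the baseline spend, and the token volume are hypothetical placeholders, not figures from Not Diamond.

```python
def break_even_fee_per_million(baseline_spend_usd: float,
                               savings_rate: float,
                               routed_tokens: int) -> float:
    """Highest routing fee (USD per 1M tokens) at which routing still breaks even.

    baseline_spend_usd: provider spend without routing (hypothetical input).
    savings_rate: fraction of that spend saved by routing, e.g. 0.20 for 20%.
    routed_tokens: tokens sent through the router in the same period.
    """
    if routed_tokens <= 0:
        raise ValueError("routed_tokens must be positive")
    gross_savings_usd = baseline_spend_usd * savings_rate
    millions_routed = routed_tokens / 1_000_000
    return gross_savings_usd / millions_routed


# Hypothetical monthly figures, chosen only to make the arithmetic easy to follow.
BASELINE_SPEND_USD = 10_000.0
ROUTED_TOKENS = 500_000_000

low = break_even_fee_per_million(BASELINE_SPEND_USD, 0.20, ROUTED_TOKENS)
high = break_even_fee_per_million(BASELINE_SPEND_USD, 0.40, ROUTED_TOKENS)
print(f"Break-even fee: {low:.2f} to {high:.2f} USD per 1M tokens")
```

Any actual fee below the computed range would leave a net saving under these assumptions; the real figure has to come from an invoice or contract.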

Notdiamond Finops is the FinOps profile for Not Diamond on the APIs.io network, aligned with the FinOps Foundation Framework.

It defines 3 meters, with charges in USD on a monthly billing cycle under a usage-based pricing category.

The profile maps 8 FOCUS columns for cost-allocation reporting.

Tagged areas include AI, LLM, Model Routing, Router, and Orchestration.

Category: AI and Machine Learning · Pricing: Usage-Based · Billing: Monthly · FOCUS v1.3
AI, LLM, Model Routing, Router, Orchestration, FinOps, Cost Management, FOCUS

Framework Alignment

Framework
Data Spec

Charge Categories

Usage, Purchase

FOCUS Columns

BillingCurrency: USD
ChargeCategory: Usage
InvoiceIssuerName: Not Diamond
PricingCategory: Usage-Based
ProviderName: Not Diamond
PublisherName: Not Diamond
ServiceCategory: AI and Machine Learning
ServiceName: Not Diamond Model Router
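As a rough illustration of how these eight column values might be stamped onto cost-allocation rows, the sketch below builds a FOCUS-style record and serializes it to CSV. The helper names `focus_row` and `to_csv` are hypothetical, and the `BilledCost` column and its value are added here purely for illustration; this profile does not map them.

```python
import csv
import io

# The eight column values mapped by this profile.
FOCUS_STATIC_COLUMNS = {
    "BillingCurrency": "USD",
    "ChargeCategory": "Usage",
    "InvoiceIssuerName": "Not Diamond",
    "PricingCategory": "Usage-Based",
    "ProviderName": "Not Diamond",
    "PublisherName": "Not Diamond",
    "ServiceCategory": "AI and Machine Learning",
    "ServiceName": "Not Diamond Model Router",
}


def focus_row(billed_cost: str) -> dict:
    """Return one row: the static profile columns plus an illustrative cost."""
    row = dict(FOCUS_STATIC_COLUMNS)
    row["BilledCost"] = billed_cost  # assumption: not part of this profile
    return row


def to_csv(rows: list) -> str:
    """Serialize rows to CSV text, using the first row's keys as the header."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


# Placeholder amount, not a real charge.
csv_text = to_csv([focus_row("12.50")])
print(csv_text)
```

Keeping the static values in one dictionary means every exported row carries identical provider and service labels, which is what makes grouping by these columns reliable.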

Meters

routed_tokens
Unit: tokens
Tokens routed through the model router, billed as a fixed fee per 1M tokens.
routing_decisions
Unit: requests
Number of modelSelect routing decisions made.
provider_inference_cost
Unit: usd
Inference cost incurred at the selected underlying LLM provider. Billed by the provider, not by Not Diamond; tracked here for total cost of ownership.
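The three meters can be combined into a total-cost-of-ownership figure along the following lines. This is a minimal sketch: the `MeterReading` class and both functions are hypothetical, and the fee passed in is a placeholder because the real routing fee is not publicly listed.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass(frozen=True)
class MeterReading:
    """One billing period's readings for the three meters in this profile."""
    routed_tokens: int                # unit: tokens
    routing_decisions: int            # unit: requests
    provider_inference_cost: Decimal  # unit: usd, billed by the providers


def routing_fee(reading: MeterReading, fee_per_million_usd: Decimal) -> Decimal:
    """Fixed routing fee: tokens routed, in millions, times the per-1M rate."""
    return Decimal(reading.routed_tokens) / Decimal(1_000_000) * fee_per_million_usd


def total_cost_of_ownership(reading: MeterReading,
                            fee_per_million_usd: Decimal) -> Decimal:
    """Routing fee plus the inference cost billed separately by providers."""
    return routing_fee(reading, fee_per_million_usd) + reading.provider_inference_cost


# Hypothetical readings and a placeholder fee (the real fee is not public).
reading = MeterReading(
    routed_tokens=250_000_000,
    routing_decisions=40_000,
    provider_inference_cost=Decimal("1800.00"),
)
placeholder_fee = Decimal("2.00")

fee = routing_fee(reading, placeholder_fee)
tco = total_cost_of_ownership(reading, placeholder_fee)
per_decision = tco / reading.routing_decisions
print(f"routing fee: {fee} USD, TCO: {tco} USD, per decision: {per_decision} USD")
```

Decimal is used rather than float so that currency amounts add up without binary rounding drift.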

Sources