Xai Plans Pricing
Pricing for the xAI API is usage-based per token (input/output, with cached-input discounts) per model, plus per-call fees for built-in tools and discounts for the Batch API. Exact per-model rates have not been reconciled in this artifact; see the xAI Console Models page for current rates.
Xai Plans Pricing is the machine-readable pricing-plan profile for xAI on the APIs.io network, conforming to the API Commons Plans specification.
It defines 3 plans, covering usage and enterprise tiers, with named plans including Pay-as-you-go (API), Batch API, Enterprise / Custom.
Tagged areas include AI, LLM, Foundation Models, Grok, and Generative AI.
Plans
Standard API access billed per token consumed, per model, with prepaid or invoice-based credits. Billing covers input tokens, output tokens, and (where applicable) cached-input tokens, plus per-call tool-invocation fees.
- Chat Completions
- Responses
- Embeddings
- Images
- Video
- Voice
- Live Search
Discounted asynchronous processing for non-real-time workloads. Batch requests are 20-50% cheaper than the equivalent synchronous calls and do not count toward rate limits.
- Batch Chat Completions
- Batch Embeddings
Volume commitments, custom capacity, dedicated support, and negotiated pricing for enterprise customers. Contact xAI sales.
- Custom Volume Pricing
- Dedicated Capacity