ByteDance Doubao · Rate Limits

Doubao Rate Limits

Volcano Engine enforces per-endpoint RPM/TPM and concurrent quotas, configurable per workspace/endpoint. Limits visible in the Ark console.

Doubao Rate Limits is the machine-readable rate-limit profile for ByteDance Doubao on the APIs.io network, conforming to the API Commons Rate Limits specification.

It captures 3 rate-limit definitions, measuring requests-per-minute, tokens-per-minute, and concurrent-requests.

The profile also includes 2 backoff/retry policies defined and response codes documented for throttled.

Tagged areas include AI, LLM, ByteDance, and Rate Limiting.

3 Limits Throttle: 429
AILLMByteDanceRate Limiting

Limits

Per-Endpoint RPM endpoint
requests-per-minute
see Ark console
Configurable per deployed model/endpoint.
Per-Endpoint TPM endpoint
tokens-per-minute
see Ark console
Concurrency endpoint
concurrent-requests
see Ark console

Policies

Backoff Strategy
Exponential backoff with jitter; honor Retry-After.
Reserved Capacity
Reserved-instance subscriptions guarantee throughput beyond shared quotas.

Sources