Novita AI
Novita AI is an AI inference cloud offering serverless LLM, image, video, and audio generation APIs alongside on-demand GPU rentals and serverless GPU endpoints. Hosts open-source models with both native and OpenAI-compatible chat surfaces, plus an agent sandbox and MCP server for tool-using agents.
APIs
Novita AI Platform API
LLM chat completions (OpenAI-compatible), embeddings, reranking, batch, image generation (FLUX.1, Seedream 3.0/4.0, Qwen), image editing (upscale, background removal/replacement...
Novita AI GPU API
On-demand GPU instance management and templates plus serverless GPU endpoints. Create, start, stop, and delete instances; list products and templates; query usage-based and fixe...
Event Specifications
Novita AI Streaming & Webhook API
AsyncAPI 2.6 description of the asynchronous surfaces of the Novita AI platform: 1. **Server-Sent Events (SSE) streaming** for OpenAI-compatible chat completions (`POST /openai/...
ASYNCAPI