Lambda
Lambda (formerly Lambda Labs) is a GPU cloud provider offering on-demand NVIDIA GPU instances, 1-Click Clusters of 16-2,000+ interconnected H100 and B200 GPUs, long-term reserved Hyperplane capacity, filesystems, firewalls, and SSH key management. Lambda Cloud is accessed via a REST control-plane API, the web console, and SDKs/CLI tooling.
APIs
Lambda Cloud API
The Lambda Cloud API is the REST control plane for launching, listing, starting, stopping, and terminating GPU instances, managing SSH keys, firewalls, filesystems, images, and ...
Lambda 1-Click Clusters
Lambda 1-Click Clusters provision interconnected clusters of 16 to 2,000+ NVIDIA H100 SXM or B200 GPUs for short-duration distributed training workloads. The product is exposed ...
Lambda Cloud Filesystems
Lambda Cloud Filesystems provide persistent, sharable storage attached to on-demand instances for datasets and checkpoints. Filesystems are managed through the Cloud API and con...
Lambda Inference API
Lambda Inference API is an OpenAI-compatible REST gateway at https://api.lambda.ai/v1 that serves hosted open-source language models (Llama, DeepSeek, Hermes, Qwen, and others) ...
Features
Self-serve, first-come access to 1x, 2x, 4x, and 8x NVIDIA GPU virtual machines billed per-hour with no egress fees.
Pre-configured clusters of 16-2,000+ interconnected H100 SXM or B200 GPUs for distributed training.
Long-term reserved GPU clusters sold via direct sales engagement for sustained training workloads.
Pre-installed CUDA, cuDNN, PyTorch, and TensorFlow software stack across all Lambda Cloud instances.
Persistent, sharable storage attached to on-demand instances for datasets and checkpoints.
Integrations
Reference workflows and templates for serving and training Hugging Face models on Lambda Cloud.
PyTorch ships pre-installed via Lambda Stack on all instances.
TensorFlow ships pre-installed via Lambda Stack on all instances.
Event Specifications
Lambda Inference API Chat Completions Streaming (HTTP + SSE)
AsyncAPI 2.6 description of the Lambda (formerly Lambda Labs) **Inference API** chat completion streaming surface. The Lambda Inference API is an OpenAI-compatible REST gateway ...
ASYNCAPI