fal

fal (Features and Labels, Inc.) is a generative media platform providing the world's fastest API for running image, video, audio, and multimodal generative AI models. Through a unified queue-based REST API at https://queue.fal.run, plus realtime WebSocket and SSE streaming surfaces, fal serves 1,000+ production models — including FLUX, Veo 3, Kling, Wan, Seedream, Nano Banana, and Stable Diffusion — on autoscaling GPU infrastructure. fal Serverless lets developers ship custom models with `@fal.function` / `fal.App` / BYO containers, while fal Compute provides dedicated H100/H200/A100/B200 instances. Trusted by Canva, Perplexity, Poe, and 1.5M+ developers; Series D funded ($140M, Sequoia-led, December 2025); SOC 2 with 99.99% uptime.

fal publishes 8 APIs on the APIs.io network, including Realtime API, Streaming API, Apps API, and 5 more. Tagged areas include AI, Artificial Intelligence, Generative AI, Generative Media, and Image Generation.

The fal catalog on APIs.io includes 1 event-driven AsyncAPI specification, 1 JSON-LD context, and 3 Spectral governance rulesets.

fal’s developer surface includes authentication, changelog, CLI, developer portal, documentation, getting-started guide, engineering blog, and 59 more developer resources.

🌐 Visit website 📡 Source on GitHub

74.8/100 exemplar ▬ flat Agent 57/100 agent ready Full breakdown ↓
scored 2026-07-28 · rubric v0.6

AccessPaidSelf serve

12 APIs 1 MCP Servers 18 Features

AIArtificial IntelligenceGenerative AIGenerative MediaImage GenerationVideo GenerationAudio GenerationInferenceServerlessGPUMCP

Kin Score

Kin Score How this is scored →
scored 2026-07-28 · rubric v0.6

Composite quality — 74.8/100 · exemplar

Contract Quality 20.5 / 25

Developer Ergonomics 16.1 / 20

Commercial Clarity 13.7 / 20

Operational Transparency 8.9 / 13

Governance 6.4 / 12

Discoverability 9.3 / 10

Agent readiness — 57/100 · agent ready

Machine-Readable Contract 18 / 18

Agentic Access Contract 10 / 10

MCP Server 12 / 12

Machine-Readable Auth 10 / 10

Idempotency 0 / 9

Stable Error Semantics 8 / 8

Request/Response Examples 7 / 7

Rate-Limit Signaling 7 / 7

Typed Event Surface 6 / 6

Agent Skills 0 / 5

Well-Known Catalog 4 / 4

Consent & Bot Identity 0 / 3

A2A Agent Card 0 / 8

Dry-Run / Simulate Mode 0 / 4

Improve this rating by publishing the missing artifacts — every area above can be raised, and the full rubric is at apis.io/rating/. This rating is computed from github.com/api-evangelist/fal-ai: open an issue to ask a question, or submit a pull request to add artifacts. Want it done for you? Prioritized profiling — $2,500 →

APIs 12

Individual APIs this provider publishes, each with its own machine-readable definition.

fal Realtime API

WebSocket-based realtime inference for ultra-low latency interactive generative experiences such as LCM/SDXL sketch-to-image, live-portrait, and realtime upscaling. Bi-direction...

fal Streaming API

HTTP streaming endpoint (`/{model-id}/stream`) that emits progressive partial outputs as a model runs — used for LLM/VLM token streams, incremental video frames, and step-by-ste...

fal Models Catalog API

Read-only discovery endpoints for browsing fal's 1,000+ production model catalog, including model metadata, capability tags, pricing per output, supported parameters, example in...

fal Compute API

Provision and manage dedicated GPU instances (H100, H200, A100, B200) with full SSH access for training, fine-tuning, and persistent workloads. Hourly or per-second billing with...

fal API Keys API

Manage fal API keys — create, list, scope, and revoke keys used to authenticate against the Model, Storage, Serverless, and Compute APIs via the Authorization: Key $FAL_KEY header.

fal Usage and Billing API

Programmatic access to usage metrics, per-model spend, GPU-second consumption, and invoicing history. Surfaces the same data shown on the fal dashboard so platform teams can pip...

fal Apps API

List and inspect deployed Serverless apps.

fal Files API

Manage files on persistent Serverless `/data` volumes.

fal Queue API

Submit, inspect, and cancel model inference jobs.

fal Secrets API

Manage per-org secrets injected into Serverless runs.

fal Storage API

Upload binary assets to the fal CDN.

fal Streaming API

Server-sent streaming of incremental model output.

Scroll for all 12

Postman Collections 3

Ready-to-run Postman collections for exercising this provider's APIs.

fal Model APIs

POSTMAN

fal Serverless Platform API

POSTMAN

fal Storage API

POSTMAN

Open Collections 3

Open, tool-agnostic API collections (OpenAPI-derived and Bruno).

fal Model APIs

OPEN COLLECTION

fal Serverless Platform API

OPEN COLLECTION

fal Storage API

OPEN COLLECTION

Arazzo Workflows 9

Multi-step API workflows described with the Arazzo specification.

fal Upload, Run Image-To-Image With Webhook

Upload a reference image, submit an image-to-image job with a webhook, and confirm queue acceptance.

ARAZZO

fal Queue Inference

Submit a model inference job, poll the queue until it completes, then fetch the result.

ARAZZO

fal Serverless App Discovery

List deployed Serverless apps, then fetch full metadata and scaling for the first one.

ARAZZO

fal Serverless App Files Inspection

Confirm a Serverless app exists, then list files on its persistent /data volume.

ARAZZO

fal Set And Verify Serverless Secret

Create or replace a Serverless secret, then list secret names to confirm it is present.

ARAZZO

fal Streaming Inference

Run a model synchronously over the streaming endpoint to receive progressive output.

ARAZZO

fal Submit And Conditionally Cancel

Submit an inference job, check its status once, and cancel it if it has not finished.

ARAZZO

fal Upload Asset Then Run Inference

Upload a binary reference asset to the fal CDN, then run an image-to-X model against it.

ARAZZO

fal Webhook-Backed Submission

Submit an inference job with a webhook callback and confirm it was accepted into the queue.

ARAZZO

Scroll for all 9

MCP Servers 1

Model Context Protocol servers that expose these APIs to AI agents.

fal-ai-mcp.yml

MCP SERVER

GraphQL 1

GraphQL schemas published by this provider.

fal GraphQL API

fal is a fast serverless inference platform for AI models including image generation (Stable Diffusion, FLUX, Kling), video generation, speech, and custom models. The API covers...

GRAPHQL

Pricing Plans 1

Published pricing tiers and plan structures.

Fal Ai Plans Pricing

2 plans

PLANS

Rate Limits 1

Documented rate limits and quota policies.

Fal Ai Rate Limits

5 limits

RATE LIMITS

FinOps 1

Cost, billing, and metering signals for API financial operations.

Fal Ai Finops

FINOPS

Features 18

Notable capabilities this provider offers.

Unified queue-based REST API at https://queue.fal.run/{model-id} for 1,000+ generative models

Image generation models — FLUX (Schnell, Dev, Pro, Kontext Pro), Seedream V4, Nano Banana, Qwen, SDXL, SD3, Ideogram, Recraft

Video generation models — Veo 3, Kling 2.5 Turbo Pro, Wan 2.5, Seedance 2.0, Ovi, Hunyuan, Sora-class

Audio and voice models — Inworld TTS-1.5, ElevenLabs, MMAudio, MusicGen, Stable Audio

3D and multimodal models — TripoSR, Hunyuan3D, LivePortrait, FaceChain

Synchronous, asynchronous queue, server-sent streaming, and WebSocket realtime invocation modes

Webhook callbacks for queue completion with HMAC signature verification

File uploads / CDN storage at https://v3.fal.media with signed upload URLs

fal Serverless — `@fal.function`, `fal.App`, BYO container deployment with autoscaling from 0 to thousands of GPUs

fal Compute — dedicated H100/H200/A100/B200 instances with SSH and per-second billing

Per-output billing (image, video second, audio minute) plus per-second GPU billing for custom deployments

99.99% uptime SLA, SOC 2 compliance, private endpoints, and enterprise support

Proprietary Inference Engine — up to 10x faster than reference implementations

Official SDKs for Python (fal-client), JavaScript/TypeScript (@fal-ai/client), Swift, Java/Kotlin, Dart

fal CLI for serverless deploy / run / apps / secrets / auth

fal MCP Server exposing all 1,000+ models to AI assistants via the Model Context Protocol

ComfyUI and Blender extensions, plus Terraform provider for infra-as-code

Day-zero launch partner for major model releases (FLUX, Veo, Kling, Seedance, Wan, etc.)

Scroll for all 18

JSON Schema 3

Standalone JSON Schema definitions for this provider's data models.

Examples 3

Example request and response payloads for these APIs.

Get Started 3

Portal, sign-up, and the first successful call

Portal

Portal

GettingStarted

GettingStarted

Signup

Signup

Documentation 11

Reference material describing how the API behaves

Documentation

Documentation

Documentation

Documentation

Documentation

Documentation

Documentation

Documentation

Documentation

Documentation

Documentation

Scroll for all 11

Agent Surfaces 5

MCP servers, agent skills, and machine-readable catalogs

AgenticAccess

WellKnown

MCPServer

LLMsTxt

LLMsTxt

Design & Contract 14

Pagination, idempotency, versioning, errors, and events

Conformance

ErrorCatalog

Lifecycle

Conventions

DataModel

Arazzo

Arazzo

Arazzo

Arazzo

Arazzo

Arazzo

Arazzo

Arazzo

Arazzo

Scroll for all 14

Build 16

SDKs, sample code, and the tooling you integrate with

Packages

Packages

CLI

CLI

PostmanWorkspace

PostmanWorkspace

GitHubOrganization

SDKs

SDKs

SDKs

SDKs

SDKs

SDKs

Tools

Tools

Tools

CodeExamples

CodeExamples

CodeExamples

Scroll for all 16

Access & Security 4

Authentication, authorization, and security posture

DomainSecurity

Authentication

TrustCenter

TrustCenter

Operate 5

Status, limits, changes, and where to get help

ChangeLog

StatusPage

Support

Forums

RateLimits

Commercial 5

Pricing, plans, and the legal terms of use

Pricing

TermsOfService

PrivacyPolicy

Plans

FinOps

Company 3

The organization behind the API

Blog

Blog

LinkedIn

Twitter

Twitter

Source (apis.yml)

aid: fal-ai
url: https://raw.githubusercontent.com/api-evangelist/fal-ai/refs/heads/main/apis.yml
apis:
- aid: fal-ai:fal-realtime-api
  name: fal Realtime API
  tags:
  - AI
  - Generative AI
  - Realtime
  - WebSocket
  - Streaming
  - Inference
  humanURL: https://fal.ai/docs/model-apis/real-time
  baseURL: wss://realtime.fal.run
  properties:
  - url: https://fal.ai/docs/model-apis/real-time
    type: Documentation
  - url: https://github.com/fal-ai/real-time-demo-app
    type: CodeExamples
  - url: asyncapi/fal-ai-asyncapi.yml
    type: AsyncAPI
  description: WebSocket-based realtime inference for ultra-low latency interactive generative experiences such as LCM/SDXL
    sketch-to-image, live-portrait, and realtime upscaling. Bi-directional binary/JSON messaging keeps a persistent connection
    open so each frame, prompt, or pose adjustment is processed in milliseconds. Powers fal.realtime client utilities used
    in canvas apps, drawing tools, AR experiences, and live video pipelines.
- aid: fal-ai:fal-streaming-api
  name: fal Streaming API
  tags:
  - AI
  - Generative AI
  - Streaming
  - Server-Sent Events
  - Inference
  humanURL: https://fal.ai/docs/model-apis/streaming
  baseURL: https://queue.fal.run
  properties:
  - url: https://fal.ai/docs/model-apis/streaming
    type: Documentation
  - url: asyncapi/fal-ai-asyncapi.yml
    type: AsyncAPI
  description: HTTP streaming endpoint (`/{model-id}/stream`) that emits progressive partial outputs as a model runs — used
    for LLM/VLM token streams, incremental video frames, and step-by-step image diffusion previews. Compatible with Server-Sent
    Events parsers in the official fal-client SDKs.
- aid: fal-ai:fal-models-catalog-api
  name: fal Models Catalog API
  tags:
  - AI
  - Generative AI
  - Catalog
  - Discovery
  humanURL: https://fal.ai/models
  baseURL: https://fal.ai
  properties:
  - url: https://fal.ai/models
    type: Documentation
  description: Read-only discovery endpoints for browsing fal's 1,000+ production model catalog, including model metadata,
    capability tags, pricing per output, supported parameters, example inputs, and OpenAPI schemas per model. Backs the model
    gallery, search, and SDK tooling.
- aid: fal-ai:fal-compute-api
  name: fal Compute API
  tags:
  - AI
  - GPU
  - Compute
  - Infrastructure
  - Dedicated
  humanURL: https://fal.ai/compute
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/compute
    type: Documentation
  description: Provision and manage dedicated GPU instances (H100, H200, A100, B200) with full SSH access for training, fine-tuning,
    and persistent workloads. Hourly or per-second billing with no lock-in.
- aid: fal-ai:fal-keys-api
  name: fal API Keys API
  tags:
  - AI
  - Administration
  - Authentication
  - API Keys
  humanURL: https://fal.ai/dashboard/keys
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/dashboard/keys
    type: Documentation
  description: 'Manage fal API keys — create, list, scope, and revoke keys used to authenticate against the Model, Storage,
    Serverless, and Compute APIs via the Authorization: Key $FAL_KEY header.'
- aid: fal-ai:fal-usage-billing-api
  name: fal Usage and Billing API
  tags:
  - AI
  - Administration
  - Usage
  - Billing
  - FinOps
  humanURL: https://fal.ai/dashboard/usage
  baseURL: https://rest.alpha.fal.ai
  properties:
  - url: https://fal.ai/dashboard/usage
    type: Documentation
  description: Programmatic access to usage metrics, per-model spend, GPU-second consumption, and invoicing history. Surfaces
    the same data shown on the fal dashboard so platform teams can pipe inference cost into internal FinOps tooling.
- aid: fal-ai:fal-ai-apps-api
  name: fal Apps API
  description: List and inspect deployed Serverless apps.
  humanURL: https://fal.ai/docs/model-apis/quickstart
  baseURL: https://queue.fal.run
  tags:
  - Apps
  properties:
  - type: OpenAPI
    url: openapi/fal-ai-apps-api-openapi.yml
  - type: Documentation
    url: https://fal.ai/docs/model-apis/quickstart
  - type: Documentation
    url: https://fal.ai/models
  - type: JSONSchema
    url: json-schema/fal-queue-request-schema.json
  - type: JSONSchema
    url: json-schema/fal-queue-status-schema.json
  - type: JSONLD
    url: json-ld/fal-ai-context.jsonld
  - type: GraphQL
    url: graphql/fal-ai-graphql.md
  - type: Documentation
    url: https://fal.ai/docs/model-apis/file-uploads
  - type: Documentation
    url: https://fal.ai/docs/private-serverless-models
  - type: SDKs
    url: https://github.com/fal-ai/fal
- aid: fal-ai:fal-ai-files-api
  name: fal Files API
  description: Manage files on persistent Serverless `/data` volumes.
  humanURL: https://fal.ai/docs/model-apis/quickstart
  baseURL: https://queue.fal.run
  tags:
  - Files
  properties:
  - type: OpenAPI
    url: openapi/fal-ai-files-api-openapi.yml
  - type: Documentation
    url: https://fal.ai/docs/model-apis/quickstart
  - type: Documentation
    url: https://fal.ai/models
  - type: JSONSchema
    url: json-schema/fal-queue-request-schema.json
  - type: JSONSchema
    url: json-schema/fal-queue-status-schema.json
  - type: JSONLD
    url: json-ld/fal-ai-context.jsonld
  - type: GraphQL
    url: graphql/fal-ai-graphql.md
  - type: Documentation
    url: https://fal.ai/docs/model-apis/file-uploads
  - type: Documentation
    url: https://fal.ai/docs/private-serverless-models
  - type: SDKs
    url: https://github.com/fal-ai/fal
- aid: fal-ai:fal-ai-queue-api
  name: fal Queue API
  description: Submit, inspect, and cancel model inference jobs.
  humanURL: https://fal.ai/docs/model-apis/quickstart
  baseURL: https://queue.fal.run
  tags:
  - Queue
  properties:
  - type: OpenAPI
    url: openapi/fal-ai-queue-api-openapi.yml
  - type: Documentation
    url: https://fal.ai/docs/model-apis/quickstart
  - type: Documentation
    url: https://fal.ai/models
  - type: JSONSchema
    url: json-schema/fal-queue-request-schema.json
  - type: JSONSchema
    url: json-schema/fal-queue-status-schema.json
  - type: JSONLD
    url: json-ld/fal-ai-context.jsonld
  - type: GraphQL
    url: graphql/fal-ai-graphql.md
  - type: Documentation
    url: https://fal.ai/docs/model-apis/file-uploads
  - type: Documentation
    url: https://fal.ai/docs/private-serverless-models
  - type: SDKs
    url: https://github.com/fal-ai/fal
- aid: fal-ai:fal-ai-secrets-api
  name: fal Secrets API
  description: Manage per-org secrets injected into Serverless runs.
  humanURL: https://fal.ai/docs/model-apis/quickstart
  baseURL: https://queue.fal.run
  tags:
  - Secrets
  properties:
  - type: OpenAPI
    url: openapi/fal-ai-secrets-api-openapi.yml
  - type: Documentation
    url: https://fal.ai/docs/model-apis/quickstart
  - type: Documentation
    url: https://fal.ai/models
  - type: JSONSchema
    url: json-schema/fal-queue-request-schema.json
  - type: JSONSchema
    url: json-schema/fal-queue-status-schema.json
  - type: JSONLD
    url: json-ld/fal-ai-context.jsonld
  - type: GraphQL
    url: graphql/fal-ai-graphql.md
  - type: Documentation
    url: https://fal.ai/docs/model-apis/file-uploads
  - type: Documentation
    url: https://fal.ai/docs/private-serverless-models
  - type: SDKs
    url: https://github.com/fal-ai/fal
- aid: fal-ai:fal-ai-storage-api
  name: fal Storage API
  description: Upload binary assets to the fal CDN.
  humanURL: https://fal.ai/docs/model-apis/quickstart
  baseURL: https://queue.fal.run
  tags:
  - Storage
  properties:
  - type: OpenAPI
    url: openapi/fal-ai-storage-api-openapi.yml
  - type: Documentation
    url: https://fal.ai/docs/model-apis/quickstart
  - type: Documentation
    url: https://fal.ai/models
  - type: JSONSchema
    url: json-schema/fal-queue-request-schema.json
  - type: JSONSchema
    url: json-schema/fal-queue-status-schema.json
  - type: JSONLD
    url: json-ld/fal-ai-context.jsonld
  - type: GraphQL
    url: graphql/fal-ai-graphql.md
  - type: Documentation
    url: https://fal.ai/docs/model-apis/file-uploads
  - type: Documentation
    url: https://fal.ai/docs/private-serverless-models
  - type: SDKs
    url: https://github.com/fal-ai/fal
- aid: fal-ai:fal-ai-streaming-api
  name: fal Streaming API
  description: Server-sent streaming of incremental model output.
  humanURL: https://fal.ai/docs/model-apis/quickstart
  baseURL: https://queue.fal.run
  tags:
  - Streaming
  properties:
  - type: OpenAPI
    url: openapi/fal-ai-streaming-api-openapi.yml
  - type: Documentation
    url: https://fal.ai/docs/model-apis/quickstart
  - type: Documentation
    url: https://fal.ai/models
  - type: JSONSchema
    url: json-schema/fal-queue-request-schema.json
  - type: JSONSchema
    url: json-schema/fal-queue-status-schema.json
  - type: JSONLD
    url: json-ld/fal-ai-context.jsonld
  - type: GraphQL
    url: graphql/fal-ai-graphql.md
  - type: Documentation
    url: https://fal.ai/docs/model-apis/file-uploads
  - type: Documentation
    url: https://fal.ai/docs/private-serverless-models
  - type: SDKs
    url: https://github.com/fal-ai/fal
name: fal
tags:
- AI
- Artificial Intelligence
- Generative AI
- Generative Media
- Image Generation
- Video Generation
- Audio Generation
- Inference
- Serverless
- GPU
- MCP
kind: contract
accessModel:
  pricing: paid
  onboarding: self-serve
  trial: false
  try_now: false
  public: false
  label: Paid · Self-serve signup
  confidence: high
  source:
  - plans
  - authentication
  generated: '2026-07-22'
  method: derived
image: https://kinlane-images.s3.amazonaws.com/shared/apis-json/icons/fal-ai.png
access: 3rd-Party
common:
- type: AgenticAccess
  url: agentic-access/fal-ai-agentic-access.yml
- type: DomainSecurity
  url: security/fal-ai-domain-security.yml
- type: Authentication
  url: authentication/fal-ai-authentication.yml
- type: Packages
  url: packages/fal-ai-packages.yml
- type: WellKnown
  url: well-known/fal-ai-well-known.yml
- type: MCPServer
  url: mcp/fal-ai-mcp.yml
- type: LLMsTxt
  url: llms/fal-ai-llms.txt
- type: LLMsTxt
  name: llms-full
  url: llms/fal-ai-llms-full.txt
- type: Conformance
  url: conformance/fal-ai-conformance.yml
- type: ErrorCatalog
  url: errors/fal-ai-problem-types.yml
- type: Lifecycle
  url: lifecycle/fal-ai-lifecycle.yml
- type: Conventions
  url: conventions/fal-ai-conventions.yml
- type: DataModel
  url: data-model/fal-ai-data-model.yml
- type: ChangeLog
  url: changelog/fal-ai-changelog.yml
- type: CLI
  url: cli/fal-ai-cli.yml
- type: PostmanWorkspace
  url: https://www.postman.com/kinlaneapi/fal/overview
- type: Arazzo
  url: arazzo/fal-ai-image-to-image-result-workflow.yml
  name: fal Upload, Run Image-To-Image With Webhook
- type: Arazzo
  url: arazzo/fal-ai-queue-inference-workflow.yml
  name: fal Queue Inference
- type: Arazzo
  url: arazzo/fal-ai-serverless-app-discovery-workflow.yml
  name: fal Serverless App Discovery
- type: Arazzo
  url: arazzo/fal-ai-serverless-app-files-workflow.yml
  name: fal Serverless App Files Inspection
- type: Arazzo
  url: arazzo/fal-ai-set-and-verify-secret-workflow.yml
  name: fal Set And Verify Serverless Secret
- type: Arazzo
  url: arazzo/fal-ai-stream-inference-workflow.yml
  name: fal Streaming Inference
- type: Arazzo
  url: arazzo/fal-ai-submit-and-cancel-workflow.yml
  name: fal Submit And Conditionally Cancel
- type: Arazzo
  url: arazzo/fal-ai-upload-then-inference-workflow.yml
  name: fal Upload Asset Then Run Inference
- type: Arazzo
  url: arazzo/fal-ai-webhook-submission-workflow.yml
  name: fal Webhook-Backed Submission
- type: Portal
  url: https://fal.ai
- type: Documentation
  url: https://fal.ai/docs
- type: Documentation
  name: Model APIs Quickstart
  url: https://fal.ai/docs/model-apis/quickstart
- type: Documentation
  name: Model Gallery
  url: https://fal.ai/models
- type: Documentation
  name: Authentication
  url: https://fal.ai/docs/authentication
- type: Documentation
  name: Webhooks
  url: https://fal.ai/docs/model-apis/webhooks
- type: Documentation
  name: Realtime
  url: https://fal.ai/docs/model-apis/real-time
- type: Documentation
  name: Streaming
  url: https://fal.ai/docs/model-apis/streaming
- type: Documentation
  name: File Uploads
  url: https://fal.ai/docs/model-apis/file-uploads
- type: Documentation
  name: Private Serverless Models
  url: https://fal.ai/docs/private-serverless-models
- type: GettingStarted
  url: https://fal.ai/docs/model-apis/quickstart
- type: StatusPage
  url: https://status.fal.ai
- type: Blog
  url: https://blog.fal.ai
- type: Signup
  url: https://fal.ai/login
- type: Pricing
  url: https://fal.ai/pricing
- type: Support
  name: Discord
  url: https://discord.gg/fal-ai
- type: Forums
  url: https://discord.gg/fal-ai
- type: TermsOfService
  url: https://fal.ai/legal/terms-of-service
- type: PrivacyPolicy
  url: https://fal.ai/legal/privacy-policy
- type: TrustCenter
  url: https://trust.fal.ai
- type: TrustCenter
  name: Trust Center Artifact
  url: security/fal-ai-trust-center.yml
- type: LinkedIn
  url: https://www.linkedin.com/company/featuresandlabels
- type: Twitter
  url: https://twitter.com/fal
- type: GitHubOrganization
  url: https://github.com/fal-ai
- type: SDKs
  name: fal Python Client
  url: https://github.com/fal-ai/fal-client-python
- type: SDKs
  name: fal JavaScript Client
  url: https://github.com/fal-ai/fal-js
- type: SDKs
  name: fal Swift Client
  url: https://github.com/fal-ai/fal-swift
- type: SDKs
  name: fal Java/Kotlin Client
  url: https://github.com/fal-ai/fal-java
- type: SDKs
  name: fal Dart/Flutter Client
  url: https://github.com/fal-ai/fal-dart
- type: SDKs
  name: fal Python SDK / Serverless
  url: https://github.com/fal-ai/fal
- type: Tools
  name: fal Terraform Provider
  url: https://github.com/fal-ai/terraform-provider-fal
- type: Tools
  name: fal Blender Extension
  url: https://github.com/fal-ai/fal-blender-extension
- type: Tools
  name: fal VS Code Extension (Serverless)
  url: https://github.com/fal-ai/serverless-vscode
- type: CodeExamples
  name: Awesome fal
  url: https://github.com/fal-ai/awesome
- type: CodeExamples
  name: Real-Time Demo App
  url: https://github.com/fal-ai/real-time-demo-app
- type: CodeExamples
  name: fal Next.js Template
  url: https://github.com/fal-ai/fal-nextjs-template
- type: Documentation
  name: MCP Server
  url: https://fal.ai/docs/mcp-server
- type: Documentation
  name: ComfyUI Integration
  url: https://fal.ai/docs/comfyui
- url: plans/fal-ai-plans-pricing.yml
  type: Plans
- url: rate-limits/fal-ai-rate-limits.yml
  type: RateLimits
- url: finops/fal-ai-finops.yml
  type: FinOps
- type: Features
  data:
  - Unified queue-based REST API at https://queue.fal.run/{model-id} for 1,000+ generative models
  - Image generation models — FLUX (Schnell, Dev, Pro, Kontext Pro), Seedream V4, Nano Banana, Qwen, SDXL, SD3, Ideogram,
    Recraft
  - Video generation models — Veo 3, Kling 2.5 Turbo Pro, Wan 2.5, Seedance 2.0, Ovi, Hunyuan, Sora-class
  - Audio and voice models — Inworld TTS-1.5, ElevenLabs, MMAudio, MusicGen, Stable Audio
  - 3D and multimodal models — TripoSR, Hunyuan3D, LivePortrait, FaceChain
  - Synchronous, asynchronous queue, server-sent streaming, and WebSocket realtime invocation modes
  - Webhook callbacks for queue completion with HMAC signature verification
  - File uploads / CDN storage at https://v3.fal.media with signed upload URLs
  - fal Serverless — `@fal.function`, `fal.App`, BYO container deployment with autoscaling from 0 to thousands of GPUs
  - fal Compute — dedicated H100/H200/A100/B200 instances with SSH and per-second billing
  - Per-output billing (image, video second, audio minute) plus per-second GPU billing for custom deployments
  - 99.99% uptime SLA, SOC 2 compliance, private endpoints, and enterprise support
  - Proprietary Inference Engine — up to 10x faster than reference implementations
  - Official SDKs for Python (fal-client), JavaScript/TypeScript (@fal-ai/client), Swift, Java/Kotlin, Dart
  - fal CLI for serverless deploy / run / apps / secrets / auth
  - fal MCP Server exposing all 1,000+ models to AI assistants via the Model Context Protocol
  - ComfyUI and Blender extensions, plus Terraform provider for infra-as-code
  - Day-zero launch partner for major model releases (FLUX, Veo, Kling, Seedance, Wan, etc.)
  sources:
  - https://fal.ai
  - https://fal.ai/docs
  - https://fal.ai/pricing
  - https://fal.ai/models
  - https://github.com/fal-ai
  - https://blog.fal.ai
  updated: '2026-05-25'
created: '2026-05-25'
modified: '2026-06-20'
position: Consuming
description: fal (Features and Labels, Inc.) is a generative media platform providing the world's fastest API for running
  image, video, audio, and multimodal generative AI models. Through a unified queue-based REST API at https://queue.fal.run,
  plus realtime WebSocket and SSE streaming surfaces, fal serves 1,000+ production models — including FLUX, Veo 3, Kling,
  Wan, Seedream, Nano Banana, and Stable Diffusion — on autoscaling GPU infrastructure. fal Serverless lets developers ship
  custom models with `@fal.function` / `fal.App` / BYO containers, while fal Compute provides dedicated H100/H200/A100/B200
  instances. Trusted by Canva, Perplexity, Poe, and 1.5M+ developers; Series D funded ($140M, Sequoia-led, December 2025);
  SOC 2 with 99.99% uptime.
maintainers:
- FN: Kin Lane
  email: info@apievangelist.com
  X: apievangelist
  url: https://apievangelist.com
specificationVersion: '0.16'

fal

APIs 12

Postman Collections 3

Open Collections 3

Arazzo Workflows 9

MCP Servers 1

GraphQL 1

Pricing Plans 1

Rate Limits 1

FinOps 1

Features 18

Event Specifications 1

Semantic Vocabularies 1

Spectral Rules 3

JSON Schema 3

JSON Structure 1

Examples 3

Security Posture 3

Agentic Access 1

Get Started 3

Documentation 11

Agent Surfaces 5

Design & Contract 14

Build 16

Access & Security 4

Operate 5

Commercial 5

Company 3

Source (apis.yml)