Voyage AI Multimodal Embeddings API

Multimodal embeddings endpoint backed by voyage-multimodal-3 that accepts interleaved text and images in a single request and returns embeddings in a shared vector space, enabling cross-modal retrieval for documents that mix text, screenshots, charts, and figures.

API entry from apis.yml

apis.yml Raw ↑
aid: voyage-ai:multimodal-embeddings
name: Voyage AI Multimodal Embeddings API
description: Multimodal embeddings endpoint backed by voyage-multimodal-3 that accepts interleaved text
  and images in a single request and returns embeddings in a shared vector space, enabling cross-modal
  retrieval for documents that mix text, screenshots, charts, and figures.
humanURL: https://docs.voyageai.com/reference/multimodal-embeddings-api
baseURL: https://api.voyageai.com/v1
tags:
- Embeddings
- Multimodal
- Vision
properties:
- type: Documentation
  url: https://docs.voyageai.com/reference/multimodal-embeddings-api