Groq GraphQL API
Groq provides ultra-fast LLM inference via their Language Processing Unit (LPU) hardware. The API is OpenAI-compatible and covers chat completions, audio transcription, and batch processing with models including Llama, Mixtral, and Gemma.
Overview
Groq GraphQL API is a GraphQL API specification published by Groq on the APIs.io network.
Groq provides ultra-fast LLM inference via their Language Processing Unit (LPU) hardware. The API is OpenAI-compatible and covers chat completions, audio transcription, and batch processing with models including Llama, Mixtral, and Gemma.
The GraphQL endpoint is available at No. documentation is published at https://console.groq.com/docs.
The specification includes 2 reference links.
Tagged areas include AI, LLM, Inference, LPU, and Low Latency.