Groq logo

Groq

Free tier

Fast, low-cost AI inference powered by custom LPU silicon

Free tier available·All audiences·Powered by Groq (third-party models hosted on LPU infrastructure)·API available

Key strengths

Industry-leading inference speed via proprietary LPU chipOpenAI-compatible API for drop-in migrationSignificantly lower inference cost vs. GPU-based providersGlobal data center deployment for low-latency responsesFree API tier with access to top open models
Free tier + paid plans
San Jose, USA
Founded 2016
No ratings yet

Developer Setup

Authentication & Endpoint

Groq is OpenAI API-compatible. Point your existing OpenAI SDK to Groq's endpoint:

import os
import openai

client = openai.OpenAI(
    base_url="https://api.groq.com/openai/v1",
    api_key=os.environ.get("GROQ_API_KEY")
)
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://api.groq.com/openai/v1",
  apiKey: process.env.GROQ_API_KEY,
});

Key Details

  • API Base URL: https://api.groq.com/openai/v1
  • Auth: Bearer token via GROQ_API_KEY environment variable
  • SDKs: Works with the official OpenAI Python and JavaScript/TypeScript SDKs, or any OpenAI-compatible HTTP client
  • Free API Key: Available at no cost directly from groq.com/developers
  • Model Selection: Pass the model name in the model parameter (e.g., llama3-8b-8192, mixtral-8x7b-32768)

Supported Capabilities

  • Chat completions (streaming and non-streaming)
  • Multiple open & third-party model options
  • MoE (Mixture of Experts) and large model support
  • Global data center routing for low-latency inference