Fireworks.ai

Lightning-fast generative AI inference

Experience the world's fastest Generative AI inference platform. Use a state-of-the-art, open-source model or fine-tune and deploy your own at no additional cost, with Fireworks.ai.

Query Large Language Models with Fireworks.ai

It is easy to get started with the Fireworks.ai LLM API. You can use the Python client (installable from PyPI):
import fireworks.client

# Authenticate with your Fireworks.ai API key.
fireworks.client.api_key = "your-key"

# Request a 16-token completion from the Llama 2 7B base model.
completion = fireworks.client.Completion.create(
    "accounts/fireworks/models/llama-v2-7b",
    "Once upon a time",
    max_tokens=16,
)
Or call the API directly with curl:
curl --request POST \
  --url https://api.fireworks.ai/inference/v1/chat/completions \
  --header 'Accept: application/json' \
  --header 'Authorization: Bearer YOUR_TOKEN_HERE' \
  --data '{
    "messages": [
      {
        "role": "user",
        "content": "hello there!"
      }
    ],
    "model": "accounts/fireworks/models/llama-v2-7b-chat"
  }'
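You can also call the same chat completions endpoint from Python. The sketch below mirrors the curl call above using the third-party requests package; the endpoint, model name, and message payload are taken from that example, while reading the API key from a FIREWORKS_API_KEY environment variable is an assumption made for illustration.

import os
import requests

# POST to the chat completions endpoint, mirroring the curl example above.
# Assumes the API key is stored in the FIREWORKS_API_KEY environment variable.
url = "https://api.fireworks.ai/inference/v1/chat/completions"
headers = {
    "Accept": "application/json",
    "Authorization": f"Bearer {os.environ['FIREWORKS_API_KEY']}",
}
payload = {
    "model": "accounts/fireworks/models/llama-v2-7b-chat",
    "messages": [{"role": "user", "content": "hello there!"}],
}

response = requests.post(url, headers=headers, json=payload)
response.raise_for_status()

# The reply follows the OpenAI-compatible schema: the assistant message
# is in the first element of "choices".
print(response.json()["choices"][0]["message"]["content"])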

Fireworks Supported Models

Capybara 34B

34B LLM model from NousResearch, based on Yi-34B

Yi 6B

6B LLM model from 01.ai

LLaVA v1.5 13B

A multi-modal model that takes both image and text as input.

Segmind Stable Diffusion 1B (SSD-1B)

Image generation model. Distilled from Stable Diffusion XL 1.0 and 50% smaller.

Mistral 7B Instruct

Mistral-7B model fine-tuned for conversation

Stable Diffusion XL

An image generation model produced by Stability AI.

Llama 2 13B code instruct

An instruction-tuned version of Llama 2 13B, optimized for code generation.

Llama 2 34B Code Llama instruct

Code Llama 34B, optimized for code generation.

Llama 2 7B Chat

A fine-tuned version of Llama 2 7B, optimized for dialogue applications using Reinforcement Learning from Human Feedback (RLHF); it performs comparably to ChatGPT according to human evaluations.

Llama 2 13B Chat

A fine-tuned version of Llama 2 13B, optimized for dialogue applications using Reinforcement Learning from Human Feedback (RLHF); it performs comparably to ChatGPT according to human evaluations.

Llama 2 70B Chat

A fine-tuned version of Llama 2 70B, optimized for dialogue applications using Reinforcement Learning from Human Feedback (RLHF); it performs comparably to ChatGPT according to human evaluations.

StarCoder 7B int8

A 7B parameter model trained on 80+ programming languages from The Stack (v1.2), using Multi Query Attention and the Fill-in-the-Middle objective.

StarCoder 15.5B int8

A 15.5B parameter model trained on 80+ programming languages from The Stack (v1.2), using Multi Query Attention and the Fill-in-the-Middle objective; a Fill-in-the-Middle usage sketch follows below.
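Because the StarCoder models are trained with the Fill-in-the-Middle objective, they can fill a gap between a known prefix and suffix instead of only continuing a prompt. The sketch below reuses the Python client from the quick-start above; the <fim_prefix>/<fim_suffix>/<fim_middle> sentinel tokens are the ones used in StarCoder's training, while the model path "accounts/fireworks/models/starcoder-7b" is an assumption, so check the model page for the exact identifier.

import fireworks.client

fireworks.client.api_key = "your-key"

# Fill-in-the-Middle: give the model the code before and after a gap and
# let it generate the missing middle, using StarCoder's sentinel tokens.
prefix = "def fibonacci(n):\n    a, b = 0, 1\n    "
suffix = "\n    return a"
fim_prompt = f"<fim_prefix>{prefix}<fim_suffix>{suffix}<fim_middle>"

# NOTE: the model path below is an assumed placeholder; check the Fireworks
# model catalog for the exact StarCoder identifier.
completion = fireworks.client.Completion.create(
    "accounts/fireworks/models/starcoder-7b",
    fim_prompt,
    max_tokens=64,
)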

falcon-7b

A 7B parameter causal decoder-only model developed by TII, trained on 1,000B tokens of RefinedWeb enhanced with curated corpora. It is optimized for inference and outperforms comparable open models of its size.

Community Open Source Add-ons

We will also support uploading your own add-ons to the platform so that you can share them with everyone. Here are a few example add-ons from the open-source community.

beomi-llama-2-ko-7b

Community add-on for Korean from Beomi

linksoul-chinese-llama-2-7b

Community add-on for Chinese and English from LinkSoul

flagalpha-llama2-chinese-7b-chat

Community add-on for Chinese and English from FlagAlpha

traditional-chinese-qlora-llama2

Community add-on for Traditional Chinese

Llama 2 13B French

Fine-tuned meta-llama/Llama-2-13b-chat-hf to answer French questions in French.

Chinese Llama 2 LoRA 7B

The LoRA version of Chinese-Llama-2, based on Llama-2-7b-hf.

Bleat

Bleat allows you to enable function calling in LLaMA 2 in a similar fashion to OpenAI's implementation for ChatGPT.

Llama2 13B Guanaco QLoRA GGML

A fine-tuned Llama 2 13B model using the OpenAssistant dataset.

llama2-7b-summarize

Community add-on for summarization.
