Qwen2.5 72B Instruct - Fireworks AI

Qwen2.5 72B Instruct

accounts/fireworks/models/qwen2p5-72b-instruct

Qwen

LLM

Serverless

Tunable

Qwen2.5 are a series of decoder-only language models developed by Qwen team, Alibaba Cloud, available in 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B sizes, and base and instruct variants.

Features

Serverless API
Docs
Qwen2.5 72B Instruct is available via Fireworks' serverless API, where you pay per token. There are several ways to call the Fireworks API, including Fireworks' Python client, the REST API, or OpenAI's Python client.
Fine-tuning
Docs
Qwen2.5 72B Instruct can be fine-tuned on your data to create a model with better response quality. Fireworks uses low-rank adaptation (LoRA) to train a model that can be served efficiently at inference time.
On-demand Deployments
Docs
On-demand deployments allow you to use Qwen2.5 72B Instruct on dedicated GPUs with Fireworks' high-performance serving stack with high reliability and no rate limits.

Available Serverless

Run queries immediately, pay only for usage

$0.90

Per 1M Tokens

import requests
import json

url = "https://api.fireworks.ai/inference/v1/chat/completions"
payload = {
  "model": "accounts/fireworks/models/qwen2p5-72b-instruct",
  "max_tokens": 4096,
  "top_p": 1,
  "top_k": 40,
  "presence_penalty": 0,
  "frequency_penalty": 0,
  "temperature": 0.6,
  "messages": [
    {
      "role": "user",
      "content": "Hello, how are you?"
    }
  ]
}
headers = {
  "Accept": "application/json",
  "Content-Type": "application/json",
  "Authorization": "Bearer <API_KEY>"
}
requests.request("POST", url, headers=headers, data=json.dumps(payload))

Metadata

State

Ready

Created on

10/2/2024

Kind

Base model

Provider

Qwen

Hugging Face

Visit link

Specification

Calibrated

Mixture-of-Experts

Parameters

72B

Supported Functionality

Fine-tuning

Supported

Serverless

Supported

Serverless LoRA

Supported

Context Length

32k tokens

Function Calling

Supported

Features

Serverless API

Fine-tuning

On-demand Deployments

Available Serverless

$0.90

Metadata

Specification

Supported Functionality

Features

Serverless API

Fine-tuning

On-demand Deployments

Available Serverless

$0.90

Metadata

Specification

Supported Functionality