Models and Pricing

Avian API Models

Models

| Model | Description | Context Length | Speed (tokens/s) |
| --- | --- | --- | --- |
| Meta-Llama-3.1-405B-Instruct | High-intelligence model for complex, multi-step tasks | 131,072 tokens | ~130 |
| Meta-Llama-3.3-70B-Instruct | High intelligence at a lower price than Llama 405B | 131,072 tokens | ~250 |
| Meta-Llama-3.1-70B-Instruct | Fast, versatile model for general use | 131,072 tokens | ~250 |
| Meta-Llama-3.1-8B-Instruct | Extremely fast model for high-throughput tasks | 131,072 tokens | ~500 |
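
As a rough illustration of how the speed and context-length figures can be used for planning, the sketch below estimates generation time and checks a request against the context window. The function names are hypothetical and not part of any Avian SDK. It assumes the listed speeds are approximate output tokens per second and that the context length covers prompt and output tokens combined; neither assumption is stated in the table.

```python
# Approximate speeds (tokens/second) copied from the models table.
# Assumption: these are output (generation) speeds.
SPEED_TOKENS_PER_SECOND = {
    "Meta-Llama-3.1-405B-Instruct": 130,
    "Meta-Llama-3.3-70B-Instruct": 250,
    "Meta-Llama-3.1-70B-Instruct": 250,
    "Meta-Llama-3.1-8B-Instruct": 500,
}

# All four listed models share the same context length.
CONTEXT_LENGTH_TOKENS = 131_072


def estimate_generation_seconds(model: str, output_tokens: int) -> float:
    """Rough time to generate output_tokens at the listed speed."""
    return output_tokens / SPEED_TOKENS_PER_SECOND[model]


def fits_in_context(prompt_tokens: int, output_tokens: int) -> bool:
    """Assumption: prompt and output tokens share one context window."""
    return prompt_tokens + output_tokens <= CONTEXT_LENGTH_TOKENS


if __name__ == "__main__":
    model = "Meta-Llama-3.1-8B-Instruct"
    print(estimate_generation_seconds(model, 1_000))  # 2.0
    print(fits_in_context(100_000, 4_000))            # True
```

Because the speeds are approximate, the result is best treated as an order-of-magnitude estimate rather than a latency guarantee.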

Avian API Pricing

| Model | Input Price (per 1M tokens) | Output Price (per 1M tokens) |
| --- | --- | --- |
| Meta-Llama-3.1-405B-Instruct | $1.50 USD | $1.50 USD |
| Meta-Llama-3.3-70B-Instruct | $0.45 USD | $0.45 USD |
| Meta-Llama-3.1-70B-Instruct | $0.45 USD | $0.45 USD |
| Meta-Llama-3.1-8B-Instruct | $0.10 USD | $0.10 USD |
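
To show how the per-1M-token prices translate into the cost of a request, here is a minimal sketch. The helper `estimate_cost_usd` is a hypothetical name, not an Avian API call. It assumes cost scales linearly with token count, with no minimum charge or rounding, which the pricing table does not confirm.

```python
# (input price, output price) in USD per 1M tokens, copied from the pricing table.
PRICE_PER_MILLION_USD = {
    "Meta-Llama-3.1-405B-Instruct": (1.50, 1.50),
    "Meta-Llama-3.3-70B-Instruct": (0.45, 0.45),
    "Meta-Llama-3.1-70B-Instruct": (0.45, 0.45),
    "Meta-Llama-3.1-8B-Instruct": (0.10, 0.10),
}


def estimate_cost_usd(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated cost in USD, assuming simple linear per-token billing."""
    input_price, output_price = PRICE_PER_MILLION_USD[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # 2M input tokens and 1M output tokens on the 405B model.
    cost = estimate_cost_usd("Meta-Llama-3.1-405B-Instruct", 2_000_000, 1_000_000)
    print(f"${cost:.2f}")  # $4.50
```

Input and output prices are kept separate in the lookup even though they are currently equal for every model, so the sketch still works if the two rates diverge.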

Avian Dedicated Deployments Pricing

| GPU | Price (billed per second) | Memory |
| --- | --- | --- |
| H200 SXM | From $0.00208 | 141 GB HBM3 |
| H100 SXM | From $0.00139 | 80 GB HBM3 |
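
Per-second prices are easier to compare once converted to hourly or monthly figures. The sketch below does that conversion. The names are illustrative only, and it assumes the listed prices are in USD per GPU per second. Since the prices are quoted as "From", the results are lower bounds rather than exact quotes.

```python
# Starting per-second prices copied from the dedicated deployments table.
# Assumption: USD per GPU per second; "From" means actual rates may be higher.
PRICE_PER_SECOND_USD = {
    "H200 SXM": 0.00208,
    "H100 SXM": 0.00139,
}

SECONDS_PER_HOUR = 3_600


def minimum_cost_usd(gpu: str, seconds: float, gpu_count: int = 1) -> float:
    """Lower-bound cost of running gpu_count GPUs for the given duration."""
    return PRICE_PER_SECOND_USD[gpu] * seconds * gpu_count


if __name__ == "__main__":
    for gpu in PRICE_PER_SECOND_USD:
        hourly = minimum_cost_usd(gpu, SECONDS_PER_HOUR)
        print(f"{gpu}: from ${hourly:.3f} per hour")
```

At the listed starting rates this works out to about $7.49 per hour for an H200 SXM and about $5.00 per hour for an H100 SXM.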
