Cloudflare Workers AI
Updated 32 minutes ago
Cloudflare Workers AI runs AI models serverlessly on Cloudflare's global edge network, so you do not have to scale or maintain infrastructure or pay for capacity you are not using. The service supports models for text generation, embeddings, and other tasks, accessed through an API or an environment binding. It integrates with Cloudflare's AI Gateway for analytics and optimization. Key benefits include pay-per-use pricing, integration with the Vercel AI SDK through a community provider, and inference at the edge for low-latency applications worldwide.
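As a rough illustration of the API route mentioned above, the sketch below builds (but does not send) a request to the Workers AI REST endpoint. The helper name `buildWorkersAiRequest`, the placeholder account ID and token, and the `messages` payload shape are assumptions for illustration; input schemas differ between models, so check the model's own documentation before relying on this shape.

```typescript
// Minimal sketch: construct a Workers AI REST request without sending it.
// `buildWorkersAiRequest` is a hypothetical helper, not part of any SDK.

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface WorkersAiRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

function buildWorkersAiRequest(
  accountId: string,
  apiToken: string,
  model: string,
  messages: ChatMessage[],
): WorkersAiRequest {
  return {
    // The model ID (for example "@cf/meta/llama-4-scout-17b-16e-instruct")
    // is appended to the account-scoped run endpoint.
    url: `https://api.cloudflare.com/client/v4/accounts/${accountId}/ai/run/${model}`,
    init: {
      method: "POST",
      headers: {
        Authorization: `Bearer ${apiToken}`,
        "Content-Type": "application/json",
      },
      // Assumption: the chosen model accepts a chat-style `messages` array.
      body: JSON.stringify({ messages }),
    },
  };
}
```

The result can be passed to `fetch(request.url, request.init)`. Inside a Worker, the environment binding offers a similar call, typically `env.AI.run(model, { messages })`, assuming the binding is named `AI` in the Worker's configuration.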
Browse 7 LLM models available from Cloudflare Workers AI. Compare prices and features.
Models (7)
| Organization | Model Name | Original Model | Input | Output | Free |
|---|---|---|---|---|---|
| Moonshot AI | Kimi K2.6 | @cf/moonshotai/kimi-k2.6 | $0.95 | $4.00 | |
| Google | Gemma 4 26B-A4B | @cf/google/gemma-4-26b-a4b-it | $0.10 | $0.30 | |
| Moonshot AI | Kimi K2.5 | @cf/moonshotai/kimi-k2.5 | $0.60 | $3.00 | |
| Z.ai | GLM-4.7-Flash | @cf/zai-org/glm-4.7-flash | $0.06 | $0.40 | |
| OpenAI | GPT OSS 120B | @cf/openai/gpt-oss-120b | $0.35 | $0.75 | |
| OpenAI | GPT OSS 20B | @cf/openai/gpt-oss-20b | $0.20 | $0.30 | |
| Meta | llama-4-scout-17b-16e-instruct | @cf/meta/llama-4-scout-17b-16e-instruct | $0.27 | $0.85 | |
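
To compare the listed prices for a concrete workload, the sketch below estimates the cost of a request from its token counts. It assumes the Input and Output columns are USD per one million tokens, which the listing itself does not state; the `PRICES` map and the `estimateCostUsd` helper are hypothetical names introduced for illustration.

```typescript
// Minimal sketch: estimate request cost from the listed prices.
// Assumption: prices are USD per 1,000,000 tokens (not stated in the table).

interface ModelPrice {
  inputPerMillion: number;
  outputPerMillion: number;
}

// Prices copied from the table above, keyed by model ID.
const PRICES: Record<string, ModelPrice> = {
  "@cf/moonshotai/kimi-k2.6": { inputPerMillion: 0.95, outputPerMillion: 4.0 },
  "@cf/google/gemma-4-26b-a4b-it": { inputPerMillion: 0.1, outputPerMillion: 0.3 },
  "@cf/moonshotai/kimi-k2.5": { inputPerMillion: 0.6, outputPerMillion: 3.0 },
  "@cf/zai-org/glm-4.7-flash": { inputPerMillion: 0.06, outputPerMillion: 0.4 },
  "@cf/openai/gpt-oss-120b": { inputPerMillion: 0.35, outputPerMillion: 0.75 },
  "@cf/openai/gpt-oss-20b": { inputPerMillion: 0.2, outputPerMillion: 0.3 },
  "@cf/meta/llama-4-scout-17b-16e-instruct": { inputPerMillion: 0.27, outputPerMillion: 0.85 },
};

function estimateCostUsd(model: string, inputTokens: number, outputTokens: number): number {
  const price = PRICES[model];
  if (!price) {
    throw new Error(`Unknown model: ${model}`);
  }
  // Each side is billed separately, then scaled from per-million to per-token.
  return (inputTokens * price.inputPerMillion + outputTokens * price.outputPerMillion) / 1e6;
}
```

Under that assumption, one million input tokens plus one million output tokens on `@cf/openai/gpt-oss-120b` would come to roughly $1.10, the sum of the two listed prices.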