Cloudflare Workers AI icon

Cloudflare Workers AI

cloudflareworkersai

Updated 1 hour ago

Cloudflare Workers AI allows you to run AI models in a serverless way on Cloudflare's global edge network, eliminating the need to worry about scaling, maintaining, or paying for unused infrastructure. The service supports various AI models for text generation, embeddings, and other tasks through a simple API or environment binding. It integrates seamlessly with Cloudflare's AI Gateway for analytics and optimization. Key benefits include pay-per-use pricing, easy API and SDK integration through their community provider for Vercel AI SDK, and the ability to execute AI inference at the edge for ultra-low latency applications worldwide.

Browse 15 LLM models available from Cloudflare Workers AI. Compare prices and features.

Models (15)

Organization Model Name Original Model Input Output Free
Moonshot AI
Moonshot AI Kimi K2.6 @cf/moonshotai/kimi-k2.6 $0.95 $4.00
Moonshot AI
Moonshot AI Kimi K2.7 Code @cf/moonshotai/kimi-k2.7-code $0.95 $4.00
google
google Gemma 4 26B-A4B @cf/google/gemma-4-26b-a4b-it $0.10 $0.30
Z.ai
Z.ai GLM-4.7-Flash @cf/zai-org/glm-4.7-flash $0.06 $0.40
OpenAI
OpenAI GPT OSS 120B @cf/openai/gpt-oss-120b $0.35 $0.75
OpenAI
OpenAI GPT OSS 20B @cf/openai/gpt-oss-20b $0.20 $0.30
qwen
qwen QwQ-32B @cf/qwen/qwq-32b $0.66 $1.00
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 32B @cf/deepseek-ai/deepseek-r1-distill-qwen-32b $0.50 $4.88
Meta
Meta Llama 3.2 3B Instruct @cf/meta/llama-3.2-3b-instruct $0.05 $0.34
Mistral
Mistral Mistral Small 3.1 24B (free) @cf/mistralai/mistral-small-3.1-24b-instruct $0.35 $0.56
Groq
Groq Llama Guard 3 8B @cf/meta/llama-guard-3-8b $0.48 $0.03
Nvidia
Nvidia Llama 3.2 11b Vision Instruct @cf/meta/llama-3.2-11b-vision-instruct $0.05 $0.68
Meta
Meta llama-3.2-1b-instruct @cf/meta/llama-3.2-1b-instruct $0.03 $0.20
qwen
qwen Qwen2.5-Coder 32B Instruct @cf/qwen/qwen2.5-coder-32b-instruct $0.66 $1.00
Meta
Meta llama-4-scout-17b-16e-instruct @cf/meta/llama-4-scout-17b-16e-instruct $0.27 $0.85