Cloudflare Workers AI icon

Cloudflare Workers AI

cloudflareworkersai

Updated 32 minutes ago

Cloudflare Workers AI allows you to run AI models in a serverless way on Cloudflare's global edge network, eliminating the need to worry about scaling, maintaining, or paying for unused infrastructure. The service supports various AI models for text generation, embeddings, and other tasks through a simple API or environment binding. It integrates seamlessly with Cloudflare's AI Gateway for analytics and optimization. Key benefits include pay-per-use pricing, easy API and SDK integration through their community provider for Vercel AI SDK, and the ability to execute AI inference at the edge for ultra-low latency applications worldwide.

Browse 7 LLM models available from Cloudflare Workers AI. Compare prices and features.

Models (7)

Organization Model Name Original Model Input Output Free
Moonshot AI
Moonshot AI Kimi K2.6 @cf/moonshotai/kimi-k2.6 $0.95 $4.00
google
google Gemma 4 26B-A4B @cf/google/gemma-4-26b-a4b-it $0.10 $0.30
Moonshot AI
Moonshot AI Kimi K2.5 @cf/moonshotai/kimi-k2.5 $0.60 $3.00
Z.ai
Z.ai GLM-4.7-Flash @cf/zai-org/glm-4.7-flash $0.06 $0.40
OpenAI
OpenAI GPT OSS 120B @cf/openai/gpt-oss-120b $0.35 $0.75
OpenAI
OpenAI GPT OSS 20B @cf/openai/gpt-oss-20b $0.20 $0.30
Meta
Meta llama-4-scout-17b-16e-instruct @cf/meta/llama-4-scout-17b-16e-instruct $0.27 $0.85