# Azure OpenAI Azure OpenAI Service is Microsoft's cloud-based artificial intelligence service that combines OpenAI's advanced generative AI models including GPT-3, GPT-4, Codex, and Embeddings model series with the enterprise-grade security, privacy, and compliance capabilities of Microsoft Azure. The service provides REST API access to these powerful models, enabling use cases like creating chatbots, conversational AI, text generation, language translation, creative content writing, and natural language processing. Azure OpenAI acts as a managed service provider that hosts OpenAI models on Azure infrastructure while offering fine-tuning capabilities, eliminating infrastructure management for organizations. ## Provider Information - **Website**: - **Available Models**: 93 ## Models | Name | Original Name | $ Input Price (per 1M) | $ Output Price (per 1M) | Free | Link | |------|---------------|---------------------|----------------------|------|------| | GPT-4.1 nano | gpt-4.1-nano | 0.10 | 0.40 | | | | text-embedding-3-small | text-embedding-3-small | 0.02 | 0.00 | | | | Grok 4 Fast (Non-Reasoning) | grok-4-fast-non-reasoning | 0.20 | 0.50 | | | | DeepSeek-R1-0528 | deepseek-r1-0528 | 1.35 | 5.40 | | | | Grok 4 Fast (Reasoning) | grok-4-fast-reasoning | 0.20 | 0.50 | | | | Phi-3-medium-instruct (128k) | phi-3-medium-128k-instruct | 0.17 | 0.68 | | | | GPT-4 | gpt-4 | 60.00 | 120.00 | | | | Claude Opus 4.1 | claude-opus-4-1 | 15.00 | 75.00 | | | | GPT-5.2 Chat | gpt-5.2-chat | 1.75 | 14.00 | | | | Llama-3.2-11B-Vision-Instruct | llama-3.2-11b-vision-instruct | 0.37 | 0.37 | | | | Embed v4 | cohere-embed-v-4-0 | 0.12 | 0.00 | | | | Command R | cohere-command-r-08-2024 | 0.15 | 0.60 | | | | Grok 4 | grok-4 | 3.00 | 15.00 | | | | Embed v3 Multilingual | cohere-embed-v3-multilingual | 0.10 | 0.00 | | | | Phi-4-mini | phi-4-mini | 0.08 | 0.30 | | | | GPT-4 32K | gpt-4-32k | 60.00 | 120.00 | | | | GPT-5.2 Codex | gpt-5.2-codex | 1.75 | 14.00 | | | | Meta-Llama-3.1-405B-Instruct | meta-llama-3.1-405b-instruct | 5.33 | 16.00 | | | | DeepSeek-R1 | deepseek-r1 | 1.35 | 5.40 | | | | Grok Code Fast 1 | grok-code-fast-1 | 0.20 | 1.50 | | | | GPT-5.1 Codex | gpt-5.1-codex | 1.25 | 10.00 | | | | Phi-3-mini-instruct (4k) | phi-3-mini-4k-instruct | 0.13 | 0.52 | | | | Claude Haiku 4.5 | claude-haiku-4-5 | 1.00 | 5.00 | | | | DeepSeek-V3.2-Speciale | deepseek-v3.2-speciale | 0.58 | 1.68 | | | | Mistral Medium 3 | mistral-medium-2505 | 0.40 | 2.00 | | | | Claude Opus 4.5 | claude-opus-4-5 | 5.00 | 25.00 | | | | Phi-3-small-instruct (128k) | phi-3-small-128k-instruct | 0.15 | 0.60 | | | | Command A | cohere-command-a | 2.50 | 10.00 | | | | Command R+ | cohere-command-r-plus-08-2024 | 2.50 | 10.00 | | | | Llama 4 Maverick 17B 128E Instruct FP8 | llama-4-maverick-17b-128e-instruct-fp8 | 0.25 | 1.00 | | | | GPT-4.1 mini | gpt-4.1-mini | 0.40 | 1.60 | | | | GPT-5 Chat | gpt-5-chat | 1.25 | 10.00 | | | | DeepSeek-V3.1 | deepseek-v3.1 | 0.56 | 1.68 | | | | Phi-4 | phi-4 | 0.13 | 0.50 | | | | Phi-4-mini-reasoning | phi-4-mini-reasoning | 0.08 | 0.30 | | | | Claude Sonnet 4.5 | claude-sonnet-4-5 | 3.00 | 15.00 | | | | GPT-3.5 Turbo 0125 | gpt-3.5-turbo-0125 | 0.50 | 1.50 | | | | Grok 3 | grok-3 | 3.00 | 15.00 | | | | text-embedding-3-large | text-embedding-3-large | 0.13 | 0.00 | | | | Meta-Llama-3-70B-Instruct | meta-llama-3-70b-instruct | 2.68 | 3.54 | | | | DeepSeek-V3-0324 | deepseek-v3-0324 | 1.14 | 4.56 | | | | Phi-3-small-instruct (8k) | phi-3-small-8k-instruct | 0.15 | 0.60 | | | | Meta-Llama-3.1-70B-Instruct | meta-llama-3.1-70b-instruct | 2.68 | 3.54 | | | | GPT-4 Turbo | gpt-4-turbo | 10.00 | 30.00 | | | | GPT-3.5 Turbo 0613 | gpt-3.5-turbo-0613 | 3.00 | 4.00 | | | | Phi-3.5-mini-instruct | phi-3.5-mini-instruct | 0.13 | 0.52 | | | | o1-preview | o1-preview | 16.50 | 66.00 | | | | Llama-3.3-70B-Instruct | llama-3.3-70b-instruct | 0.71 | 0.71 | | | | GPT-5.1 Codex Mini | gpt-5.1-codex-mini | 0.25 | 2.00 | | | | Kimi K2 Thinking | kimi-k2-thinking | 0.60 | 2.50 | | | | Model Router | model-router | 0.14 | 0.00 | | | | o3-mini | o3-mini | 1.10 | 4.40 | | | | GPT-5.1 | gpt-5.1 | 1.25 | 10.00 | | | | GPT-5 Nano | gpt-5-nano | 0.05 | 0.40 | | | | GPT-5-Codex | gpt-5-codex | 1.25 | 10.00 | | | | Llama-3.2-90B-Vision-Instruct | llama-3.2-90b-vision-instruct | 2.04 | 2.04 | | | | Phi-3-mini-instruct (128k) | phi-3-mini-128k-instruct | 0.13 | 0.52 | | | | GPT-4o | gpt-4o | 2.50 | 10.00 | | | | GPT-3.5 Turbo 0301 | gpt-3.5-turbo-0301 | 1.50 | 2.00 | | | | Ministral 3B | ministral-3b | 0.04 | 0.04 | | | | GPT-4.1 | gpt-4.1 | 2.00 | 8.00 | | | | o4-mini | o4-mini | 1.10 | 4.40 | | | | Phi-4-multimodal | phi-4-multimodal | 0.08 | 0.32 | | | | Meta-Llama-3-8B-Instruct | meta-llama-3-8b-instruct | 0.30 | 0.61 | | | | o1 | o1 | 15.00 | 60.00 | | | | Grok 3 Mini | grok-3-mini | 0.30 | 0.50 | | | | GPT-5.1 Chat | gpt-5.1-chat | 1.25 | 10.00 | | | | Phi-3.5-MoE-instruct | phi-3.5-moe-instruct | 0.16 | 0.64 | | | | GPT-5 Mini | gpt-5-mini | 0.25 | 2.00 | | | | o1-mini | o1-mini | 1.10 | 4.40 | | | | Llama 4 Scout 17B 16E Instruct | llama-4-scout-17b-16e-instruct | 0.20 | 0.78 | | | | Embed v3 English | cohere-embed-v3-english | 0.10 | 0.00 | | | | text-embedding-ada-002 | text-embedding-ada-002 | 0.10 | 0.00 | | | | Meta-Llama-3.1-8B-Instruct | meta-llama-3.1-8b-instruct | 0.30 | 0.61 | | | | GPT-5.1 Codex Max | gpt-5.1-codex-max | 1.25 | 10.00 | | | | GPT-3.5 Turbo Instruct | gpt-3.5-turbo-instruct | 1.50 | 2.00 | | | | Mistral Nemo | mistral-nemo | 0.15 | 0.15 | | | | o3 | o3 | 2.00 | 8.00 | | | | Codex Mini | codex-mini | 1.50 | 6.00 | | | | Phi-3-medium-instruct (4k) | phi-3-medium-4k-instruct | 0.17 | 0.68 | | | | Phi-4-reasoning | phi-4-reasoning | 0.13 | 0.50 | | | | GPT-4 Turbo Vision | gpt-4-turbo-vision | 10.00 | 30.00 | | | | Phi-4-reasoning-plus | phi-4-reasoning-plus | 0.13 | 0.50 | | | | GPT-4o mini | gpt-4o-mini | 0.15 | 0.60 | | | | GPT-5 | gpt-5 | 1.25 | 10.00 | | | | MAI-DS-R1 | mai-ds-r1 | 1.35 | 5.40 | | | | DeepSeek-V3.2 | deepseek-v3.2 | 0.58 | 1.68 | | | | GPT-5 Pro | gpt-5-pro | 15.00 | 120.00 | | | | Mistral Large 24.11 | mistral-large-2411 | 2.00 | 6.00 | | | | GPT-5.2 | gpt-5.2 | 1.75 | 14.00 | | | | Codestral 25.01 | codestral-2501 | 0.30 | 0.90 | | | | Mistral Small 3.1 | mistral-small-2503 | 0.10 | 0.30 | | | | GPT-3.5 Turbo 1106 | gpt-3.5-turbo-1106 | 1.00 | 2.00 | | | --- [← Back to all providers](/llm.txt)