# Azure AI Services

Azure AI Services (formerly Cognitive Services) is Microsoft's comprehensive suite of cloud-based APIs and services that enable developers to integrate artificial intelligence capabilities into applications without requiring extensive AI expertise. The services provide a wide range of AI features including Computer Vision for image analysis and processing, Natural Language Processing for text understanding and analysis, Speech Recognition for voice-to-text conversion, Machine Translation for language translation, and Decision services. A single Azure AI services resource allows access to multiple services with one set of credentials, supporting both prebuilt and customizable models with a layered security model including virtual network configuration.

## Provider Information

- **Website**:
- **Available Models**: 95

## Models

| Name | Original Name | Input Price ($ per 1M tokens) | Output Price ($ per 1M tokens) | Free | Link |
|------|---------------|---------------------|----------------------|------|------|
| GPT-3.5 Turbo 1106 | gpt-3.5-turbo-1106 | 1.00 | 2.00 | | |
| Mistral Small 3.1 | mistral-small-2503 | 0.10 | 0.30 | | |
| Codestral 25.01 | codestral-2501 | 0.30 | 0.90 | | |
| Mistral Large 24.11 | mistral-large-2411 | 2.00 | 6.00 | | |
| GPT-5 Pro | gpt-5-pro | 15.00 | 120.00 | | |
| DeepSeek-V3.2 | deepseek-v3.2 | 0.58 | 1.68 | | |
| MAI-DS-R1 | mai-ds-r1 | 1.35 | 5.40 | | |
| GPT-5 | gpt-5 | 1.25 | 10.00 | | |
| GPT-4o mini | gpt-4o-mini | 0.15 | 0.60 | | |
| Phi-4-reasoning-plus | phi-4-reasoning-plus | 0.13 | 0.50 | | |
| GPT-4 Turbo Vision | gpt-4-turbo-vision | 10.00 | 30.00 | | |
| Phi-4-reasoning | phi-4-reasoning | 0.13 | 0.50 | | |
| Phi-3-medium-instruct (4k) | phi-3-medium-4k-instruct | 0.17 | 0.68 | | |
| Codex Mini | codex-mini | 1.50 | 6.00 | | |
| o3 | o3 | 2.00 | 8.00 | | |
| Mistral Nemo | mistral-nemo | 0.15 | 0.15 | | |
| GPT-3.5 Turbo Instruct | gpt-3.5-turbo-instruct | 1.50 | 2.00 | | |
| Meta-Llama-3.1-8B-Instruct | meta-llama-3.1-8b-instruct | 0.30 | 0.61 | | |
| text-embedding-ada-002 | text-embedding-ada-002 | 0.10 | 0.00 | | |
| Embed v3 English | cohere-embed-v3-english | 0.10 | 0.00 | | |
| Llama 4 Scout 17B 16E Instruct | llama-4-scout-17b-16e-instruct | 0.20 | 0.78 | | |
| o1-mini | o1-mini | 1.10 | 4.40 | | |
| GPT-5 Mini | gpt-5-mini | 0.25 | 2.00 | | |
| Phi-3.5-MoE-instruct | phi-3.5-moe-instruct | 0.16 | 0.64 | | |
| GPT-5.1 Chat | gpt-5.1-chat | 1.25 | 10.00 | | |
| Grok 3 Mini | grok-3-mini | 0.30 | 0.50 | | |
| o1 | o1 | 15.00 | 60.00 | | |
| Meta-Llama-3-8B-Instruct | meta-llama-3-8b-instruct | 0.30 | 0.61 | | |
| Phi-4-multimodal | phi-4-multimodal | 0.08 | 0.32 | | |
| o4-mini | o4-mini | 1.10 | 4.40 | | |
| GPT-4.1 | gpt-4.1 | 2.00 | 8.00 | | |
| Ministral 3B | ministral-3b | 0.04 | 0.04 | | |
| GPT-3.5 Turbo 0301 | gpt-3.5-turbo-0301 | 1.50 | 2.00 | | |
| GPT-4o | gpt-4o | 2.50 | 10.00 | | |
| Phi-3-mini-instruct (128k) | phi-3-mini-128k-instruct | 0.13 | 0.52 | | |
| Llama-3.2-90B-Vision-Instruct | llama-3.2-90b-vision-instruct | 2.04 | 2.04 | | |
| GPT-5-Codex | gpt-5-codex | 1.25 | 10.00 | | |
| GPT-5 Nano | gpt-5-nano | 0.05 | 0.40 | | |
| GPT-5.1 | gpt-5.1 | 1.25 | 10.00 | | |
| o3-mini | o3-mini | 1.10 | 4.40 | | |
| Model Router | model-router | 0.14 | 0.00 | | |
| Kimi K2 Thinking | kimi-k2-thinking | 0.60 | 2.50 | | |
| GPT-5.1 Codex Mini | gpt-5.1-codex-mini | 0.25 | 2.00 | | |
| Llama-3.3-70B-Instruct | llama-3.3-70b-instruct | 0.71 | 0.71 | | |
| o1-preview | o1-preview | 16.50 | 66.00 | | |
| Phi-3.5-mini-instruct | phi-3.5-mini-instruct | 0.13 | 0.52 | | |
| GPT-3.5 Turbo 0613 | gpt-3.5-turbo-0613 | 3.00 | 4.00 | | |
| GPT-4 Turbo | gpt-4-turbo | 10.00 | 30.00 | | |
| Meta-Llama-3.1-70B-Instruct | meta-llama-3.1-70b-instruct | 2.68 | 3.54 | | |
| Phi-3-small-instruct (8k) | phi-3-small-8k-instruct | 0.15 | 0.60 | | |
| DeepSeek-V3-0324 | deepseek-v3-0324 | 1.14 | 4.56 | | |
| Meta-Llama-3-70B-Instruct | meta-llama-3-70b-instruct | 2.68 | 3.54 | | |
| text-embedding-3-large | text-embedding-3-large | 0.13 | 0.00 | | |
| Grok 3 | grok-3 | 3.00 | 15.00 | | |
| GPT-3.5 Turbo 0125 | gpt-3.5-turbo-0125 | 0.50 | 1.50 | | |
| Claude Sonnet 4.5 | claude-sonnet-4-5 | 3.00 | 15.00 | | |
| Phi-4-mini-reasoning | phi-4-mini-reasoning | 0.08 | 0.30 | | |
| Phi-4 | phi-4 | 0.13 | 0.50 | | |
| DeepSeek-V3.1 | deepseek-v3.1 | 0.56 | 1.68 | | |
| GPT-5 Chat | gpt-5-chat | 1.25 | 10.00 | | |
| GPT-4.1 mini | gpt-4.1-mini | 0.40 | 1.60 | | |
| Llama 4 Maverick 17B 128E Instruct FP8 | llama-4-maverick-17b-128e-instruct-fp8 | 0.25 | 1.00 | | |
| Command R+ | cohere-command-r-plus-08-2024 | 2.50 | 10.00 | | |
| Command A | cohere-command-a | 2.50 | 10.00 | | |
| Phi-3-small-instruct (128k) | phi-3-small-128k-instruct | 0.15 | 0.60 | | |
| Claude Opus 4.5 | claude-opus-4-5 | 5.00 | 25.00 | | |
| Mistral Medium 3 | mistral-medium-2505 | 0.40 | 2.00 | | |
| DeepSeek-V3.2-Speciale | deepseek-v3.2-speciale | 0.58 | 1.68 | | |
| Claude Haiku 4.5 | claude-haiku-4-5 | 1.00 | 5.00 | | |
| Phi-3-mini-instruct (4k) | phi-3-mini-4k-instruct | 0.13 | 0.52 | | |
| GPT-5.1 Codex | gpt-5.1-codex | 1.25 | 10.00 | | |
| Grok Code Fast 1 | grok-code-fast-1 | 0.20 | 1.50 | | |
| DeepSeek-R1 | deepseek-r1 | 1.35 | 5.40 | | |
| Meta-Llama-3.1-405B-Instruct | meta-llama-3.1-405b-instruct | 5.33 | 16.00 | | |
| GPT-5.2 Codex | gpt-5.2-codex | 1.75 | 14.00 | | |
| GPT-4 32K | gpt-4-32k | 60.00 | 120.00 | | |
| Phi-4-mini | phi-4-mini | 0.08 | 0.30 | | |
| Embed v3 Multilingual | cohere-embed-v3-multilingual | 0.10 | 0.00 | | |
| Grok 4 | grok-4 | 3.00 | 15.00 | | |
| Command R | cohere-command-r-08-2024 | 0.15 | 0.60 | | |
| Embed v4 | cohere-embed-v-4-0 | 0.12 | 0.00 | | |
| Llama-3.2-11B-Vision-Instruct | llama-3.2-11b-vision-instruct | 0.37 | 0.37 | | |
| GPT-5.2 Chat | gpt-5.2-chat | 1.75 | 14.00 | | |
| Claude Opus 4.1 | claude-opus-4-1 | 15.00 | 75.00 | | |
| GPT-4 | gpt-4 | 60.00 | 120.00 | | |
| Phi-3-medium-instruct (128k) | phi-3-medium-128k-instruct | 0.17 | 0.68 | | |
| Grok 4 Fast (Reasoning) | grok-4-fast-reasoning | 0.20 | 0.50 | | |
| DeepSeek-R1-0528 | deepseek-r1-0528 | 1.35 | 5.40 | | |
| Grok 4 Fast (Non-Reasoning) | grok-4-fast-non-reasoning | 0.20 | 0.50 | | |
| text-embedding-3-small | text-embedding-3-small | 0.02 | 0.00 | | |
| GPT-4.1 nano | gpt-4.1-nano | 0.10 | 0.40 | | |
| GPT-5.1 Codex Max | gpt-5.1-codex-max | 1.25 | 10.00 | | |
| GPT-5.2 | gpt-5.2 | 1.75 | 14.00 | | |
| Claude Opus 4.6 | claude-opus-4-6 | 5.00 | 25.00 | | |
| Kimi K2.5 | kimi-k2.5 | 0.60 | 3.00 | | |

---

[← Back to all providers](/llm.txt)
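
## Estimating Request Cost

The prices in the Models table can be turned into a rough per-request estimate. The sketch below is illustrative only: it assumes the "per 1M" prices are dollars per one million tokens, copies three rows from the table into a hypothetical `PRICES_PER_MILLION` dictionary, and uses a hypothetical helper named `estimate_cost`. None of these names come from an Azure SDK, and actual billing may differ from this simple arithmetic.

```python
# Prices copied from the Models table: (input, output) in dollars,
# assumed to be per 1M tokens. The dictionary and helper are hypothetical.
PRICES_PER_MILLION = {
    "gpt-4o-mini": (0.15, 0.60),
    "gpt-5": (1.25, 10.00),
    "claude-sonnet-4-5": (3.00, 15.00),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return an estimated cost in dollars for one request."""
    input_price, output_price = PRICES_PER_MILLION[model]
    # Scale each token count by its per-million price, then sum.
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    cost = estimate_cost("gpt-4o-mini", input_tokens=12_000, output_tokens=800)
    print(f"Estimated cost: ${cost:.6f}")
```

Embedding models list an output price of 0.00, so for those rows only the input term contributes to the estimate.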