Featherless icon

Featherless

featherless

Updated 44 minutes ago

Featherless AI is a serverless inference platform providing access to hundreds of open-source language models with no infrastructure management required.

Browse 160 LLM models available from Featherless. Compare prices and features.

Models (160)

Organization Model Name Original Model Input Output Free
DeepSeek
DeepSeek DeepSeek-V4-Pro-Max deepseek-ai/DeepSeek-V4-Pro - -
DeepSeek
DeepSeek DeepSeek V4 Flash deepseek-ai/DeepSeek-V4-Flash - -
Moonshot AI
Moonshot AI Kimi K2.6 moonshotai/Kimi-K2.6 - -
Z.ai
Z.ai GLM-5.1 zai-org/GLM-5.1 - -
Arcee AI
Arcee AI Trinity Large Thinking arcee-ai/Trinity-Large-Thinking - -
qwen
qwen Qwen3.6-35B-A3B Qwen/Qwen3.6-35B-A3B - -
Minimax
Minimax MiniMax M2.7 MiniMaxAI/MiniMax-M2.7 - -
google
google Gemma 4 31B google/gemma-4-31B-it - -
google
google Gemma 4 26B-A4B google/gemma-4-26B-A4B-it - -
qwen
qwen Qwen3.5-397B-A17B Qwen/Qwen3.5-397B-A17B - -
qwen
qwen Qwen3.5-27B Qwen/Qwen3.5-27B - -
Minimax
Minimax MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 - -
qwen
qwen Qwen3.5-9B Qwen/Qwen3.5-9B - -
Z.ai
Z.ai GLM-5 zai-org/GLM-5 - -
StepFun
StepFun Step-3.5-Flash stepfun-ai/Step-3.5-Flash - -
Moonshot AI
Moonshot AI Kimi K2.5 moonshotai/Kimi-K2.5 - -
qwen
qwen Qwen3.5-4B Qwen/Qwen3.5-4B - -
qwen
qwen Qwen3 Coder Next Qwen/Qwen3-Coder-Next - -
Z.ai
Z.ai GLM-4.7 zai-org/GLM-4.7 - -
Minimax
Minimax MiniMax M2.1 MiniMaxAI/MiniMax-M2.1 - -
Xiaomi
Xiaomi MiMo-V2-Flash XiaomiMiMo/MiMo-V2-Flash - -
google
google Gemma 4 E4B google/gemma-4-E4B-it - -
DeepSeek
DeepSeek DeepSeek-V3.2-Speciale deepseek-ai/DeepSeek-V3.2-Speciale - -
DeepSeek
DeepSeek DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 - -
Z.ai
Z.ai GLM-4.7-Flash zai-org/GLM-4.7-Flash - -
qwen
qwen Qwen3.5-2B Qwen/Qwen3.5-2B - -
Z.ai
Z.ai GLM-4.6 zai-org/GLM-4.6 - -
Minimax
Minimax MiniMax M2 MiniMaxAI/MiniMax-M2 - -
google
google Gemma 4 E2B google/gemma-4-E2B-it - -
qwen
qwen Qwen3 VL 235B A22B Qwen/Qwen3-VL-235B-A22B-Thinking - -
OpenAI
OpenAI GPT OSS 120B openai/gpt-oss-120b - -
qwen
qwen Qwen3 VL 30B A3B Instruct Qwen/Qwen3-VL-30B-A3B-Instruct - -
Moonshot AI
Moonshot AI Kimi K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 - -
qwen
qwen Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct - -
OpenAI
OpenAI GPT OSS 20B openai/gpt-oss-20b - -
Moonshot AI
Moonshot AI Kimi K2 Instruct moonshotai/Kimi-K2-Instruct - -
DeepSeek
DeepSeek DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 - -
Mistral
Mistral Devstral Small 1.1 mistralai/Devstral-Small-2507 - -
qwen
qwen Qwen3 32B Qwen/Qwen3-32B - -
qwen
qwen Qwen3 32B unsloth/Qwen3-32B - -
Mistral
Mistral Magistral Small 2506 mistralai/Magistral-Small-2506 - -
qwen
qwen Qwen3 235B A22B Qwen/Qwen3-235B-A22B - -
Microsoft Phi 4 Mini Reasoning microsoft/Phi-4-mini-reasoning - -
DeepSeek
DeepSeek DeepSeek-V3 0324 deepseek-ai/DeepSeek-V3-0324 - -
Mistral
Mistral Mistral Small 3.2 24B Instruct mistralai/Mistral-Small-3.2-24B-Instruct-2506 - -
Nvidia
Nvidia Llama 3.1 Nemotron Nano 8B V1 nvidia/Llama-3.1-Nemotron-Nano-8B-v1 - -
qwen
qwen Qwen3-Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct - -
qwen
qwen QwQ-32B Qwen/QwQ-32B - -
DeepSeek
DeepSeek DeepSeek-V3.1 deepseek-ai/DeepSeek-V3.1 - -
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 70B deepseek-ai/DeepSeek-R1-Distill-Llama-70B - -
Mistral
Mistral Mistral Small 3.1 24B Instruct mistralai/Mistral-Small-3.1-24B-Instruct-2503 - -
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 32B deepseek-ai/DeepSeek-R1-Distill-Qwen-32B - -
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 14B deepseek-ai/DeepSeek-R1-Distill-Qwen-14B - -
google
google Gemma 3 27B google/gemma-3-27b-it - -
Mistral
Mistral Mistral Small 3.1 24B Base mistralai/Mistral-Small-3.1-24B-Base-2503 - -
google
google Gemma 3 12B google/gemma-3-12b-it - -
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 7B deepseek-ai/DeepSeek-R1-Distill-Qwen-7B - -
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 8B deepseek-ai/DeepSeek-R1-Distill-Llama-8B - -
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 8B unsloth/DeepSeek-R1-Distill-Llama-8B - -
qwen
qwen QwQ-32B-Preview Qwen/QwQ-32B-Preview - -
Mistral
Mistral Mistral Small 3 24B Instruct mistralai/Mistral-Small-24B-Instruct-2501 - -
google
google Gemma 3 4B google/gemma-3-4b-it - -
google
google Gemma 3 4B unsloth/gemma-3-4b-it - -
Meta
Meta Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct - -
Meta
Meta Llama 3.3 70B Instruct unsloth/Llama-3.3-70B-Instruct - -
qwen
qwen Qwen2.5 7B Instruct Qwen/Qwen2.5-7B-Instruct - -
qwen
qwen Qwen2.5 7B Instruct unsloth/Qwen2.5-7B-Instruct - -
Mistral
Mistral Mistral Small 3 24B Base mistralai/Mistral-Small-24B-Base-2501 - -
google
google Gemma 3 1B google/gemma-3-1b-it - -
google
google Gemma 3 1B unsloth/gemma-3-1b-it - -
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 1.5B deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B - -
qwen
qwen Qwen2.5 Instruct 32B Qwen/Qwen2.5-32B-Instruct - -
qwen
qwen Qwen2.5 72B Instruct Qwen/Qwen2.5-72B-Instruct - -
qwen
qwen Qwen2.5 14B Instruct Qwen/Qwen2.5-14B-Instruct - -
Meta
Meta Llama 3.1 70B Instruct meta-llama/Meta-Llama-3.1-70B-Instruct - -
Meta
Meta Llama 3.1 70B Instruct meta-llama/Llama-3.1-70B-Instruct - -
Meta
Meta Llama 3.2 3B Instruct meta-llama/Llama-3.2-3B-Instruct - -
Meta
Meta Llama 3.2 3B Instruct unsloth/Llama-3.2-3B-Instruct - -
qwen
qwen Qwen2 72B Instruct Qwen/Qwen2-72B-Instruct - -
Microsoft Phi-3.5-mini-instruct microsoft/Phi-3.5-mini-instruct - -
Microsoft Phi-3.5-mini-instruct unsloth/Phi-3.5-mini-instruct - -
Meta
Meta Llama 3.1 8B Instruct meta-llama/Meta-Llama-3.1-8B-Instruct - -
Meta
Meta Llama 3.1 8B Instruct meta-llama/Llama-3.1-8B-Instruct - -
Meta
Meta Llama 3.1 8B Instruct unsloth/Llama-3.1-8B-Instruct - -
Meta
Meta Llama 3.1 8B Instruct NousResearch/Meta-Llama-3.1-8B-Instruct - -
Meta
Meta Llama 3.1 8B Instruct unsloth/Meta-Llama-3.1-8B-Instruct - -
qwen
qwen Qwen2 7B Instruct Qwen/Qwen2-7B-Instruct - -
Moonshot AI
Moonshot AI Kimi K2 Thinking moonshotai/Kimi-K2-Thinking - -
qwen
qwen Qwen3 Embedding 4B Qwen/Qwen3-Embedding-4B - -
qwen
qwen Qwen3 Embedding 8B Qwen/Qwen3-Embedding-8B - -
DeepSeek
DeepSeek DeepSeek V3.1 Terminus deepseek-ai/DeepSeek-V3.1-Terminus - -
Alibaba
Alibaba Qwen3 Coder 30B A3B Qwen/Qwen3-Coder-30B-A3B-Instruct - -
Alibaba
Alibaba qwen3-30b-a3b-instruct-2507 Qwen/Qwen3-30B-A3B-Instruct-2507 - -
Mistral
Mistral Devstral Small mistralai/Devstral-Small-2505 - -
google
google MedGemma 4B IT google/medgemma-4b-it - -
qwen
qwen Qwen3 4B Qwen/Qwen3-4B - -
qwen
qwen Qwen3 4B unsloth/Qwen3-4B - -
Alibaba
Alibaba Qwen3 14B Qwen/Qwen3-14B - -
Alibaba
Alibaba Qwen3 14B unsloth/Qwen3-14B - -
Alibaba
Alibaba Qwen3 8B Qwen/Qwen3-8B - -
Alibaba
Alibaba Qwen3 8B unsloth/Qwen3-8B - -
Groq
Groq Llama Guard 3 8B meta-llama/Llama-Guard-3-8B - -
Meta
Meta llama-3.2-1b-instruct meta-llama/Llama-3.2-1B-Instruct - -
Meta
Meta llama-3.2-1b-instruct unsloth/Llama-3.2-1B-Instruct - -
qwen
qwen Qwen2.5 Coder 7B Qwen/Qwen2.5-Coder-7B-Instruct - -
qwen
qwen Qwen2.5-Coder 32B Instruct Qwen/Qwen2.5-Coder-32B-Instruct - -
NousResearch
NousResearch Hermes 3 70B Instruct NousResearch/Hermes-3-Llama-3.1-70B - -
Meta
Meta Llama 3.1 405B (base) meta-llama/Llama-3.1-405B - -
Mistral
Mistral Mistral NeMo Instruct mistralai/Mistral-Nemo-Instruct-2407 - -
google
google Gemma 2 27B google/gemma-2-27b-it - -
google
google Gemma 2 9B google/gemma-2-9b-it - -
NousResearch
NousResearch Hermes 2 Pro - Llama-3 8B NousResearch/Hermes-2-Pro-Llama-3-8B - -
Mistral
Mistral Mistral 7B Instruct v0.3 mistralai/Mistral-7B-Instruct-v0.3 - -
Meta
Meta llama-3-70b-instruct meta-llama/Meta-Llama-3-70B-Instruct - -
Meta
Meta llama-3-8b-instruct meta-llama/Meta-Llama-3-8B-Instruct - -
Meta
Meta llama-3-8b-instruct NousResearch/Meta-Llama-3-8B-Instruct - -
Meta
Meta llama-3-8b-instruct unsloth/llama-3-8b-Instruct - -
Microsoft WizardLM-2 8x22B alpindale/WizardLM-2-8x22B - -
Mistral
Mistral mistral-7b-instruct-v0.2 mistralai/Mistral-7B-Instruct-v0.2 - -
Mistral
Mistral Mistral 7B Instruct v0.1 mistralai/Mistral-7B-Instruct-v0.1 - -
DeepSeek
DeepSeek DeepSeek R1 0528 Qwen3 8B deepseek-ai/DeepSeek-R1-0528-Qwen3-8B - -
google
google Gemma 4 26B A4B google/gemma-4-26B-A4B - -
google
google Gemma 4 31B google/gemma-4-31B - -
google
google Gemma 4 E2B google/gemma-4-E2B - -
google
google Gemma 4 E4B google/gemma-4-E4B - -
google
google gemma-1.1-2b-it google/gemma-1.1-2b-it - -
google
google gemma-2-2b-it google/gemma-2-2b-it - -
google
google gemma-2b-it google/gemma-2b-it - -
google
google gemma-7b-it google/gemma-7b-it - -
Moonshot AI
Moonshot AI Kimi Linear 48B A3B Instruct moonshotai/Kimi-Linear-48B-A3B-Instruct - -
Meta
Meta Llama 3 70B meta-llama/Meta-Llama-3-70B - -
Meta
Meta Llama 3 8B meta-llama/Meta-Llama-3-8B - -
Meta
Meta Llama 3 8B NousResearch/Meta-Llama-3-8B - -
Meta
Meta Llama 3 8B unsloth/llama-3-8b - -
Cerebras
Cerebras Llama 3.1 8B meta-llama/Llama-3.1-8B - -
Meta
Meta llama-13b huggyllama/llama-13b - -
Nvidia
Nvidia Llama-3.1-Nemotron-70B-Instruct-HF nvidia/Llama-3.1-Nemotron-70B-Instruct-HF - -
Allen Institute for AI
Allen Institute for AI llama-3.1-tulu-3-8b allenai/Llama-3.1-Tulu-3-8B - -
Meta
Meta Llama-3.3-8B-Instruct allura-forge/Llama-3.3-8B-Instruct - -
Nvidia
Nvidia Llama3 Chatqa 1.5 70b nvidia/Llama3-ChatQA-1.5-70B - -
Mistral
Mistral Magistral Small 1.2 mistralai/Magistral-Small-2509 - -
SiliconFlow
SiliconFlow moonshotai/Kimi-Dev-72B moonshotai/Kimi-Dev-72B - -
NousResearch
NousResearch Nous: DeepHermes 3 Llama 3 8B Preview (free) NousResearch/DeepHermes-3-Llama-3-8B-Preview - -
NousResearch
NousResearch openhermes-2.5-mistral-7b teknium/OpenHermes-2.5-Mistral-7B - -
WandB
WandB OpenPipe Qwen3 14B Instruct OpenPipe/Qwen3-14B-Instruct - -
Microsoft phi-3-mini-128k-instruct microsoft/Phi-3-mini-128k-instruct - -
Microsoft phi-3-mini-4k-instruct microsoft/Phi-3-mini-4k-instruct - -
Microsoft phi-3-mini-4k-instruct unsloth/Phi-3-mini-4k-instruct - -
Nvidia
Nvidia Phi-4-Mini microsoft/Phi-4-mini-instruct - -
Alibaba
Alibaba qwen1.5-14b-chat Qwen/Qwen1.5-14B-Chat - -
Alibaba
Alibaba qwen1.5-32b-chat Qwen/Qwen1.5-32B-Chat - -
Alibaba
Alibaba qwen1.5-4b-chat Qwen/Qwen1.5-4B-Chat - -
Alibaba
Alibaba qwen1.5-72b-chat Qwen/Qwen1.5-72B-Chat - -
Alibaba
Alibaba qwen1.5-7b-chat Qwen/Qwen1.5-7B-Chat - -
qwen
qwen Qwen3 0.6B Qwen/Qwen3-0.6B - -
qwen
qwen Qwen3 0.6B unsloth/Qwen3-0.6B - -
qwen
qwen Qwen3 1.7B Qwen/Qwen3-1.7B - -
qwen
qwen Qwen3 1.7B unsloth/Qwen3-1.7B - -
qwen
qwen Qwen3 Embedding 0.6B Qwen/Qwen3-Embedding-0.6B - -
Upstage
Upstage SOLAR-10.7B-Instruct-v1.0 upstage/SOLAR-10.7B-Instruct-v1.0 - -