Featherless icon

Featherless

featherless

Updated 12 minutes ago

Featherless AI is a serverless inference platform providing access to hundreds of open-source language models with no infrastructure management required.

Browse 211 LLM models available from Featherless. Compare prices and features.

Models (211)

Organization Model Name Original Model Input Output Free
Z.ai
Z.ai GLM-5.2 zai-org/GLM-5.2 - -
Minimax
Minimax MiniMax M3 MiniMaxAI/MiniMax-M3 - -
Moonshot AI
Moonshot AI Kimi K2.7 Code moonshotai/Kimi-K2.7-Code - -
DeepSeek
DeepSeek DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro - -
Xiaomi
Xiaomi MiMo-V2.5 XiaomiMiMo/MiMo-V2.5 - -
DeepSeek
DeepSeek DeepSeek V4 Flash deepseek-ai/DeepSeek-V4-Flash - -
Moonshot AI
Moonshot AI Kimi K2.6 moonshotai/Kimi-K2.6 - -
Alibaba
Alibaba Qwen3.6 27B Qwen/Qwen3.6-27B - -
Alibaba
Alibaba Qwen3.6 27B unsloth/Qwen3.6-27B - -
Z.ai
Z.ai GLM-5.1 zai-org/GLM-5.1 - -
qwen
qwen Qwen3.6 35B A3B Qwen/Qwen3.6-35B-A3B - -
qwen
qwen Qwen3.6 35B A3B unsloth/Qwen3.6-35B-A3B - -
google
google Gemma 4 31B google/gemma-4-31B-it - -
google
google Gemma 4 31B unsloth/gemma-4-31B-it - -
Minimax
Minimax MiniMax M2.7 MiniMaxAI/MiniMax-M2.7 - -
Arcee AI
Arcee AI Trinity Large Thinking arcee-ai/Trinity-Large-Thinking - -
google
google Gemma 4 26B-A4B google/gemma-4-26B-A4B-it - -
google
google Gemma 4 26B-A4B unsloth/gemma-4-26B-A4B-it - -
Minimax
Minimax MiniMax M2.5 MiniMaxAI/MiniMax-M2.5 - -
qwen
qwen Qwen3.5-397B-A17B Qwen/Qwen3.5-397B-A17B - -
Z.ai
Z.ai GLM-5 zai-org/GLM-5 - -
qwen
qwen Qwen3.5-27B Qwen/Qwen3.5-27B - -
StepFun
StepFun Step-3.5-Flash stepfun-ai/Step-3.5-Flash - -
qwen
qwen Qwen3.5 9B Qwen/Qwen3.5-9B - -
qwen
qwen Qwen3.5 9B unsloth/Qwen3.5-9B - -
Moonshot AI
Moonshot AI Kimi K2.5 moonshotai/Kimi-K2.5 - -
qwen
qwen Qwen3.5 4B Qwen/Qwen3.5-4B - -
qwen
qwen Qwen3.5 4B unsloth/Qwen3.5-4B - -
Z.ai
Z.ai GLM-4.7 zai-org/GLM-4.7 - -
Xiaomi
Xiaomi MiMo-V2-Flash XiaomiMiMo/MiMo-V2-Flash - -
Minimax
Minimax MiniMax M2.1 MiniMaxAI/MiniMax-M2.1 - -
DeepSeek
DeepSeek DeepSeek-V3.2-Speciale deepseek-ai/DeepSeek-V3.2-Speciale - -
DeepSeek
DeepSeek DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 - -
google
google Gemma 4 E4B google/gemma-4-E4B-it - -
Z.ai
Z.ai GLM-4.7-Flash zai-org/GLM-4.7-Flash - -
Nvidia
Nvidia Nemotron 3 Nano (30B A3B) unsloth/Nemotron-3-Nano-30B-A3B - -
Minimax
Minimax MiniMax M2 MiniMaxAI/MiniMax-M2 - -
Z.ai
Z.ai GLM-4.6 zai-org/GLM-4.6 - -
qwen
qwen Qwen3.5 2B Qwen/Qwen3.5-2B - -
google
google Gemma 4 E2B google/gemma-4-E2B-it - -
google
google Gemma 4 E2B unsloth/gemma-4-E2B-it - -
qwen
qwen Qwen3 VL 235B A22B Qwen/Qwen3-VL-235B-A22B-Thinking - -
qwen
qwen Qwen3 VL 32B Thinking Qwen/Qwen3-VL-32B-Thinking - -
qwen
qwen Qwen3 VL 8B Thinking Qwen/Qwen3-VL-8B-Thinking - -
qwen
qwen Qwen3 VL 30B A3B Instruct Qwen/Qwen3-VL-30B-A3B-Instruct - -
qwen
qwen Qwen3 VL 4B Instruct Qwen/Qwen3-VL-4B-Instruct - -
qwen
qwen Qwen3 VL 32B Qwen/Qwen3-VL-32B-Instruct - -
qwen
qwen Qwen3 VL 8B Qwen/Qwen3-VL-8B-Instruct - -
qwen
qwen Qwen3 VL 4B Thinking Qwen/Qwen3-VL-4B-Thinking - -
qwen
qwen Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 - -
Moonshot AI
Moonshot AI Kimi K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 - -
qwen
qwen Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct - -
OpenAI
OpenAI GPT OSS 120B openai/gpt-oss-120b - -
Moonshot AI
Moonshot AI Kimi K2 Instruct moonshotai/Kimi-K2-Instruct - -
DeepSeek
DeepSeek DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 - -
OpenAI
OpenAI GPT OSS 20B openai/gpt-oss-20b - -
Mistral
Mistral Devstral Small 1.1 mistralai/Devstral-Small-2507 - -
qwen
qwen Qwen3 32B Qwen/Qwen3-32B - -
microsoft Phi 4 Reasoning Plus microsoft/Phi-4-reasoning-plus - -
Mistral
Mistral Magistral Small 2506 mistralai/Magistral-Small-2506 - -
microsoft Phi 4 Reasoning microsoft/Phi-4-reasoning - -
qwen
qwen Qwen3 235B A22B Qwen/Qwen3-235B-A22B - -
Nvidia
Nvidia Llama-3.3 Nemotron Super 49B v1 nvidia/Llama-3_3-Nemotron-Super-49B-v1 - -
microsoft Phi 4 Mini Reasoning microsoft/Phi-4-mini-reasoning - -
qwen
qwen Qwen3-Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct - -
DeepSeek
DeepSeek DeepSeek-V3 0324 deepseek-ai/DeepSeek-V3-0324 - -
Mistral
Mistral Mistral Small 3.2 24B Instruct mistralai/Mistral-Small-3.2-24B-Instruct-2506 - -
Nvidia
Nvidia Llama 3.1 Nemotron Nano 8B V1 nvidia/Llama-3.1-Nemotron-Nano-8B-v1 - -
DeepSeek
DeepSeek DeepSeek-V3.1 deepseek-ai/DeepSeek-V3.1 - -
qwen
qwen QwQ-32B Qwen/QwQ-32B - -
Mistral
Mistral Mistral Small 3.1 24B Instruct mistralai/Mistral-Small-3.1-24B-Instruct-2503 - -
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 70B deepseek-ai/DeepSeek-R1-Distill-Llama-70B - -
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 32B deepseek-ai/DeepSeek-R1-Distill-Qwen-32B - -
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 14B deepseek-ai/DeepSeek-R1-Distill-Qwen-14B - -
google
google Gemma 3 27B google/gemma-3-27b-it - -
Mistral
Mistral Mistral Small 3.1 24B Base mistralai/Mistral-Small-3.1-24B-Base-2503 - -
google
google Gemma 3 12B google/gemma-3-12b-it - -
google
google Gemma 3 12B unsloth/gemma-3-12b-it - -
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 7B deepseek-ai/DeepSeek-R1-Distill-Qwen-7B - -
qwen
qwen QwQ-32B-Preview Qwen/QwQ-32B-Preview - -
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 8B deepseek-ai/DeepSeek-R1-Distill-Llama-8B - -
DeepSeek
DeepSeek DeepSeek R1 Distill Llama 8B unsloth/DeepSeek-R1-Distill-Llama-8B - -
microsoft Phi 4 microsoft/phi-4 - -
Mistral
Mistral Mistral Small 3 24B Instruct mistralai/Mistral-Small-24B-Instruct-2501 - -
google
google Gemma 3 4B google/gemma-3-4b-it - -
google
google Gemma 3 4B unsloth/gemma-3-4b-it - -
Meta
Meta Llama 3.3 70B Instruct meta-llama/Llama-3.3-70B-Instruct - -
Meta
Meta Llama 3.3 70B Instruct unsloth/Llama-3.3-70B-Instruct - -
google
google Gemma 3 1B google/gemma-3-1b-it - -
google
google Gemma 3 1B unsloth/gemma-3-1b-it - -
Mistral
Mistral Mistral Small 3 24B Base mistralai/Mistral-Small-24B-Base-2501 - -
qwen
qwen Qwen2.5 Instruct 32B Qwen/Qwen2.5-32B-Instruct - -
Meta
Meta Llama 3.1 70B Instruct meta-llama/Meta-Llama-3.1-70B-Instruct - -
Meta
Meta Llama 3.1 70B Instruct meta-llama/Llama-3.1-70B-Instruct - -
qwen
qwen Qwen2.5 72B Instruct Qwen/Qwen2.5-72B-Instruct - -
DeepSeek
DeepSeek DeepSeek R1 Distill Qwen 1.5B deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B - -
qwen
qwen Qwen2.5 14B Instruct Qwen/Qwen2.5-14B-Instruct - -
qwen
qwen Qwen2.5 14B Instruct unsloth/Qwen2.5-14B-Instruct - -
Meta
Meta Llama 3.2 3B Instruct meta-llama/Llama-3.2-3B-Instruct - -
Meta
Meta Llama 3.2 3B Instruct unsloth/Llama-3.2-3B-Instruct - -
qwen
qwen Qwen2.5 7B Instruct Qwen/Qwen2.5-7B-Instruct - -
qwen
qwen Qwen2.5 7B Instruct unsloth/Qwen2.5-7B-Instruct - -
qwen
qwen Qwen2 72B Instruct Qwen/Qwen2-72B-Instruct - -
microsoft Phi-3.5-MoE-instruct microsoft/Phi-3.5-MoE-instruct - -
microsoft Phi-3.5-mini-instruct microsoft/Phi-3.5-mini-instruct - -
microsoft Phi-3.5-mini-instruct unsloth/Phi-3.5-mini-instruct - -
Meta
Meta Llama 3.1 8B Instruct meta-llama/Meta-Llama-3.1-8B-Instruct - -
Meta
Meta Llama 3.1 8B Instruct meta-llama/Llama-3.1-8B-Instruct - -
Meta
Meta Llama 3.1 8B Instruct unsloth/Llama-3.1-8B-Instruct - -
Meta
Meta Llama 3.1 8B Instruct NousResearch/Meta-Llama-3.1-8B-Instruct - -
Meta
Meta Llama 3.1 8B Instruct unsloth/Meta-Llama-3.1-8B-Instruct - -
qwen
qwen Qwen2 7B Instruct Qwen/Qwen2-7B-Instruct - -
StepFun
StepFun Step 3.7 Flash stepfun-ai/Step-3.7-Flash - -
qwen
qwen Qwen3 Coder Next Qwen/Qwen3-Coder-Next - -
Liquid AI
Liquid AI LFM2.5-1.2B-Instruct LiquidAI/LFM2.5-1.2B-Instruct - -
Liquid AI
Liquid AI LFM2.5-1.2B-Thinking LiquidAI/LFM2.5-1.2B-Thinking - -
Moonshot AI
Moonshot AI Kimi K2 Thinking moonshotai/Kimi-K2-Thinking - -
OpenAI
OpenAI gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b - -
qwen
qwen Qwen3 Embedding 4B Qwen/Qwen3-Embedding-4B - -
qwen
qwen Qwen3 Embedding 8B Qwen/Qwen3-Embedding-8B - -
Nvidia
Nvidia Phi-4-Mini microsoft/Phi-4-mini-instruct - -
Nvidia
Nvidia Llama 3.3 Nemotron Super 49b V1.5 nvidia/Llama-3_3-Nemotron-Super-49B-v1_5 - -
Baidu
Baidu ERNIE 4.5 21B A3B Thinking baidu/ERNIE-4.5-21B-A3B-Thinking - -
DeepSeek
DeepSeek DeepSeek V3.1 Terminus deepseek-ai/DeepSeek-V3.1-Terminus - -
Alibaba
Alibaba Qwen3 Coder 30B A3B Qwen/Qwen3-Coder-30B-A3B-Instruct - -
Alibaba
Alibaba qwen3-30b-a3b-instruct-2507 Qwen/Qwen3-30B-A3B-Instruct-2507 - -
ByteDance Seed
ByteDance Seed UI-TARS 7B ByteDance-Seed/UI-TARS-1.5-7B - -
Mistral
Mistral Devstral Small mistralai/Devstral-Small-2505 - -
google
google MedGemma 4B IT google/medgemma-4b-it - -
NousResearch
NousResearch DeepHermes 3 Mistral 24B Preview NousResearch/DeepHermes-3-Mistral-24B-Preview - -
qwen
qwen Qwen3 4B Qwen/Qwen3-4B - -
qwen
qwen Qwen3 4B unsloth/Qwen3-4B - -
Alibaba
Alibaba Qwen3 14B Qwen/Qwen3-14B - -
Alibaba
Alibaba Qwen3 14B unsloth/Qwen3-14B - -
Alibaba
Alibaba Qwen3 8B Qwen/Qwen3-8B - -
Alibaba
Alibaba Qwen3 8B unsloth/Qwen3-8B - -
IBM Granite 3.3 8B Instruct ibm-granite/granite-3.3-8b-instruct - -
Alibaba
Alibaba qwen2.5-vl-32b-instruct Qwen/Qwen2.5-VL-32B-Instruct - -
Groq
Groq Llama Guard 3 8B meta-llama/Llama-Guard-3-8B - -
Alibaba
Alibaba qwen2.5-vl-72b-instruct Qwen/Qwen2.5-VL-72B-Instruct - -
Meta
Meta llama-3.2-1b-instruct meta-llama/Llama-3.2-1B-Instruct - -
Meta
Meta llama-3.2-1b-instruct unsloth/Llama-3.2-1B-Instruct - -
qwen
qwen Qwen2.5 Coder 7B Qwen/Qwen2.5-Coder-7B-Instruct - -
qwen
qwen Qwen2.5-Coder 32B Instruct Qwen/Qwen2.5-Coder-32B-Instruct - -
Alibaba
Alibaba Qwen2.5-VL 7B Instruct Qwen/Qwen2.5-VL-7B-Instruct - -
microsoft Phi-3.5-vision-instruct microsoft/Phi-3.5-vision-instruct - -
NousResearch
NousResearch Hermes 3 70B Instruct NousResearch/Hermes-3-Llama-3.1-70B - -
Mistral
Mistral Mistral NeMo Instruct mistralai/Mistral-Nemo-Instruct-2407 - -
google
google Gemma 2 27B google/gemma-2-27b-it - -
google
google Gemma 2 9B google/gemma-2-9b-it - -
google
google Gemma 2 9B unsloth/gemma-2-9b-it - -
NousResearch
NousResearch Hermes 2 Pro - Llama-3 8B NousResearch/Hermes-2-Pro-Llama-3-8B - -
Mistral
Mistral Mistral 7B Instruct v0.3 mistralai/Mistral-7B-Instruct-v0.3 - -
Meta
Meta llama-3-70b-instruct meta-llama/Meta-Llama-3-70B-Instruct - -
Meta
Meta llama-3-8b-instruct meta-llama/Meta-Llama-3-8B-Instruct - -
Meta
Meta llama-3-8b-instruct NousResearch/Meta-Llama-3-8B-Instruct - -
Meta
Meta llama-3-8b-instruct unsloth/llama-3-8b-Instruct - -
microsoft WizardLM-2 8x22B alpindale/WizardLM-2-8x22B - -
Mistral
Mistral mistral-7b-instruct-v0.2 mistralai/Mistral-7B-Instruct-v0.2 - -
Mistral
Mistral Mistral 7B Instruct v0.1 mistralai/Mistral-7B-Instruct-v0.1 - -
DeepSeek
DeepSeek DeepSeek R1 0528 Qwen3 8B deepseek-ai/DeepSeek-R1-0528-Qwen3-8B - -
Friendli
Friendli EXAONE 4.0.1 32B LGAI-EXAONE/EXAONE-4.0.1-32B - -
google
google Gemma 4 26B A4B google/gemma-4-26B-A4B - -
google
google Gemma 4 31B google/gemma-4-31B - -
google
google Gemma 4 E2B google/gemma-4-E2B - -
google
google Gemma 4 E4B google/gemma-4-E4B - -
google
google gemma-1.1-2b-it google/gemma-1.1-2b-it - -
google
google gemma-2-2b-it google/gemma-2-2b-it - -
google
google gemma-2b-it google/gemma-2b-it - -
google
google gemma-7b-it google/gemma-7b-it - -
IBM granite-3.0-2b-instruct ibm-granite/granite-3.0-2b-instruct - -
IBM granite-3.0-8b-instruct ibm-granite/granite-3.0-8b-instruct - -
IBM granite-3.1-2b-instruct ibm-granite/granite-3.1-2b-instruct - -
IBM granite-3.1-8b-instruct ibm-granite/granite-3.1-8b-instruct - -
Moonshot AI
Moonshot AI Kimi Linear 48B A3B Instruct moonshotai/Kimi-Linear-48B-A3B-Instruct - -
Liquid AI
Liquid AI LFM2 1.2B LiquidAI/LFM2-1.2B - -
Meta
Meta Llama 3 70B meta-llama/Meta-Llama-3-70B - -
Meta
Meta Llama 3 8B meta-llama/Meta-Llama-3-8B - -
Meta
Meta Llama 3 8B NousResearch/Meta-Llama-3-8B - -
Meta
Meta Llama 3 8B unsloth/llama-3-8b - -
Cerebras
Cerebras Llama 3.1 8B meta-llama/Llama-3.1-8B - -
Meta
Meta llama-13b huggyllama/llama-13b - -
Nvidia
Nvidia Llama-3.1-Nemotron-70B-Instruct-HF nvidia/Llama-3.1-Nemotron-70B-Instruct-HF - -
Allen Institute for AI
Allen Institute for AI llama-3.1-tulu-3-8b allenai/Llama-3.1-Tulu-3-8B - -
Meta
Meta Llama-3.3-8B-Instruct allura-forge/Llama-3.3-8B-Instruct - -
Nvidia
Nvidia Llama3 Chatqa 1.5 70b nvidia/Llama3-ChatQA-1.5-70B - -
Mistral
Mistral Magistral Small 1.2 mistralai/Magistral-Small-2509 - -
SiliconFlow
SiliconFlow moonshotai/Kimi-Dev-72B moonshotai/Kimi-Dev-72B - -
Nvidia
Nvidia Nemotron Cascade 2 30B A3B nvidia/Nemotron-Cascade-2-30B-A3B - -
NousResearch
NousResearch Nous: DeepHermes 3 Llama 3 8B Preview (free) NousResearch/DeepHermes-3-Llama-3-8B-Preview - -
Nvidia
Nvidia nvidia-nemotron-3-nano-30b-a3b-bf16 nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 - -
NousResearch
NousResearch openhermes-2.5-mistral-7b teknium/OpenHermes-2.5-Mistral-7B - -
WandB
WandB OpenPipe Qwen3 14B Instruct OpenPipe/Qwen3-14B-Instruct - -
microsoft phi-3-medium-4k-instruct microsoft/Phi-3-medium-4k-instruct - -
microsoft phi-3-mini-128k-instruct microsoft/Phi-3-mini-128k-instruct - -
microsoft phi-3-mini-4k-instruct microsoft/Phi-3-mini-4k-instruct - -
microsoft phi-3-mini-4k-instruct unsloth/Phi-3-mini-4k-instruct - -
microsoft phi-3-vision-128k-instruct microsoft/Phi-3-vision-128k-instruct - -
qwen
qwen Qwen 1.5 7B Chat Qwen/Qwen-7B-Chat - -
Alibaba
Alibaba qwen1.5-14b-chat Qwen/Qwen1.5-14B-Chat - -
Alibaba
Alibaba qwen1.5-32b-chat Qwen/Qwen1.5-32B-Chat - -
Alibaba
Alibaba qwen1.5-4b-chat Qwen/Qwen1.5-4B-Chat - -
Alibaba
Alibaba qwen1.5-72b-chat Qwen/Qwen1.5-72B-Chat - -
Alibaba
Alibaba qwen1.5-7b-chat Qwen/Qwen1.5-7B-Chat - -
qwen
qwen Qwen2.5 VL 3B Instruct Qwen/Qwen2.5-VL-3B-Instruct - -
qwen
qwen Qwen3 0.6B Qwen/Qwen3-0.6B - -
qwen
qwen Qwen3 0.6B unsloth/Qwen3-0.6B - -
qwen
qwen Qwen3 1.7B Qwen/Qwen3-1.7B - -
qwen
qwen Qwen3 1.7B unsloth/Qwen3-1.7B - -
qwen
qwen Qwen3 Embedding 0.6B Qwen/Qwen3-Embedding-0.6B - -
Upstage
Upstage SOLAR-10.7B-Instruct-v1.0 upstage/SOLAR-10.7B-Instruct-v1.0 - -