Featherless

featherless

Updated 1 hour ago

Featherless AI is a serverless inference platform providing access to hundreds of open-source language models with no infrastructure management required.

Visit Website LLMs.txt

Browse 226 LLM models available from Featherless. Compare prices and features.

Models (226)

Organization	Model Name	Original Model	Input	Output
Moonshot AI	Kimi K3	`moonshotai/Kimi-K3`	-	-	View
Z.ai	GLM-5.2	`zai-org/GLM-5.2`	-	-	View
DeepSeek	DeepSeek-V4-Pro-Max	`deepseek-ai/DeepSeek-V4-Pro`	-	-	View
Minimax	MiniMax M3	`MiniMaxAI/MiniMax-M3`	-	-	View
DeepSeek	DeepSeek V4 Flash	`deepseek-ai/DeepSeek-V4-Flash`	-	-	View
Xiaomi	MiMo-V2.5	`XiaomiMiMo/MiMo-V2.5`	-	-	View
Moonshot AI	Kimi K2.7 Code	`moonshotai/Kimi-K2.7-Code`	-	-	View
Moonshot AI	Kimi K2.6	`moonshotai/Kimi-K2.6`	-	-	View
Alibaba	Qwen3.6 27B	`Qwen/Qwen3.6-27B`	-	-	View
Alibaba	Qwen3.6 27B	`unsloth/Qwen3.6-27B`	-	-	View
Z.ai	GLM-5.1	`zai-org/GLM-5.1`	-	-	View
qwen	Qwen3.6 35B A3B	`Qwen/Qwen3.6-35B-A3B`	-	-	View
qwen	Qwen3.6 35B A3B	`unsloth/Qwen3.6-35B-A3B`	-	-	View
google	Gemma 4 31B	`google/gemma-4-31B-it`	-	-	View
google	Gemma 4 31B	`unsloth/gemma-4-31B-it`	-	-	View
google	Gemma 4 26B-A4B	`google/gemma-4-26B-A4B-it`	-	-	View
google	Gemma 4 26B-A4B	`unsloth/gemma-4-26B-A4B-it`	-	-	View
google	Gemma 4 12B	`unsloth/gemma-4-12b-it`	-	-	View
Minimax	MiniMax M2.7	`MiniMaxAI/MiniMax-M2.7`	-	-	View
Arcee AI	Trinity Large Thinking	`arcee-ai/Trinity-Large-Thinking`	-	-	View
Minimax	MiniMax M2.5	`MiniMaxAI/MiniMax-M2.5`	-	-	View
qwen	Qwen3.5-397B-A17B	`Qwen/Qwen3.5-397B-A17B`	-	-	View
Z.ai	GLM-5	`zai-org/GLM-5`	-	-	View
qwen	Qwen3.5-27B	`Qwen/Qwen3.5-27B`	-	-	View
StepFun	Step-3.5-Flash	`stepfun-ai/Step-3.5-Flash`	-	-	View
qwen	Qwen3.5 9B	`Qwen/Qwen3.5-9B`	-	-	View
qwen	Qwen3.5 9B	`unsloth/Qwen3.5-9B`	-	-	View
Moonshot AI	Kimi K2.5	`moonshotai/Kimi-K2.5`	-	-	View
qwen	Qwen3.5 4B	`Qwen/Qwen3.5-4B`	-	-	View
qwen	Qwen3.5 4B	`unsloth/Qwen3.5-4B`	-	-	View
Z.ai	GLM-4.7	`zai-org/GLM-4.7`	-	-	View
Xiaomi	MiMo-V2-Flash	`XiaomiMiMo/MiMo-V2-Flash`	-	-	View
Minimax	MiniMax M2.1	`MiniMaxAI/MiniMax-M2.1`	-	-	View
google	Gemma 4 E4B	`google/gemma-4-E4B-it`	-	-	View
google	Gemma 4 E4B	`unsloth/gemma-4-E4B-it`	-	-	View
DeepSeek	DeepSeek-V3.2	`deepseek-ai/DeepSeek-V3.2`	-	-	View
qwen	Qwen3 Embedding 8B	`Qwen/Qwen3-Embedding-8B`	-	-	View
Z.ai	GLM-4.7-Flash	`zai-org/GLM-4.7-Flash`	-	-	View
Minimax	MiniMax M2	`MiniMaxAI/MiniMax-M2`	-	-	View
Z.ai	GLM-4.6	`zai-org/GLM-4.6`	-	-	View
qwen	Qwen3 VL 235B A22B	`Qwen/Qwen3-VL-235B-A22B-Thinking`	-	-	View
qwen	Qwen3.5 2B	`Qwen/Qwen3.5-2B`	-	-	View
qwen	Qwen3.5 2B	`unsloth/Qwen3.5-2B`	-	-	View
google	Gemma 4 E2B	`google/gemma-4-E2B-it`	-	-	View
google	Gemma 4 E2B	`unsloth/gemma-4-E2B-it`	-	-	View
qwen	Qwen3 VL 32B Thinking	`Qwen/Qwen3-VL-32B-Thinking`	-	-	View
qwen	Qwen3 VL 8B Thinking	`Qwen/Qwen3-VL-8B-Thinking`	-	-	View
qwen	Qwen3 VL 4B Instruct	`Qwen/Qwen3-VL-4B-Instruct`	-	-	View
qwen	Qwen3 VL 30B A3B Instruct	`Qwen/Qwen3-VL-30B-A3B-Instruct`	-	-	View
qwen	Qwen3 VL 8B	`Qwen/Qwen3-VL-8B-Instruct`	-	-	View
qwen	Qwen3 VL 32B	`Qwen/Qwen3-VL-32B-Instruct`	-	-	View
qwen	Qwen3 VL 4B Thinking	`Qwen/Qwen3-VL-4B-Thinking`	-	-	View
OpenAI	GPT OSS 120B	`openai/gpt-oss-120b`	-	-	View
Moonshot AI	Kimi K2-Instruct-0905	`moonshotai/Kimi-K2-Instruct-0905`	-	-	View
qwen	Qwen3-235B-A22B-Thinking-2507	`Qwen/Qwen3-235B-A22B-Thinking-2507`	-	-	View
qwen	Qwen3-Next-80B-A3B-Instruct	`Qwen/Qwen3-Next-80B-A3B-Instruct`	-	-	View
OpenAI	GPT OSS 20B	`openai/gpt-oss-20b`	-	-	View
OpenAI	GPT OSS 20B	`unsloth/gpt-oss-20b`	-	-	View
Moonshot AI	Kimi K2 Instruct	`moonshotai/Kimi-K2-Instruct`	-	-	View
Mistral	Devstral Small 1.1	`mistralai/Devstral-Small-2507`	-	-	View
DeepSeek	DeepSeek-R1-0528	`deepseek-ai/DeepSeek-R1-0528`	-	-	View
qwen	Qwen3 32B	`Qwen/Qwen3-32B`	-	-	View
microsoft	Phi 4 Reasoning Plus	`microsoft/Phi-4-reasoning-plus`	-	-	View
Mistral	Magistral Small 2506	`mistralai/Magistral-Small-2506`	-	-	View
microsoft	Phi 4 Reasoning	`microsoft/Phi-4-reasoning`	-	-	View
qwen	Qwen3 235B A22B	`Qwen/Qwen3-235B-A22B`	-	-	View
Nvidia	Llama-3.3 Nemotron Super 49B v1	`nvidia/Llama-3_3-Nemotron-Super-49B-v1`	-	-	View
qwen	Qwen3-Coder 480B A35B Instruct	`Qwen/Qwen3-Coder-480B-A35B-Instruct`	-	-	View
microsoft	Phi 4 Mini Reasoning	`microsoft/Phi-4-mini-reasoning`	-	-	View
DeepSeek	DeepSeek-V3 0324	`deepseek-ai/DeepSeek-V3-0324`	-	-	View
Mistral	Mistral Small 3.2 24B Instruct	`mistralai/Mistral-Small-3.2-24B-Instruct-2506`	-	-	View
Mistral	Mistral Small 3.2 24B Instruct	`unsloth/Mistral-Small-3.2-24B-Instruct-2506`	-	-	View
Nvidia	Llama 3.1 Nemotron Nano 8B V1	`nvidia/Llama-3.1-Nemotron-Nano-8B-v1`	-	-	View
google	Gemma 3 27B	`google/gemma-3-27b-it`	-	-	View
google	Gemma 3 27B	`unsloth/gemma-3-27b-it`	-	-	View
DeepSeek	DeepSeek-V3.1	`deepseek-ai/DeepSeek-V3.1`	-	-	View
qwen	QwQ-32B	`Qwen/QwQ-32B`	-	-	View
DeepSeek	DeepSeek R1 Distill Llama 70B	`deepseek-ai/DeepSeek-R1-Distill-Llama-70B`	-	-	View
Mistral	Mistral Small 3.1 24B Instruct	`mistralai/Mistral-Small-3.1-24B-Instruct-2503`	-	-	View
DeepSeek	DeepSeek R1 Distill Qwen 32B	`deepseek-ai/DeepSeek-R1-Distill-Qwen-32B`	-	-	View
DeepSeek	DeepSeek R1 Distill Qwen 14B	`deepseek-ai/DeepSeek-R1-Distill-Qwen-14B`	-	-	View
Mistral	Mistral Small 3.1 24B Base	`mistralai/Mistral-Small-3.1-24B-Base-2503`	-	-	View
google	Gemma 3 12B	`google/gemma-3-12b-it`	-	-	View
google	Gemma 3 12B	`unsloth/gemma-3-12b-it`	-	-	View
DeepSeek	DeepSeek R1 Distill Qwen 7B	`deepseek-ai/DeepSeek-R1-Distill-Qwen-7B`	-	-	View
qwen	QwQ-32B-Preview	`Qwen/QwQ-32B-Preview`	-	-	View
DeepSeek	DeepSeek R1 Distill Llama 8B	`deepseek-ai/DeepSeek-R1-Distill-Llama-8B`	-	-	View
DeepSeek	DeepSeek R1 Distill Llama 8B	`unsloth/DeepSeek-R1-Distill-Llama-8B`	-	-	View
microsoft	Phi 4	`microsoft/phi-4`	-	-	View
Mistral	Mistral Small 3 24B Instruct	`mistralai/Mistral-Small-24B-Instruct-2501`	-	-	View
google	Gemma 3 4B	`google/gemma-3-4b-it`	-	-	View
google	Gemma 3 4B	`unsloth/gemma-3-4b-it`	-	-	View
Meta	Llama 3.3 70B Instruct	`meta-llama/Llama-3.3-70B-Instruct`	-	-	View
Meta	Llama 3.3 70B Instruct	`unsloth/Llama-3.3-70B-Instruct`	-	-	View
google	Gemma 3 1B	`google/gemma-3-1b-it`	-	-	View
google	Gemma 3 1B	`unsloth/gemma-3-1b-it`	-	-	View
Mistral	Mistral Small 3 24B Base	`mistralai/Mistral-Small-24B-Base-2501`	-	-	View
qwen	Qwen2.5 Instruct 32B	`Qwen/Qwen2.5-32B-Instruct`	-	-	View
Meta	Llama 3.1 70B Instruct	`meta-llama/Meta-Llama-3.1-70B-Instruct`	-	-	View
Meta	Llama 3.1 70B Instruct	`meta-llama/Llama-3.1-70B-Instruct`	-	-	View
qwen	Qwen2.5 72B Instruct	`Qwen/Qwen2.5-72B-Instruct`	-	-	View
DeepSeek	DeepSeek R1 Distill Qwen 1.5B	`deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B`	-	-	View
Meta	Llama 3.1 8B Instruct	`meta-llama/Meta-Llama-3.1-8B-Instruct`	-	-	View
Meta	Llama 3.1 8B Instruct	`meta-llama/Llama-3.1-8B-Instruct`	-	-	View
Meta	Llama 3.1 8B Instruct	`unsloth/Llama-3.1-8B-Instruct`	-	-	View
Meta	Llama 3.1 8B Instruct	`NousResearch/Meta-Llama-3.1-8B-Instruct`	-	-	View
Meta	Llama 3.1 8B Instruct	`unsloth/Meta-Llama-3.1-8B-Instruct`	-	-	View
qwen	Qwen2.5 14B Instruct	`Qwen/Qwen2.5-14B-Instruct`	-	-	View
qwen	Qwen2.5 14B Instruct	`unsloth/Qwen2.5-14B-Instruct`	-	-	View
Meta	Llama 3.2 3B Instruct	`meta-llama/Llama-3.2-3B-Instruct`	-	-	View
Meta	Llama 3.2 3B Instruct	`unsloth/Llama-3.2-3B-Instruct`	-	-	View
qwen	Qwen2.5 7B Instruct	`Qwen/Qwen2.5-7B-Instruct`	-	-	View
qwen	Qwen2.5 7B Instruct	`unsloth/Qwen2.5-7B-Instruct`	-	-	View
qwen	Qwen2 72B Instruct	`Qwen/Qwen2-72B-Instruct`	-	-	View
microsoft	Phi-3.5-MoE-instruct	`microsoft/Phi-3.5-MoE-instruct`	-	-	View
microsoft	Phi-3.5-mini-instruct	`microsoft/Phi-3.5-mini-instruct`	-	-	View
microsoft	Phi-3.5-mini-instruct	`unsloth/Phi-3.5-mini-instruct`	-	-	View
qwen	Qwen2 7B Instruct	`Qwen/Qwen2-7B-Instruct`	-	-	View
DeepSeek	DeepSeek-V4-Flash-0731	`deepseek-ai/DeepSeek-V4-Flash-0731`	-	-	View
Poolside	Poolside: Laguna S 2.1	`poolside/Laguna-S-2.1`	-	-	View
Nex AGI	Nex AGI: Nex-N2-Mini	`nex-agi/Nex-N2-mini`	-	-	View
StepFun	Step 3.7 Flash	`stepfun-ai/Step-3.7-Flash`	-	-	View
google	Gemma 4 31B	`google/gemma-4-31B`	-	-	View
qwen	Qwen3 Coder Next	`Qwen/Qwen3-Coder-Next`	-	-	View
Liquid AI	LFM2.5-1.2B-Instruct	`LiquidAI/LFM2.5-1.2B-Instruct`	-	-	View
Liquid AI	LFM2.5-1.2B-Thinking	`LiquidAI/LFM2.5-1.2B-Thinking`	-	-	View
Moonshot AI	Kimi K2 Thinking	`moonshotai/Kimi-K2-Thinking`	-	-	View
OpenAI	gpt-oss-safeguard-20b	`openai/gpt-oss-safeguard-20b`	-	-	View
qwen	Qwen3 Embedding 4B	`Qwen/Qwen3-Embedding-4B`	-	-	View
Nvidia	Phi-4-Mini	`microsoft/Phi-4-mini-instruct`	-	-	View
Nvidia	Llama 3.3 Nemotron Super 49b V1.5	`nvidia/Llama-3_3-Nemotron-Super-49B-v1_5`	-	-	View
Baidu	ERNIE 4.5 21B A3B Thinking	`baidu/ERNIE-4.5-21B-A3B-Thinking`	-	-	View
DeepSeek	DeepSeek V3.1 Terminus	`deepseek-ai/DeepSeek-V3.1-Terminus`	-	-	View
Nebius	Hermes 4 - Llama-3.1 70B	`NousResearch/Hermes-4-70B`	-	-	View
Alibaba	Qwen3 Coder 30B A3B Instruct	`Qwen/Qwen3-Coder-30B-A3B-Instruct`	-	-	View
Alibaba	qwen3-30b-a3b-instruct-2507	`Qwen/Qwen3-30B-A3B-Instruct-2507`	-	-	View
ByteDance Seed	UI-TARS 7B	`ByteDance-Seed/UI-TARS-1.5-7B`	-	-	View
Mistral	Devstral Small	`mistralai/Devstral-Small-2505`	-	-	View
google	MedGemma 4B IT	`google/medgemma-4b-it`	-	-	View
NousResearch	DeepHermes 3 - Mistral 24B Preview	`NousResearch/DeepHermes-3-Mistral-24B-Preview`	-	-	View
qwen	Qwen3 4B	`Qwen/Qwen3-4B`	-	-	View
qwen	Qwen3 4B	`unsloth/Qwen3-4B`	-	-	View
Alibaba	Qwen3 14B	`Qwen/Qwen3-14B`	-	-	View
Alibaba	Qwen3 14B	`OpenPipe/Qwen3-14B-Instruct`	-	-	View
Alibaba	Qwen3 8B	`Qwen/Qwen3-8B`	-	-	View
Alibaba	Qwen3 8B	`unsloth/Qwen3-8B`	-	-	View
IBM	Granite 3.3 8B Instruct	`ibm-granite/granite-3.3-8b-instruct`	-	-	View
Alibaba	qwen2.5-vl-32b-instruct	`Qwen/Qwen2.5-VL-32B-Instruct`	-	-	View
Groq	Llama Guard 3 8B	`meta-llama/Llama-Guard-3-8B`	-	-	View
Alibaba	qwen2.5-vl-72b-instruct	`Qwen/Qwen2.5-VL-72B-Instruct`	-	-	View
Meta	Llama-3.3-8B-Instruct	`allura-forge/Llama-3.3-8B-Instruct`	-	-	View
Meta	llama-3.2-1b-instruct	`meta-llama/Llama-3.2-1B-Instruct`	-	-	View
Meta	llama-3.2-1b-instruct	`unsloth/Llama-3.2-1B-Instruct`	-	-	View
qwen	Qwen2.5 Coder Instruct 7B	`Qwen/Qwen2.5-Coder-7B-Instruct`	-	-	View
qwen	Qwen2.5-Coder 32B Instruct	`Qwen/Qwen2.5-Coder-32B-Instruct`	-	-	View
Alibaba	Qwen2.5-VL 7B Instruct	`Qwen/Qwen2.5-VL-7B-Instruct`	-	-	View
microsoft	Phi-3.5-vision-instruct	`microsoft/Phi-3.5-vision-instruct`	-	-	View
NousResearch	Hermes 3 - Llama-3.1 70B	`NousResearch/Hermes-3-Llama-3.1-70B`	-	-	View
Mistral	Mistral NeMo Instruct	`mistralai/Mistral-Nemo-Instruct-2407`	-	-	View
google	Gemma 2 27B	`google/gemma-2-27b-it`	-	-	View
google	Gemma 2 9B	`google/gemma-2-9b-it`	-	-	View
NousResearch	Hermes 2 Pro - Llama-3 8B	`NousResearch/Hermes-2-Pro-Llama-3-8B`	-	-	View
Mistral	Mistral 7B Instruct v0.3	`mistralai/Mistral-7B-Instruct-v0.3`	-	-	View
microsoft	phi-3-medium-4k-instruct	`microsoft/Phi-3-medium-4k-instruct`	-	-	View
microsoft	phi-3-mini-128k-instruct	`microsoft/Phi-3-mini-128k-instruct`	-	-	View
microsoft	phi-3-mini-4k-instruct	`microsoft/Phi-3-mini-4k-instruct`	-	-	View
microsoft	phi-3-mini-4k-instruct	`unsloth/Phi-3-mini-4k-instruct`	-	-	View
Meta	llama-3-70b-instruct	`meta-llama/Meta-Llama-3-70B-Instruct`	-	-	View
Meta	llama-3-8b-instruct	`meta-llama/Meta-Llama-3-8B-Instruct`	-	-	View
Meta	llama-3-8b-instruct	`NousResearch/Meta-Llama-3-8B-Instruct`	-	-	View
Meta	llama-3-8b-instruct	`unsloth/llama-3-8b-Instruct`	-	-	View
microsoft	WizardLM-2 8x22B	`alpindale/WizardLM-2-8x22B`	-	-	View
Mistral	mistral-7b-instruct-v0.2	`mistralai/Mistral-7B-Instruct-v0.2`	-	-	View
Mistral	Mistral 7B Instruct v0.1	`mistralai/Mistral-7B-Instruct-v0.1`	-	-	View
DeepSeek	DeepSeek R1 0528 Qwen3 8B	`deepseek-ai/DeepSeek-R1-0528-Qwen3-8B`	-	-	View
Friendli	EXAONE 4.0.1 32B	`LGAI-EXAONE/EXAONE-4.0.1-32B`	-	-	View
google	Gemma 3 270M	`google/gemma-3-270m`	-	-	View
google	Gemma 4 12B	`unsloth/gemma-4-12b`	-	-	View
google	Gemma 4 26B A4B	`google/gemma-4-26B-A4B`	-	-	View
google	Gemma 4 E2B	`google/gemma-4-E2B`	-	-	View
google	Gemma 4 E4B	`google/gemma-4-E4B`	-	-	View
google	gemma-1.1-2b-it	`google/gemma-1.1-2b-it`	-	-	View
google	gemma-2-2b-it	`google/gemma-2-2b-it`	-	-	View
google	gemma-2-2b-it	`unsloth/gemma-2-2b-it`	-	-	View
google	gemma-2b-it	`google/gemma-2b-it`	-	-	View
google	gemma-7b-it	`google/gemma-7b-it`	-	-	View
IBM	granite-3.0-2b-instruct	`ibm-granite/granite-3.0-2b-instruct`	-	-	View
IBM	granite-3.0-8b-instruct	`ibm-granite/granite-3.0-8b-instruct`	-	-	View
IBM	granite-3.1-2b-instruct	`ibm-granite/granite-3.1-2b-instruct`	-	-	View
IBM	granite-3.1-8b-instruct	`ibm-granite/granite-3.1-8b-instruct`	-	-	View
Moonshot AI	Kimi Linear 48B A3B Instruct	`moonshotai/Kimi-Linear-48B-A3B-Instruct`	-	-	View
Liquid AI	LFM2 1.2B	`LiquidAI/LFM2-1.2B`	-	-	View
Meta	Llama 3 Instruct 70B	`meta-llama/Meta-Llama-3-70B`	-	-	View
Meta	Llama 3 Instruct 8B	`meta-llama/Meta-Llama-3-8B`	-	-	View
Meta	Llama 3 Instruct 8B	`NousResearch/Meta-Llama-3-8B`	-	-	View
Meta	Llama 3 Instruct 8B	`unsloth/llama-3-8b`	-	-	View
Cerebras	Llama 3.1 8B	`meta-llama/Llama-3.1-8B`	-	-	View
Meta	llama-13b	`huggyllama/llama-13b`	-	-	View
Nvidia	Llama-3.1-Nemotron-70B-Instruct-HF	`nvidia/Llama-3.1-Nemotron-70B-Instruct-HF`	-	-	View
Allen Institute for AI	llama-3.1-tulu-3-8b	`allenai/Llama-3.1-Tulu-3-8B`	-	-	View
Nvidia	Llama3 Chatqa 1.5 70b	`nvidia/Llama3-ChatQA-1.5-70B`	-	-	View
Mistral	Magistral Small 1.2	`mistralai/Magistral-Small-2509`	-	-	View
SiliconFlow	moonshotai/Kimi-Dev-72B	`moonshotai/Kimi-Dev-72B`	-	-	View
Nvidia	Nemotron Cascade 2 30B A3B	`nvidia/Nemotron-Cascade-2-30B-A3B`	-	-	View
NousResearch	Nous: DeepHermes 3 Llama 3 8B Preview (free)	`NousResearch/DeepHermes-3-Llama-3-8B-Preview`	-	-	View
Nvidia	nvidia-nemotron-3-nano-30b-a3b-bf16	`nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16`	-	-	View
NousResearch	openhermes-2.5-mistral-7b	`teknium/OpenHermes-2.5-Mistral-7B`	-	-	View
microsoft	phi-3-vision-128k-instruct	`microsoft/Phi-3-vision-128k-instruct`	-	-	View
qwen	Qwen 1.5 7B Chat	`Qwen/Qwen-7B-Chat`	-	-	View
Alibaba	qwen1.5-14b-chat	`Qwen/Qwen1.5-14B-Chat`	-	-	View
Alibaba	qwen1.5-32b-chat	`Qwen/Qwen1.5-32B-Chat`	-	-	View
Alibaba	qwen1.5-4b-chat	`Qwen/Qwen1.5-4B-Chat`	-	-	View
Alibaba	qwen1.5-72b-chat	`Qwen/Qwen1.5-72B-Chat`	-	-	View
Alibaba	qwen1.5-7b-chat	`Qwen/Qwen1.5-7B-Chat`	-	-	View
qwen	Qwen2.5 VL 3B Instruct	`Qwen/Qwen2.5-VL-3B-Instruct`	-	-	View
qwen	Qwen3 0.6B	`Qwen/Qwen3-0.6B`	-	-	View
qwen	Qwen3 0.6B	`unsloth/Qwen3-0.6B`	-	-	View
qwen	Qwen3 1.7B	`Qwen/Qwen3-1.7B`	-	-	View
qwen	Qwen3 1.7B	`unsloth/Qwen3-1.7B`	-	-	View
qwen	Qwen3 Embedding 0.6B	`Qwen/Qwen3-Embedding-0.6B`	-	-	View
Upstage	SOLAR-10.7B-Instruct-v1.0	`upstage/SOLAR-10.7B-Instruct-v1.0`	-	-	View
SiliconFlow	THUDM/GLM-4-32B-0414	`zai-org/GLM-4-32B-0414`	-	-	View
SiliconFlow	THUDM/GLM-4-9B-0414	`zai-org/GLM-4-9B-0414`	-	-	View
SiliconFlow	THUDM/GLM-Z1-32B-0414	`zai-org/GLM-Z1-32B-0414`	-	-	View
SiliconFlow	THUDM/GLM-Z1-9B-0414	`zai-org/GLM-Z1-9B-0414`	-	-	View
microsoft	WizardLM-2 7B	`dreamgen/WizardLM-2-7B`	-	-	View

Back to Providers Visit Website