DeepSeek V3.1 Base

DeepSeek • deepseek-v3-1-base

Model Information

Slug	deepseek-v3-1-base

Organization

Name	DeepSeek
Website	https://www.deepseek.com/

Model Description

This is a base model, trained only for next-token prediction. Unlike instruct or chat models, it has not been fine-tuned to follow user instructions. Prompts should therefore read like the beginning of a document for the model to continue, such as training-style text or a few worked examples, rather than a bare request (e.g., “Translate the following sentence…” followed by the sentence itself, instead of just “Translate this”).
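As a minimal sketch of that prompting style, the snippet below formats a few worked examples as plain text and leaves the last one unfinished, so that the model's continuation is the answer. The function name `build_fewshot_prompt`, the `English:`/`French:` labels, and the example sentences are illustrative assumptions, not part of any DeepSeek or provider API; the snippet only builds a string and does not call a model.

```python
def build_fewshot_prompt(examples, query, src_label="English", tgt_label="French"):
    """Format worked examples as a document a base model can continue.

    Hypothetical helper for illustration; labels and layout are assumptions.
    """
    # Each completed example is a source line followed by its target line.
    blocks = [f"{src_label}: {src}\n{tgt_label}: {tgt}" for src, tgt in examples]
    # The final block leaves the target empty; the model is expected to fill it in.
    blocks.append(f"{src_label}: {query}\n{tgt_label}:")
    return "\n\n".join(blocks)


examples = [
    ("Good morning.", "Bonjour."),
    ("Thank you very much.", "Merci beaucoup."),
]
prompt = build_fewshot_prompt(examples, "Where is the train station?")
print(prompt)
```

When sending a prompt like this to a completion endpoint, it usually helps to cap the output length or stop at the first blank line, since a base model will otherwise tend to keep generating further example pairs.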

DeepSeek-V3.1 Base is an open Mixture-of-Experts (MoE) language model with 671B total parameters, of which 37B are active per forward pass, and a context length of 128K tokens. It was trained on 14.8T tokens using FP8 mixed precision for training efficiency and stability, and is reported to perform strongly across language, reasoning, math, and coding tasks.
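To put the parameter counts in perspective, the arithmetic below works out what share of the model is active per forward pass and gives a rough weight-storage figure. The storage estimates assume exactly 1 byte per parameter (8-bit weights) or 2 bytes per parameter (16-bit weights), and ignore the KV cache, activations, and any runtime overhead; they are back-of-the-envelope illustrations, not deployment requirements.

```python
TOTAL_PARAMS = 671_000_000_000   # 671B total parameters (from the description)
ACTIVE_PARAMS = 37_000_000_000   # 37B active parameters per forward pass

# Share of the parameters that take part in a single forward pass.
active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS
print(f"Active share per forward pass: {active_fraction:.1%}")  # 5.5%

# Rough weight storage, assuming a fixed number of bytes per parameter.
# These byte widths are assumptions for illustration only.
fp8_weight_gb = TOTAL_PARAMS * 1 / 1e9
bf16_weight_gb = TOTAL_PARAMS * 2 / 1e9
print(f"Weights at 1 byte/param: {fp8_weight_gb:.0f} GB")
print(f"Weights at 2 bytes/param: {bf16_weight_gb:.0f} GB")
```

Although only about one parameter in eighteen is active for a given forward pass, all experts still have to be stored, so the storage figure is driven by the 671B total rather than the 37B active count.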

Available at 1 Provider

Provider	Type	Model Name	Original Model	Input ($/1M tokens)	Output ($/1M tokens)	Free
Writingmate	Chat, Code	DeepSeek: DeepSeek V3.1 Base	`deepseek/deepseek-v3.1-base`	-	-	
