DeepSeek: DeepSeek V3 Base

deepseek/deepseek-v3-base

Created Mar 29, 2025 · 131,072 context

Note that this is a base model, mostly intended for testing; you need to provide detailed prompts for the model to return useful responses.
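As a rough illustration of what "detailed prompts" means for a base (non-instruct) model, the sketch below builds a few-shot completion-style prompt. The task and examples are illustrative, not taken from the listing.

```python
# A minimal sketch of a detailed prompt for a base model: give it context
# and a few worked examples, then let it continue the pattern. The task
# (English-to-French translation) is purely illustrative.
prompt = (
    "Translate English to French.\n\n"
    "English: The weather is nice today.\n"
    "French: Il fait beau aujourd'hui.\n\n"
    "English: Where is the nearest train station?\n"
    "French: Où est la gare la plus proche ?\n\n"
    "English: I would like a cup of coffee.\n"
    "French:"
)
print(prompt)
```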

DeepSeek-V3 Base is a 671B parameter open Mixture-of-Experts (MoE) language model with 37B active parameters per forward pass and a context length of 128K tokens. Trained on 14.8T tokens using FP8 mixed precision, it achieves high training efficiency and stability, with strong performance across language, reasoning, math, and coding tasks.

DeepSeek-V3 Base is the pre-trained model behind DeepSeek V3.

Providers for DeepSeek V3 Base

OpenRouter routes requests to the best providers that are able to handle your prompt size and parameters, with fallbacks to maximize uptime.
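For reference, a minimal sketch of sending a request to this model through OpenRouter's OpenAI-compatible chat completions endpoint is shown below; provider routing and fallbacks are handled server-side, so the client only names the model. The API key placeholder and the prompt are assumptions for illustration.

```python
# A minimal sketch of calling deepseek/deepseek-v3-base via OpenRouter.
# Replace <OPENROUTER_API_KEY> with your own key.
import requests

response = requests.post(
    "https://openrouter.ai/api/v1/chat/completions",
    headers={"Authorization": "Bearer <OPENROUTER_API_KEY>"},
    json={
        "model": "deepseek/deepseek-v3-base",
        "messages": [
            {
                "role": "user",
                "content": (
                    "Translate English to French.\n\n"
                    "English: I would like a cup of coffee.\nFrench:"
                ),
            }
        ],
        "max_tokens": 64,
    },
    timeout=60,
)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```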
