DeepSeek Prover V2 is a 671B-parameter model, speculated to be geared towards logic and mathematics. Likely an upgrade from DeepSeek-Prover-V1.5.
Hangzhou DeepSeek Artificial Intelligence Co., Ltd. • Hangzhou, Zhejiang, China
DeepSeek is a Chinese AI company founded in July 2023 as a spin-off of the hedge fund High-Flyer's AGI research lab. It focuses on research and algorithmic innovation rather than immediate commercialization, aiming to make large AI models efficient and cost-effective. The company gained global attention for releasing a powerful open-source AI assistant that rivals ChatGPT on tasks like mathematics and coding at a fraction of the computational cost. Headquartered in Hangzhou, Zhejiang, China, and backed primarily by High-Flyer, DeepSeek has also drawn international scrutiny over the security and geopolitical implications of its models.
Models by Hangzhou DeepSeek Artificial Intelligence Co., Ltd.
DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction-following and coding abilities of the previous version...
DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team....
DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates...
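The V3.1 card describes one set of weights serving two inference modes. Below is a minimal sketch of toggling between them through an OpenAI-compatible client; the base URL and the `deepseek-chat` / `deepseek-reasoner` slugs are assumptions modeled on DeepSeek's public API, so substitute your provider's values.

```python
# Sketch only: mode switching on DeepSeek-V3.1 via an OpenAI-compatible API.
# base_url and model slugs are assumptions; check your provider's docs.
from openai import OpenAI

client = OpenAI(base_url="https://api.deepseek.com", api_key="YOUR_API_KEY")
messages = [{"role": "user", "content": "Is 997 prime?"}]

# Non-thinking mode: the model answers directly (assumed slug "deepseek-chat").
direct = client.chat.completions.create(model="deepseek-chat", messages=messages)
print(direct.choices[0].message.content)

# Thinking mode: the same weights reason step by step before answering
# (assumed slug "deepseek-reasoner").
thinking = client.chat.completions.create(model="deepseek-reasoner", messages=messages)
print(thinking.choices[0].message.content)
```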
This is a base model, trained only for raw next-token prediction. Unlike instruct/chat models, it has not been fine-tuned to follow user instructions...
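Because a base model has no chat template, you prompt it with raw text and let it continue. A minimal sketch with Hugging Face `transformers` follows; the checkpoint name is a placeholder (the full 671B base cannot be loaded this way), so swap in any small base model to experiment.

```python
# Sketch only: raw next-token completion with a base (non-instruct) model.
# The checkpoint below is a placeholder; use a small base model locally.
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "deepseek-ai/DeepSeek-V3.1-Base"  # placeholder checkpoint name
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)

# No system/user roles: the prompt is plain text to be continued.
inputs = tok("The derivative of x^2 is", return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=16, do_sample=False)
print(tok.decode(out[0], skip_special_tokens=True))
```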
DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while...
DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures...
DeepSeek-R1-0528 is a lightly upgraded release of DeepSeek R1 that taps more compute and smarter post-training tricks, pushing its reasoning...
DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters...
May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1). Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens...
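Since the R1 cards highlight fully open reasoning tokens, here is a minimal sketch of reading them back from a response. The field name varies by provider (DeepSeek's own API exposes `reasoning_content`; some gateways use `reasoning`), so treat the attribute access below as an assumption.

```python
# Sketch only: separating R1's open reasoning tokens from its final answer.
# Field names vary by provider; `reasoning_content` follows DeepSeek's API.
from openai import OpenAI

client = OpenAI(base_url="https://api.deepseek.com", api_key="YOUR_API_KEY")

resp = client.chat.completions.create(
    model="deepseek-reasoner",  # assumed slug for R1
    messages=[{"role": "user", "content": "What is 17 * 24?"}],
)
msg = resp.choices[0].message
print("reasoning:", getattr(msg, "reasoning_content", None))  # chain of thought
print("answer:", msg.content)
```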
DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from DeepSeek R1...
DeepSeek R1 Distill Llama 8B is a distilled large language model based on [Llama-3.1-8B-Instruct](/meta-llama/llama-3.1-8b-instruct), using outputs from DeepSeek R1...
DeepSeek R1 Distill Qwen 14B is a distilled large language model based on [Qwen 2.5 14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B), using outputs from DeepSeek R1...
DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from DeepSeek R1...
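The distilled models above share one recipe: fine-tune a smaller student on outputs sampled from DeepSeek R1. A minimal sketch of that sequence-level distillation loop follows; the student checkpoint and the single hard-coded trace are placeholders, not the actual training data or hyperparameters.

```python
# Sketch only: sequence-level distillation -- fine-tune a student on
# teacher-generated traces with standard causal-LM cross-entropy.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

student_name = "Qwen/Qwen2.5-1.5B"  # placeholder student checkpoint
tok = AutoTokenizer.from_pretrained(student_name)
student = AutoModelForCausalLM.from_pretrained(student_name)
student.train()

# In practice these traces are sampled from the teacher (R1) at scale.
teacher_traces = [
    "Q: What is 2 + 2?\nA: <think>Two plus two is four.</think> 4",
]

opt = torch.optim.AdamW(student.parameters(), lr=1e-5)
for text in teacher_traces:
    batch = tok(text, return_tensors="pt")
    # labels = input_ids gives next-token cross-entropy on the whole trace
    loss = student(**batch, labels=batch["input_ids"]).loss
    loss.backward()
    opt.step()
    opt.zero_grad()
```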