Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
deepseek-ai
/
DeepSeek-V3.1
like
796
Follow
DeepSeek
104k
Text Generation
Transformers
Safetensors
deepseek_v3
conversational
custom_code
text-generation-inference
fp8
arxiv:
2412.19437
License:
mit
Model card
Files
Files and versions
xet
Community
45
Train
Deploy
Use this model
Distill the model into 4 variants: 600M, 1.5B, 4B and 7B
#48
by
ShreyashSarkarDev
- opened
21 days ago
base:
refs/heads/main
←
from:
refs/pr/48
Discussion
Files changed
+0
-0
This PR is in
draft mode
Files changed (0)
hide
show