This is a collection of Llama and Qwen-based models ranging from 1.5B to 70B parameters with are distilled from DeepSeek's new R1 models.
			
	
	- 
	
	
	
				deepseek-ai/DeepSeek-R1-Distill-Llama-8B
Text Generation • 8B • Updated • 633k • • 818 - 
	
	
	
				deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Text Generation • 71B • Updated • 53k • • 728 - 
	
	
	
				deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation • 2B • Updated • 987k • • 1.37k - 
	
	
	
				deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Text Generation • 8B • Updated • 687k • • 740